The CorporateTaxAnalysis project focuses on analyzing corporate tax statistics in Colombia, using data from the last 10 years provided by the DIAN (National Tax and Customs Office). This project involves working with unordered maps (hash tables), integrating concepts from linear data structures (lists, stacks, queues), search and sorting algorithms, and CSV data processing.
- Student No. 1 Ángel Farfán, Student No. 1 Uniandes Email a.farfana@uniandes.edu.co, Student No. 1 20222183.
- Student No. 2 Juan José Díaz, Student No. 2 Uniandes Email jj.diazo1@uniandes.edu.co, Student No. 2 202220657.
- Student No. 3 Name Andrés Cáceres, Student No. 3 Uniandes Email a.caceresg@uniandes.edu.co, Student No. 3 202214863.
In Colombia, corporate income taxes are levied on all revenues that contribute to an increase in a company's wealth. This project focuses on analyzing corporate tax data from DIAN over the past 10 years. Corporate taxes represent 78% of total tax revenue, with all companies, regardless of size, paying a flat tax rate of 35% on their profits.
The dataset for this challenge comes from DIAN's statistical data on corporate tax revenues over the last 10 years. It includes CSV files with 59 variables related to companies' income, costs, assets, and taxes. The data is provided in files ranging from small subsets to full datasets, allowing for testing and implementation at different scales.
This repository is part of the Data Structures and Algorithms (EDA) teaching framework at Universidad de los Andes. The repository was developed by faculty professors and staff in the Department of Systems and Computer Engineering (DISC) and uses the Non-Object-Oriented Python library DISCLib.
DISClib · DISClib Demo and Examples · Report Bug · Request Feature
IMPORTANT This is a work in progress and is part of a teaching framework for undergraduate college students at Universidad de los Andes. This project Is NOT intended as a full-functional source code project.
The challenge template has four main parts:
- DISClib Root folder with the official course library. For more on its implementation, visit the DISClib Repository.
- App Folder with the model-view-controller (MVC) Python scripts. In here, the students implement their code to complete the challenge.
- Data Folder with CSV data files to load into the application. Students must add the course-provided data files to complete the challenge.
- Docs Folder with reports, data tables, and other documentation. Students add their project report, data tables, and other documentation to complement their code implementation.
As a tax analyst, I want to identify the economic activity with the highest total tax liability (total balance due) for a specific economic sector and year.
As a tax analyst, I want to identify the economic activity with the largest tax credit (total balance in favor) for a specific economic sector and year.
As a tax analyst, I want to find the economic subsector with the lowest total withholdings for a specific year.
As a tax analyst, I want to identify the economic subsector with the highest payroll expenses for a specific year.
As a tax analyst, I want to identify the economic subsector with the highest tax deductions for a specific year.
As a tax analyst, I need to identify the economic sector with the highest total net income for a specific year.
Requirement 7: List the top N economic activities with the lowest total costs and expenses for a subsector (Group)
As a tax analyst, I need to list the top N economic activities with the lowest total costs and expenses for a subsector and a specific year.
As a tax analyst, I need to identify the top N economic activities, in each economic subsector, with the highest total tax liabilities for a specific year.
To use this template, you need to follow the steps below:
- Read the official project document published in the course official site at BrightSpace.
- Distribute the project functionalities and implementation responsibilities among the group members.
- Download the official dataset for the project at the course official site at BrightSpace.
- Unzip and load the dataset into the application at the Data folder.
- Import the necessary modules from DISClib into the MVC scripts at the App folder.
- Implement the missing functions according to the project needs in the MVC scripts at the App folder.
- Evaluate the implementation of the MVC scripts, record your tests and analysis in the documents at the Docs folder (The report MUST BE in PDF format).
For further information and contact, use the following links:
- Official Repository DISClib.
- Repository for Demo and Examples.
If you require further information, please contact us via this email.
Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
If you have a suggestion that would make this project better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement".
Don't forget to give the project a star! Thanks again!
- Fork the Project.
- Create your Feature Branch (
git checkout -b feature/AmazingFeature
). - Commit your Changes (
git commit -m 'Add some AmazingFeature'
). - Push to the Branch (
git push origin feature/AmazingFeature
). - Open a Pull Request.
Copyright 2020, Departamento de sistemas y Computación, Universidad de Los Andes. Developed for the class "ISIS1225 - Estructuras de Datos y Algoritmos" or "ISIS1225 - Data Structure and Algorithms" in English.
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more information.
- Dario Correal is the original author and main developer of the library.
- Santiago Arteaga is a contributor and repository administrator.
- Luis Florez is a contributor and developed examples and tutorials for the library.