Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Transform #4

Merged
merged 5 commits into from
Sep 22, 2024
Merged

Transform #4

merged 5 commits into from
Sep 22, 2024

Conversation

alifradi
Copy link
Owner

@alifradi alifradi commented Sep 22, 2024

Description

Please include a summary of the changes and the related issue.

Checklist

  • are my functions and their arguments passed separately as JSON file and python script as follows:

JSON

{

"tests": [
    {
        "function_name": "fetch_data",
        "module_name": "data_fetcher",
        "args": {
            "url": "https://ckan0.cf.opendata.inter.prod-toronto.ca/api/3/action/package_show?id=toronto-signature-sites"
        }
    }
]

}

PYTHON

import json
import requests

def fetch_data(url):
return requests.get(url).json()

  • I have performed a self-review of my code.
  • I have commented my code, particularly in hard-to-understand areas.
  • I have made corresponding changes to the documentation.
  • My changes generate no new warnings.
  • I have added tests that prove my fix is effective or that my feature works.
  • New and existing unit tests pass locally with my changes.

General Project Architecture
Data Extraction Layer:
APIs to fetch financial data about countries and commodities.
Python scripts to handle API requests.
Data Transformation Layer:
Python scripts to transform raw data into structured tables.
SQL scripts to assist in data transformation.
Data Loading Layer:
SQL database to store transformed data.
Python scripts to load data into the database.
Automation and Orchestration:
Airflow DAGs to manage ETL workflows.
Automated functions for checkpoints and end-to-end testing.
CI/CD Pipeline:
GitHub Actions for continuous integration and deployment.
Docker to containerize services.
Maintenance:
Scripts to manage database size by removing old data.

alifradi added 5 commits September 19, 2024 21:02
@alifradi alifradi merged commit 93d0bbf into main Sep 22, 2024
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant