This project demonstrates the construction of a fully functional Data Warehouse using SQL Server 2022 and Visual Studio 2022. It includes:
- Database Design: Creation of dimension tables and a fact table for a retail sales scenario.
- ETL Processes: Implementation of ETL pipelines to load and transform data from staging tables into the final schema.
- Data Generation: Python scripts to generate synthetic datasets for testing.
- KPI Analysis: SQL queries to calculate key metrics, such as total sales, top-selling products, and revenue by region.
The repository includes SQL scripts, Python scripts for data generation, and documentation on setting up and running the project.
- .gitignore: Configuration file to specify files and directories to be ignored by Git.
- DataWarehouse/: Main directory containing the SQL Server project files.
- bin/Debug/: Build output directory for debug configuration.
- DataWarehouse.dbmdl: Database model file.
- DataWarehouse.jfm: JSON file for project metadata.
- DataWarehouse.sln: Visual Studio solution file.
- DataWarehouse.sqlproj: SQL Server project file.
- DataWarehouse.sqlproj.user: User-specific project settings.
- Python Scripts/: Directory containing Python scripts for data generation.
- Results/: Directory containing the KPIs results.
- SQL Scripts/: Directory containing SQL scripts.
- Data/: Directory containing Tunisian-specific data.
- DataWarehouse Schema/: Directory containing schema diagrams.
- README.md: Project documentation file.