Minimal ETL pipeline using a public API. Simulates a real-world data pipeline with validation-first design.
- Fetch: Grabs user data from the JSONPlaceholder API
- Clean: Drops nulls, normalizes email, extracts full name
- Snapshot: Logs daily KPIs (user count, unique emails)
- Python (`requests`, `pandas`)
- CSV-based persistence (file-system snapshot)
- Modular, script-based orchestration
pip install pandas requests
python main.py