⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Updated Dec 26, 2024 - Python
re_data - fix data issues before your users and CEO discover them 😊
A Python library for efficient feature ranking and selection on sparse data sets.
Load dbt artifacts uploaded to GCS into BigQuery to track historical dbt results
⚡ Prevent downstream data quality issues by integrating the Soda Library into your CI/CD pipeline.
DEPRECATED! Please move to bunnyxt/tdd-spider. The crawler for https://tdd.bunnyxt.com, written in Python.
A set of simple tools for monitoring data on the website of the Câmara dos Deputados (Brazil's Chamber of Deputies)
A Python-based web scraping project that automates fetching SEC 8-K cybersecurity incident filings from the EDGAR database and sends automated email alerts with detailed reports on material breaches or incidents reported by U.S. public companies.
Trending code, platforms, tools, and processes.
Real-time Network Data Monitoring System using RabbitMQ, InfluxDB, and Chronograf
An Apache Airflow pipeline that extracts JSON files from an AWS S3 bucket and inserts them into an AWS Redshift cluster.
An online monitor for Schottky data acquired during storage-ring experiments