Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

(WIP) Add GX data validation for synapse data warehouse #42

Merged
merged 15 commits into from
Sep 16, 2024
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
13 changes: 13 additions & 0 deletions data_validation/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
# Great Expectations

Follow instructions here: https://docs.greatexpectations.io/docs/oss/tutorials/quickstart/ to get started.

pip install sqlalchemy python-dotenv snowflake-sqlalchemy boto3 botocore

1. Be sure to export and create the GX data source environmental variable. Follow these [steps](https://docs.greatexpectations.io/docs/oss/guides/connecting_to_your_data/fluent/database/connect_sql_source_data?sql-database-type=snowflake) to set it up:

```
export GX_SNOWFLAKE_CONNECTION="snowflake://<USER_NAME>:<PASSWORD>@<ACCOUNT_NAME_OR_LOCATOR>/<DATABASE_NAME>/<SCHEMA_NAME>?warehouse=<WAREHOUSE_NAME>&role=<ROLE_NAME>"
```

2. Run validation `python validation.py`. Be sure to log into jumpcloud (dpe aws account) and export your credentials in the terminal.
Loading