Moved the feed data for the lookup tables from csv file into src/conf… #67

nargis-sultani · 2023-12-21T08:37:00Z

…ig file. Added custom tests for alembic migration

github-actions · 2023-12-21T08:37:56Z

Coverage report

File	Statements	Missing	Coverage	Coverage (new stmts)	Lines missing
src/config.py
Project Total

This report was generated by python-coverage-comment-action.

nargis-sultani · 2023-12-21T08:40:05Z

closes #66

jcadam14

Everything looks good to me, just had the one comment/question on the feeds/seed data in config.py

jcadam14 · 2023-12-21T16:57:58Z

src/config.py

I don't know if this was discussed already in the other PRs or with Le, but should the individual feed lists go into their corresponding migration scripts to keep the seed data with the creation of the table the data is going into? The config class seems to be specific to env configurations we pull from either env vars or .env files.

I can move it inside its individual scripts. That's the way I initially had it, but then I needed to access the same data for testing. So, I wanted to have them just in one place.

I can go ahead and move it inside the individual scripts.

Gotcha, thought it might be that because of the comment in there. It's great that the tests check that the expected seed data is in tables so hopefully the tests can access the lists in the migrations (which is a little annoying to import each especially since the module names are horrific) or at the worst copy the list to the test testing the table.

We should still have them inside the individual scripts for data integrity, considering the small data set. For testing purposes, you can test just a small sample of the data; or create test resources.
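A minimal sketch of the "test a small sample" suggestion, using only the standard library so it runs without the project. The `upgrade` function is a stand-in for a migration script, and the table layout (`code`, `name`) and the seed rows are assumptions for illustration, not the project's actual schema or data. The check hardcodes a couple of expected rows instead of importing the full list from the migration module.

```python
import sqlite3

# Hypothetical seed rows kept next to the code that creates the table,
# mirroring the idea of keeping seed data inside the migration script.
ADDRESS_STATE_SEED = [
    {"code": "AL", "name": "Alabama"},
    {"code": "AK", "name": "Alaska"},
    {"code": "AZ", "name": "Arizona"},
]


def upgrade(conn):
    """Stand-in for a migration's upgrade(): create the table and seed it."""
    conn.execute(
        "CREATE TABLE address_state (code TEXT PRIMARY KEY, name TEXT NOT NULL)"
    )
    conn.executemany(
        "INSERT INTO address_state (code, name) VALUES (:code, :name)",
        ADDRESS_STATE_SEED,
    )


def fetch_sample(conn, codes):
    """Return (code, name) rows for the requested codes, ordered by code."""
    placeholders = ", ".join("?" for _ in codes)
    query = (
        "SELECT code, name FROM address_state "
        f"WHERE code IN ({placeholders}) ORDER BY code"
    )
    return conn.execute(query, list(codes)).fetchall()


conn = sqlite3.connect(":memory:")
upgrade(conn)

# The test only knows about a small sample, not the whole seed list.
sample = fetch_sample(conn, ["AK", "AZ"])
row_count = conn.execute("SELECT COUNT(*) FROM address_state").fetchone()[0]
```

Checking a sample plus the row count keeps the test independent of the migration module names while still catching a missing or truncated seed.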

@lchen-2101 Got it, yeah makes more sense.

lchen-2101 · 2023-12-27T15:32:31Z

db_revisions/seed.py

@@ -0,0 +1,119 @@
+"""


We run into the same problem having this file rather than having the CSVs. These should live within the scripts themselves; the purpose of the versioned alembic scripts is that each time the upgrade is run, the same data is populated. If we decide to update the seed data and then run the script in a different environment, the two environments' data would become out of sync. Let's keep this data within each script.

I think if it were a big dataset, we should go the CSV route but version the file appropriately, e.g. address_state_7b6ff51002b5.csv. Since these are small, keeping them within the alembic script itself should be fine.
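For the larger-dataset case, the versioned-file naming could be sketched as below. This is only an illustration of the `<table>_<revision>.csv` convention from the comment above; the helper names `seed_file_name` and `load_seed_rows`, the CSV columns, and the sample rows are hypothetical and not part of the PR.

```python
import csv
import tempfile
from pathlib import Path


def seed_file_name(table_name, revision):
    """Build a file name that pins the seed data to one migration revision."""
    return f"{table_name}_{revision}.csv"


def load_seed_rows(directory, table_name, revision):
    """Read the revision-specific CSV and return its rows as dictionaries."""
    path = Path(directory) / seed_file_name(table_name, revision)
    with path.open(newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


# Demo with a temporary directory standing in for the seed-data folder.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = Path(tmp) / seed_file_name("address_state", "7b6ff51002b5")
    demo_path.write_text("code,name\nAL,Alabama\nAK,Alaska\n", encoding="utf-8")
    rows = load_seed_rows(tmp, "address_state", "7b6ff51002b5")
```

Because the revision id is part of the file name, a later change to the seed data would go into a new file for a new revision, and the older revision keeps loading exactly what it loaded before.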

lchen-2101 · 2023-12-27T15:42:28Z

db_revisions/versions/26a742d97ad9_seed_federal_regulator_table.py

-    data = get_feed_data_from_file("federal_regulator")
-
-    op.bulk_insert(FederalRegulatorDao.__table__, data)
+    op.bulk_insert(FederalRegulatorDao.__table__, federal_regulator_seed)


I think we talked about keeping the table names hardcoded? It's the same kind of data integrity concern: as we iterate, if we decided to switch the table for the dao, the new script would succeed in the old environment at creating the new table, but it would fail in the new environment, because the table would already exist after running the previous version's table creation script.
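One way to keep the table name hardcoded in a seed script is SQLAlchemy's lightweight `table()` / `column()` constructs, which describe the target table inside the script without importing the dao. The sketch below runs against an in-memory SQLite database and uses a plain `connection.execute` instead of alembic's `op.bulk_insert` so it stays self-contained. The column names (`id`, `name`) and the placeholder rows are assumptions, not the project's real schema or seed data.

```python
import sqlalchemy as sa

# The table name and columns are spelled out in the script itself, so a later
# change to the dao cannot alter what this revision touches.
federal_regulator = sa.table(
    "federal_regulator",
    sa.column("id", sa.String),
    sa.column("name", sa.String),
)

# Placeholder rows for illustration only.
PLACEHOLDER_ROWS = [
    {"id": "R1", "name": "Placeholder Regulator One"},
    {"id": "R2", "name": "Placeholder Regulator Two"},
]


def seed(connection):
    """Insert the placeholder rows using the hardcoded table description."""
    connection.execute(sa.insert(federal_regulator), PLACEHOLDER_ROWS)


engine = sa.create_engine("sqlite://")
with engine.begin() as conn:
    # Stand-in for the earlier revision that created the table.
    conn.execute(
        sa.text(
            "CREATE TABLE federal_regulator (id TEXT PRIMARY KEY, name TEXT NOT NULL)"
        )
    )
    seed(conn)
    names = (
        conn.execute(
            sa.select(federal_regulator.c.name).order_by(federal_regulator.c.id)
        )
        .scalars()
        .all()
    )
```

In an actual alembic script, the same `federal_regulator` construct could be passed to `op.bulk_insert` in place of `FederalRegulatorDao.__table__`.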

lchen-2101

Let's move the data from seed.py into the individual scripts, otherwise it creates the same problem as the csv files; where it becomes possible for two environments to have different data.

lchen-2101 · 2024-01-02T17:50:49Z

db_revisions/versions/7b6ff51002b5_seed_address_state_table.py

+branch_labels: Union[str, Sequence[str], None] = None
+depends_on: Union[str, Sequence[str], None] = None
+
+address_state_table = Base.metadata.tables.get("address_state")


If I'm not mistaken, this is still tied to the dao, correct? So if the dao has been updated to, say, "address_state_new" (hypothetical, do not actually do this), this would fail, correct?

Here's a way to decouple the two:

meta = MetaData()
meta.reflect(op.get_bind())
table = Table("address_state", meta)
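The reflection approach can be tried outside alembic with an in-memory SQLite database. This is a sketch under assumptions: the helper below is a hypothetical implementation (the thread later mentions a `get_table_by_name` in utils, but its code is not shown here), the column layout and rows are made up, and a plain connection stands in for `op.get_bind()`.

```python
from sqlalchemy import Column, MetaData, String, Table, create_engine, select


def get_table_by_name(connection, table_name):
    """Reflect one table from the database instead of importing the dao."""
    meta = MetaData()
    meta.reflect(bind=connection, only=[table_name])
    return meta.tables[table_name]


engine = create_engine("sqlite://")
with engine.begin() as conn:
    # Stand-in for the earlier revision that created the table.
    setup_meta = MetaData()
    Table(
        "address_state",
        setup_meta,
        Column("code", String, primary_key=True),
        Column("name", String, nullable=False),
    )
    setup_meta.create_all(conn)

    # The seeding step only knows the hardcoded table name.
    table = get_table_by_name(conn, "address_state")
    conn.execute(
        table.insert(),
        [{"code": "AL", "name": "Alabama"}, {"code": "AK", "name": "Alaska"}],
    )
    seeded = [
        tuple(row)
        for row in conn.execute(select(table).order_by(table.c.code))
    ]
```

Since the table definition comes from the database as it exists at that revision, renaming or reshaping the dao later would not change what this step inserts into.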

Looking into it.

lchen-2101 · 2024-01-03T19:09:10Z

db_revisions/versions/26a742d97ad9_seed_federal_regulator_table.py

+    meta = MetaData()
+    meta.reflect(op.get_bind())
+    table = Table("federal_regulator", meta)


Guessing you are in the process of replacing these with the get_table_by_name in utils? Looks good otherwise, so ready to merge once the last changes are implemented.

Done.
Thanks

lchen-2101

LGTM

Moved the feed data for the lookup tables from csv file into src/conf…

b570fd6

…ig file. Added custom tests for alembic migration

nargis-sultani requested review from lchen-2101, guffee23, hkeeler and jcadam14 December 21, 2023 08:37

nargis-sultani linked an issue Dec 21, 2023 that may be closed by this pull request

Add more custom tests for Alembic and add seed data in the scripts for the lookup tables #66

Closed

jcadam14 reviewed Dec 21, 2023

Nargis Sultani added 2 commits December 21, 2023 15:26

Address the comment by moving the seed data from config.py to db_revi…

f96e709

…sions/seed.py; renamed filenames and functions names that had 'feed' to 'seed'

Modified comments, changed feed to seed

f584799

jcadam14 approved these changes Dec 27, 2023

lchen-2101 reviewed Dec 27, 2023

lchen-2101 requested changes Dec 27, 2023

Addressed the comments

21052bb

lchen-2101 reviewed Jan 2, 2024

Addressed the comment

4fd0d11

lchen-2101 reviewed Jan 3, 2024

Addressed the comments

8ee8223

lchen-2101 approved these changes Jan 4, 2024

lchen-2101 merged commit 9004692 into main Jan 4, 2024
3 checks passed

lchen-2101 deleted the feature/66_add_more_alembic_custom_tests_and_add_seed_data_inside_script branch January 4, 2024 15:04
