Reformate demographic_id and refactor disaggregate.py #25

AHsu98 · 2023-03-03T07:51:32Z

Pretty significant changes here, we should probably update pypi version after this and bump version.

Reformatted the io of how we use demographic_id, now you can optionally put in a list of columns, and we won't have to do the ugly tuple hack if we have a age x location x year slice
Removed stuff like split_dataframe_rate and added an extra argument to split_dataframe

The version in this branch is what I used to make the example from Lauryn's data.

Cleaned up high level splitting api to make the output type an argument rather than use separate functions

Fixed bug with argument order, got rid of split_dataframe_rate

Also updated example notebook for dataframe splitting to match current changes.

Minor fixes to docstrings, typing

zhengp0 · 2023-03-06T20:31:50Z

src/pydisagg/disaggregate.py

-                model=model,
-                observed_total_se=x['obs_se']
+    if demographic_id_columns is not None:
+        splitting_df['demographic_id']=list(


Hi @AHsu98, I don't think create 'demgraphic_id' column is necessary, especially you will delete it at the end of the function. Maybe I am missing something. If you can explain a little here that will be great!

Pandas set_index function can take multiple column to create the index, and reset_index will flatten out the hierarchical index.

Yeah, setting it as demographic id here was kind of a hack, I don't really like it. I set it as a tuple because otherwise I was getting errors around the index name not matching with the population df, I'll make an issue and try to fix it later!

AHsu98 added 5 commits March 2, 2023 21:03

Refactored to make the output type an argument

6c9e310

Cleaned up high level splitting api to make the output type an argument rather than use separate functions

cleaned up disaggregate a bit more

25b4607

Fixed bug with argument order, got rid of split_dataframe_rate

Updated indexing procedure for demographic_id

eecaf6b

Also updated example notebook for dataframe splitting to match current changes.

slight change in raise

1fc49b4

Fixed bug in concatenation axis

55b9cc0

Minor fixes to docstrings, typing

AHsu98 requested a review from zhengp0 March 3, 2023 07:51

zhengp0 reviewed Mar 6, 2023

View reviewed changes

zhengp0 approved these changes Mar 6, 2023

View reviewed changes

AHsu98 mentioned this pull request Mar 6, 2023

Stop creating a demographic_id column when unnecessary #26

Closed

AHsu98 merged commit 704f6ec into main Mar 6, 2023

AHsu98 deleted the reformat-demographic-id branch August 24, 2023 17:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reformate demographic_id and refactor disaggregate.py #25

Reformate demographic_id and refactor disaggregate.py #25

AHsu98 commented Mar 3, 2023 •

edited

Loading

zhengp0 Mar 6, 2023

AHsu98 Mar 6, 2023

Reformate demographic_id and refactor disaggregate.py #25

Reformate demographic_id and refactor disaggregate.py #25

Conversation

AHsu98 commented Mar 3, 2023 • edited Loading

zhengp0 Mar 6, 2023

Choose a reason for hiding this comment

AHsu98 Mar 6, 2023

Choose a reason for hiding this comment

AHsu98 commented Mar 3, 2023 •

edited

Loading