Separate core and dense state space. #310

tobiasraabe · 2019-12-30T13:08:27Z

Closes #237.

Current behavior

Currently, a part of the state space is simply duplicated by all values of another dimension. The first part comprises experiences and similar variables which are mutually exclusive. We call this the core state space. Types and observables duplicate the core state space which is why we call it the dense state space.

The duplication causes a lot of problems because it unnecessarily requires a lot of memory.

Desired behavior

Remove the duplication to save memory and try to exploit the division for better parallelization.

Solution / Implementation

Core changes

I started by extracting the core state space which is a DataFrame containing not only the core state space dimension but also all covariates which can be computed using solely information of the state space. This costs memory but saves some runtime as we frequently need this information.
There are two kinds of state spaces
- _SingleDimStateSpace is similar to the state space of KW94 and has no dense dimension.
- _MultiDimStateSpace comprises many of the former state spaces for each of the product of dense dimensions. The attribute state_space.sub_state_spaces is a dictionary where the key are tuples of the values of dense state space dimensions. For a model with four types, the keys are [(0,), (1,), (2,), (3,)]. The values of the keys are dictionaries which contain information on the specific covariates for this part of the state space. Because the dense dimensions are constant per sub state space, the covariates are also constant. The keys are the names and values are values.
Access and setting attributes to the state space works via get_attribute, set_attribute and the accessor for data in one period.
There exist a decorator called parallelize_across_dense_dimensions which can be applied to functions whose calculations have no side-effects to other dense dimensions. For now, this is a simple for-loop, but it is easy to replace with joblib. The decorator recognizes, if arguments for the wrapped functions are dictionaries with dense state space dimensions as keys and automatically parallelizes over them.

Additional changes

Covariates were never handled better
- Only relevant covariates are used.
- The order in options is irrelevant as compute_covariates iterates over the covariates until no additional covariate can be computed.
- Covariates are only computed if its dependencies are present without NaNs.
- Covariates are separated into core, dense and mixed covariates.
- There is a function to identify all relevant covariates for a subset of covariates instead of simply computing all covariates.
Dramatically reduced setup runtime for estimation via ML by vectorizing data checks from 50s to 7s for data with 40k obs.
Faster simulation.
random models do not have observables with just one level anymore.
Solving the model is aligned to the simulation and others. First, create the solve function with rp.get_solve_func(params, options), then solve with state_space = solve(params).

…urceEconomics/respy into one-step-ahead-simulation

…t delta is zero.

codecov · 2020-02-12T22:32:54Z

Codecov Report

Merging #310 into master will not change coverage by %.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master     #310   +/-   ##
=======================================
  Coverage   84.22%   84.22%           
=======================================
  Files          42       42           
  Lines        2732     2732           
=======================================
  Hits         2301     2301           
  Misses        431      431

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 8562584...8562584. Read the comment docs.

…313)

janosg

Very nice PR. All comments are minor and it would be ok to merge as is.

respy/tests/utils.py

respy/parallelization.py

respy/pre_processing/process_covariates.py

respy/shared.py

respy/solve.py

respy/state_space.py

tobiasraabe and others added 30 commits October 16, 2019 15:29

Vectorized transform_disturbances.

2727941

Finished one-step-ahead simulation.

f9ee1ad

Merge branch 'develop' into one-step-ahead-simulation

c40bbcc

More fixes.

172579e

Merge branch 'one-step-ahead-simulation' of https://github.com/OpenSo…

5e7454c

…urceEconomics/respy into one-step-ahead-simulation

Fix period error.

c92ef63

Fixed osa sim with lagged choices.

f86f5d4

Simplified calc_vf_and_fu_func.

c51a5f4

Simplified code, removed useless test which boils down to testing tha…

893d243

…t delta is zero.

Some clarifications.

9a68f8b

Merge branch 'develop' into one-step-ahead-simulation

45b2148

Unified n-step and one-step-simulation.

bfa0df5

Merge branch 'develop' into one-step-ahead-simulation

021cb4c

n_periods depends on n-step-ahead or one-step-ahead.

00c8bd3

Use index for simulated data. Closes #255.

ba5e8b8

COrrected common returns in extended model.

ba24198

Rearranged functions.

c30048b

Another fix.

c47a3a6

even more.

c326c77

prevent automatic float conversion.

8e8f8cb

Final fix.

469cb36

Merge branch 'develop' into one-step-ahead-simulation

9a01b12

Merged develop into osa-sim.

1ed645e

Merge branch 'develop' into one-step-ahead-simulation

1bdc2cb

Fixed some errors and started tutorial notebook.

a907374

Added to docs.

9d6c9b3

Merge branch 'develop' into one-step-ahead-simulation

222cb4b

Made notebook better.

a012d49

Fix tests.

15f0785

Merge branch 'master' into one-step-ahead-simulation

f8542f2

Merge branch 'random-tests' into separate-core-dense-state-space

3373bfe

tobiasraabe added a commit that referenced this pull request Feb 13, 2020

Restructure likelihood.py for #310 and estimagic's comparison plot. (#…

820946f

…313)

tobiasraabe added 6 commits February 13, 2020 23:36

Merge branch 'master' into separate-core-dense-state-space

5feb87d

Some additions.

a5b1d18

Fix sphinx errors.

dc760de

Better documentation in the interpolation.

40c25a8

comment.

36d8721

Some fixes.

0e44372

tobiasraabe mentioned this pull request Feb 16, 2020

Implement exogenous processes. #329

Open

tobiasraabe added 4 commits February 16, 2020 15:05

Fix reward parallelization.

6ec5ee3

Fix test.

2c66014

Add joblib dependency.

7982dd9

Stay serial.

e31ccd1

tobiasraabe mentioned this pull request Feb 17, 2020

Update tutorial on observables #289

Closed

Updated notebooks in the documentation.

d4e7f86

tobiasraabe mentioned this pull request Feb 17, 2020

Dynamically write specialized state space functions. #222

Open

tobiasraabe added 4 commits February 17, 2020 12:07

Make docs work.

adf6cf8

Add to changes.

1852585

Fix test.

b68138a

Finishing.

e5c63c3

tobiasraabe requested a review from janosg February 17, 2020 14:21

Merge branch 'master' into separate-core-dense-state-space

cbbaa3d

tobiasraabe added the ready-for-review label Feb 20, 2020

tobiasraabe changed the title ~~[WIP] Separate core and dense state space.~~ Separate core and dense state space. Feb 20, 2020

janosg approved these changes Feb 26, 2020

View reviewed changes

Incorporated Janos' suggestions.

8562584

tobiasraabe merged commit 582b469 into master Feb 28, 2020

tobiasraabe mentioned this pull request Mar 5, 2020

The interpolation is broken. #339

Closed

tobiasraabe deleted the separate-core-dense-state-space branch March 10, 2020 14:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Separate core and dense state space. #310

Separate core and dense state space. #310

tobiasraabe commented Dec 30, 2019 •

edited

Loading

codecov bot commented Feb 12, 2020 •

edited

Loading

janosg left a comment

Separate core and dense state space. #310

Separate core and dense state space. #310

Conversation

tobiasraabe commented Dec 30, 2019 • edited Loading

Current behavior

Desired behavior

Solution / Implementation

codecov bot commented Feb 12, 2020 • edited Loading

Codecov Report

janosg left a comment

Choose a reason for hiding this comment

tobiasraabe commented Dec 30, 2019 •

edited

Loading

codecov bot commented Feb 12, 2020 •

edited

Loading