-
Notifications
You must be signed in to change notification settings - Fork 100
Project Meeting 2017.08.04
Ben Stabler edited this page Aug 3, 2017
·
8 revisions
- Removed duplication of sampled alternatives from downstream processing in order to save computation time
- Previously, if an alternative was oversampled by the sampler, then we left all occurrences of it in the alternative set, and then, for the final choice, set the utility of duplicates to NA. The first occurrence has the real utility, whereas the 2nd, 3rd, etc. gets an additional NA added to it in order to turn it off. All the alternatives also have their sample correction factor.
- This prior approach - "with duplication" - meant we could keep the assumed X number of samples per chooser code design/pattern
- Now, the code has been refactored to allow for ragged alternative arrays - i.e. no duplication and different number of alternatives per chooser. If an alternative is oversampled, then the duplicates are removed from the alternative set.
- This means we don't calculate expensive logsums for duplicates and for final choice utilities either
- This save a bunch of time for these modules, ~30% in our testing so far (which depends on the duplication rate)
- Both work and school location choice have been "de-duplicated" (see school and work commits)
- @We still need to test with the full scale example (all zones, all hhs, all skims)
- @We still need to update documentation for the logsums revision
- Updated the Phase 3 Progress Report
- Made some updates to the Phase 3 Amendment SOW, but @still need to complete it