Dashboard crawls should support date range as a parameter #1196

Closed
adborden opened this issue Dec 6, 2019 · 2 comments
Labels
bug Software defect or bug

Comments

adborden commented Dec 6, 2019

Dashboard crawls depend on point-in-time versions of agency data.json files. If the job does not run on a specific day, there is no way to go back in time to get that day's data.json. A failed nightly job therefore means data loss, and if it fails several days in a row, it becomes a high-priority issue.

Having a time-sensitive operation is difficult for O&M.

How to reproduce

  1. Disable the cron job for a day
  2. Nightly cron does not run
  3. Run the cron job the following day

Expected behavior

Crawl can create data for the missed day.

Actual behavior

Data is lost for the missed day.

adborden added the component/dashboard and bug labels Dec 6, 2019

adborden commented Dec 6, 2019

Not sure that we can ever fix this. We don't have a time machine and we don't control agency data.json files. Still, we can document this undesired property and identify mitigations that make it less painful for O&M.

E.g. if the daily snapshotting of data.json became commoditized, the nightly job could be parameterized with a date range, and just operate on the snapshots.
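
A minimal sketch of what that could look like, assuming snapshots already exist on disk. The directory layout (`<root>/<agency>/<YYYY-MM-DD>/data.json`) and the helper names `date_range`, `snapshot_path`, and `plan_crawl` are hypothetical, not part of the dashboard code; the point is only that a job driven by a date range can report which days it can backfill and which have no snapshot.

```python
from datetime import date, timedelta
from pathlib import Path
from typing import Iterator, List, Tuple


def date_range(start: date, end: date) -> Iterator[date]:
    """Yield every date from start to end, inclusive (nothing if start > end)."""
    for offset in range((end - start).days + 1):
        yield start + timedelta(days=offset)


def snapshot_path(root: Path, agency: str, day: date) -> Path:
    # Assumed layout (hypothetical): <root>/<agency>/<YYYY-MM-DD>/data.json
    return root / agency / day.isoformat() / "data.json"


def plan_crawl(
    root: Path, agency: str, start: date, end: date
) -> Tuple[List[Path], List[date]]:
    """Split the range into snapshot files that exist and days with no snapshot."""
    found: List[Path] = []
    missing: List[date] = []
    for day in date_range(start, end):
        path = snapshot_path(root, agency, day)
        if path.is_file():
            found.append(path)   # the crawl can be re-run against this file
        else:
            missing.append(day)  # no snapshot was taken; this day stays lost
    return found, missing
```

Calling `plan_crawl(root, "example-agency", date(2019, 12, 4), date(2019, 12, 6))` the day after an outage would return the snapshots to process plus the days that still cannot be recovered, which is the part O&M would need to see.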

@nickumia-reisys
Contributor
