-
-
Notifications
You must be signed in to change notification settings - Fork 114
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix CA2 audio files #807
Fix CA2 audio files #807
Conversation
Add _download_backwards method to get missing oral arguments since june 14th, 2013 Fix data in specific row and ignore empty row
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@quevon24 thanks for trying to fix this. I think it's overly complicated. The code actually worked - and just needed the base_url to be added to the URL to fix the issue.
And the back scraper is more complicated than I would expect here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good to me, thanks @quevon24
Url changed from https://www.ca2.uscourts.gov/decisions to: https://ww3.ca2.uscourts.gov/decisions
I added the backscraper required to get the missing audio files since June 14th, 2023
Added pagination support
Added a manual fix for incorrect data in row and handle and empty row