Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix CA2 audio files #807

Merged
merged 8 commits into from
Dec 12, 2023
Merged

Fix CA2 audio files #807

merged 8 commits into from
Dec 12, 2023

Conversation

quevon24
Copy link
Member

Url changed from https://www.ca2.uscourts.gov/decisions to: https://ww3.ca2.uscourts.gov/decisions

I added the backscraper required to get the missing audio files since June 14th, 2023
Added pagination support
Added a manual fix for incorrect data in row and handle and empty row

ERosendo and others added 2 commits December 11, 2023 13:37
Add _download_backwards method to get missing oral arguments since june 14th, 2013
Fix data in specific row and ignore empty row
@quevon24 quevon24 linked an issue Dec 11, 2023 that may be closed by this pull request
Copy link
Contributor

@flooie flooie left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@quevon24 thanks for trying to fix this. I think it's overly complicated. The code actually worked - and just needed the base_url to be added to the URL to fix the issue.

And the back scraper is more complicated than I would expect here.

@quevon24 quevon24 requested a review from flooie December 12, 2023 17:36
Copy link
Contributor

@flooie flooie left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good to me, thanks @quevon24

@flooie flooie merged commit 234fe71 into main Dec 12, 2023
@flooie flooie deleted the 803-ca2-podcast-down branch December 12, 2023 21:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

CA2 Podcast Down
3 participants