Scrape Individual Savant Player Pages #354
Open
I recently started using this library, and it has greatly aided my work. I noticed a common pattern where I wanted to gather a player's year-over-year stats, such as their position on the exit velocity leaderboards across seasons. I couldn't find a way to do this without making multiple queries and then filtering and merging DataFrames. Luckily, this data is available on each player's Savant page, so I built functionality to gather the tables on those pages for both pitcher and batter data.
I added tests and followed the instructions in contributing.md. My only concern is how to link up the columns that pandas produces when it reads the HTML. The initial implementation below does this by matching column names, which seems reasonable to me, but it would also be possible to parse these tables manually.
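
For comparison, here is a minimal sketch of what the manual-parsing alternative could look like, using only the standard library. It is not the code in this PR: `TableParser`, `find_table`, and the sample HTML (with placeholder values) are hypothetical and made up for illustration, and real Savant pages may be structured differently. The sketch collects every table as rows of cell text, then picks the first table whose header row contains the required column names. It does not handle nested tables.

```python
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collect every <table> in a document as a list of rows of cell text."""

    def __init__(self):
        super().__init__()
        self.tables = []   # finished tables, each a list of rows
        self._rows = None  # rows of the table currently being read
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._rows = []
        elif tag == "tr" and self._rows is not None:
            self._row = []
        elif tag in ("th", "td") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Text outside a cell (e.g. whitespace between tags) is ignored.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("th", "td") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self._rows.append(self._row)
            self._row = None
        elif tag == "table" and self._rows is not None:
            self.tables.append(self._rows)
            self._rows = self._row = self._cell = None


def find_table(html, required_columns):
    """Return the first table whose header has all required columns.

    The result is a list of dicts keyed by header text, or None if no
    table matches.
    """
    parser = TableParser()
    parser.feed(html)
    for rows in parser.tables:
        if rows and set(required_columns) <= set(rows[0]):
            header, body = rows[0], rows[1:]
            return [dict(zip(header, row)) for row in body]
    return None


# Placeholder markup and values, not real Savant data.
SAMPLE_HTML = """
<table><tr><th>Season</th><th>Team</th></tr><tr><td>2021</td><td>AAA</td></tr></table>
<table>
  <tr><th>Season</th><th>Avg EV</th><th>Percentile</th></tr>
  <tr><td>2021</td><td>90.0</td><td>50</td></tr>
  <tr><td>2022</td><td>91.0</td><td>60</td></tr>
</table>
"""

rows = find_table(SAMPLE_HTML, ["Season", "Percentile"])
```

Matching on column names keeps the selection independent of table order on the page, at the cost of breaking if a header is renamed. Cell values come back as strings here, so numeric conversion would still be needed.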