Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for parsing PDF pages in parallel (multiprocessing) #17

Merged
merged 5 commits into from
Apr 3, 2024

Conversation

phoewass
Copy link

Closes #8

Parse pages in parallel using multiprocessing library leveraging all the available CPUs.

Checklist:

  • Process in parallel using the library
  • Process in parallel using the CLI
  • Tests
  • Documentation

@phoewass phoewass changed the title Add support for parsing PDF pages in parallel Add support for parsing PDF pages in parallel (multiprocessing) Mar 29, 2024
camelot/handlers.py Outdated Show resolved Hide resolved
@bosd
Copy link
Collaborator

bosd commented Apr 3, 2024

@foarsitter Should we go ahead and merge this?

@foarsitter foarsitter merged commit d606d88 into py-pdf:main Apr 3, 2024
11 checks passed
@phoewass phoewass deleted the feature/parallel branch April 21, 2024 15:11
bosd referenced this pull request in bosd/pypdf_table_extraction Jul 26, 2024
Parse in parallel using multiprocessing library using available CPUs
@bosd bosd added the enhancement New feature or request label Aug 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants