Add support for parsing PDF pages in parallel (multiprocessing) #17

phoewass · 2024-03-29T03:49:44Z

Closes #8

Parse pages in parallel using multiprocessing library leveraging all the available CPUs.

Checklist:

Process in parallel using the library
Process in parallel using the CLI
Tests
Documentation

Parse in parallel using multiprocessing library using available CPUs

camelot/handlers.py

bosd · 2024-04-03T06:00:16Z

@foarsitter Should we go ahead and merge this?

Parse in parallel using multiprocessing library using available CPUs

phoewass added 4 commits March 29, 2024 04:42

Add support for parsing PDFs in parallel

cf3b809

Parse in parallel using multiprocessing library using available CPUs

Add support for parallel processing in CLI

95efc2a

Add tests

5aa4d27

Update docs

e3cd4d9

phoewass changed the title ~~Add support for parsing PDF pages in parallel~~ Add support for parsing PDF pages in parallel (multiprocessing) Mar 29, 2024

phoewass mentioned this pull request Mar 29, 2024

[WIP] Add support for parsing PDF pages in parallel camelot-dev/camelot#237

Closed

4 tasks

foarsitter reviewed Mar 29, 2024

View reviewed changes

camelot/handlers.py Outdated Show resolved Hide resolved

Use loop instead of list comprehension for clarity

e80528b

foarsitter approved these changes Mar 29, 2024

View reviewed changes

bosd approved these changes Mar 31, 2024

View reviewed changes

foarsitter merged commit d606d88 into py-pdf:main Apr 3, 2024
11 checks passed

phoewass deleted the feature/parallel branch April 21, 2024 15:11

bosd referenced this pull request in bosd/pypdf_table_extraction Jul 26, 2024

Add support for parsing PDF pages in parallel (multiprocessing) (#17)

f74eefc

Parse in parallel using multiprocessing library using available CPUs

bosd added the enhancement New feature or request label Aug 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for parsing PDF pages in parallel (multiprocessing) #17

Add support for parsing PDF pages in parallel (multiprocessing) #17

phoewass commented Mar 29, 2024

bosd commented Apr 3, 2024

Add support for parsing PDF pages in parallel (multiprocessing) #17

Add support for parsing PDF pages in parallel (multiprocessing) #17

Conversation

phoewass commented Mar 29, 2024

bosd commented Apr 3, 2024