better pre-processing pdf files #1697
cloudrage999
started this conversation in
Ideas
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
i think private GPT needs to better parse and pre-process the pdf files , maybe using unstructured io or OCR tools
now that we have lamaparse , i think we should add such things to privategpt
this will make a big difference in getting accurate answers
currently if you want to run privategpt locally, you wont have a good experience if you got bunch of complex pdf files that have tables,messy format,pictures,diagrams etc
Beta Was this translation helpful? Give feedback.
All reactions