Skip to content

This project is using PDF Extract API to convert documents and extract data

License

Notifications You must be signed in to change notification settings

CatchTheTornado/text-extract-api-demo

This branch is 9 commits ahead of CatchTheTornado/llm-pdf-ocr-anonimizer:main.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

f6ef97e · Jan 8, 2025

History

15 Commits
Jul 28, 2024
Jan 8, 2025
Jul 28, 2024
Jul 28, 2024
Jul 28, 2024
Jan 8, 2025
Jul 28, 2024
Jul 28, 2024
Jan 8, 2025
Jan 8, 2025
Jul 28, 2024
Nov 14, 2024
Jul 28, 2024
Jan 8, 2025

Repository files navigation

text-extract-api demo app

This tool uses text-extract-api to extract text from PDF files.

Read how to Get started with the API and CLI.

Warning: This is a demo version of the API and it may not be available at all times. The processing time may be longer than expected. Please DO NOT upload any sensitive, confidential or personal data. You are doing it at your own risk.

Online access

This demo is available online: https://demo.doctractor.com/

How to deploy it locally?

First, run the development server:

npm install
npm run dev

Note: On Apple Silicon please run these commands first (you will neeed Homebrew):

xcode-select --install
brew install pkg-config cairo pango libpng jpeg giflib librsvg
CPLUS_INCLUDE_PATH=/opt/homebrew/include npm install canvas

Open http://localhost:3000 with your browser to see the result.

About

This project is using PDF Extract API to convert documents and extract data

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • TypeScript 83.1%
  • JavaScript 13.1%
  • CSS 3.8%