Feed Dat Pup

Convert EPUB and PDF files to plain text - the ultimate standard format.

Usage

GET  /                  # home page
POST /extracts/pdf      # extract PDF text (pass url|file)
POST /extracts/epub     # extract EPUB text (pass url|file)

Example

curl localhost:4567/extracts/pdf -X POST --data 'url=http://www.pdf995.com/samples/pdf.pdf'
# JSON response
curl localhost:4567/extracts/pdf -X POST --data '{"url":"http://www.pdf995.com/samples/pdf.pdf"}' --header 'content-type: application/json'
# JSON response
curl localhost:4567/extracts/epub -X POST --data 'url=http://www.example.com/samples/bookish.epub'
# JSON response
curl localhost:4567/extracts/epub -X POST -F file=@bookish.epub
# JSON response
curl localhost:4567/extracts/pdf -X POST -F file=@pdf.pdf
# JSON response

Example PDF extraction response:

{
    "info": {
        "Author": "Software 995",
        "CreationDate": "12/12/2003 17:30:12",
        "Creator": "Pdf995",
        "Keywords": "pdf, create pdf, software, acrobat, adobe",
        "Producer": "GNU Ghostscript 7.05",
        "Subject": "Create PDF with Pdf 995",
        "Title": "PDF"
    },
    "metadata": null,
    "pages": [
        "<page 1 text>",
        "<page 2 text>",
        "<page 3 text>"
    ],
    "text": "<combined text of all pages>",
    "url": "http://www.pdf995.com/samples/pdf.pdf",
    "version": 1.3
}

Example EPUB extraction response:

{
    "info": {
        "Author": "Samuel Shem",
        "Title": "The House of God"
    },
    "metadata": null,
    "pages": [
        "<page 1 text>",
        "<page 2 text>",
        "<page 3 text>"
    ],
    "text": "<combined text of all pages>",
    "file": "the_house_of_god.epub",
    "version": "2005-1"
}

Installation

# clone the repository...
bundle install
ruby app.rb

What's with the name?

EAT PDF EPUB =~> FEED DAT PUP

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
services		services
uploads		uploads
views		views
.gitignore		.gitignore
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
README.md		README.md
Rakefile		Rakefile
app.rb		app.rb
config.ru		config.ru
environments.rb		environments.rb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Feed Dat Pup

Usage

Example

Installation

What's with the name?

About

Releases 1

Packages

Languages

NarroApp/feed-dat-pup

Folders and files

Latest commit

History

Repository files navigation

Feed Dat Pup

Usage

Example

Installation

What's with the name?

About

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages