A lightweight tool that transforms PDF documents into audiobooks using Coqui TTS technology.
- PDF text extraction
- High-quality text-to-speech conversion using Coqui TTS
- Simple and user-friendly interface
- Support for long-form content
- Python 3.10 or higher
- pip (Python package installer)
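To confirm that your interpreter meets the version requirement before installing, a quick check along these lines may help. This is a minimal sketch: the helper name `meets_minimum_version` is hypothetical and not part of this project.

```python
import sys

# Minimum version taken from the requirements listed above.
MINIMUM_VERSION = (3, 10)


def meets_minimum_version(version_info=None, minimum=MINIMUM_VERSION):
    """Return True if the (major, minor, ...) version is at least `minimum`."""
    if version_info is None:
        version_info = sys.version_info
    # Compare only major and minor; patch level does not matter here.
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    found = sys.version.split()[0]
    if meets_minimum_version():
        print("Python version OK:", found)
    else:
        print("Python 3.10 or higher is required, found:", found)
```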
- Clone the repository:
git clone https://github.com/JaffeMarques/pdf_to_mp3_ai.git
cd pdf_to_mp3_ai
- Create a virtual environment (recommended):
python -m venv venv
source venv/bin/activate # On Windows use: venv\Scripts\activate
- Install required packages:
pip install -r requirements.txt
pdf_to_mp3_ai/
├── resources/
│ ├── files/ # Place your PDF files here
│ └── voices/ # Place your custom voice models (.wav) here
├── src/
│ └── ...
├── requirements.txt
└── README.md
- Place your PDF file in the resources/files directory
- If using custom voices, place the .wav file in the resources/voices directory
- Run the conversion:
python main.py process
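Before running the conversion, it can be useful to confirm that the input folder actually contains PDF files. The sketch below is illustrative only: `find_pdfs` is a hypothetical helper, and it assumes that only files directly inside the folder count and that the extension is matched case-insensitively. The program's own detection logic may differ.

```python
from pathlib import Path


def find_pdfs(directory="resources/files"):
    """Return the PDF files found directly inside `directory`, sorted by path."""
    folder = Path(directory)
    if not folder.is_dir():
        # A missing folder is treated the same as an empty one.
        return []
    return sorted(
        p for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() == ".pdf"
    )


if __name__ == "__main__":
    pdfs = find_pdfs()
    if pdfs:
        for pdf in pdfs:
            print("Found:", pdf.name)
    else:
        print("No PDF files found in resources/files")
```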
- Only PDF files placed in resources/files will be detected by the program
- Custom voice models must be placed in the resources/voices directory
- Make sure your PDF files are text-based and not scanned images
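Scanned PDFs usually yield little or no text when passed through a text extractor. One rough way to spot them is sketched below, assuming the page texts have already been extracted with whichever PDF library you use. The function name `looks_text_based` and both thresholds are illustrative assumptions, not part of this project.

```python
def looks_text_based(page_texts, min_chars_per_page=20, min_fraction=0.5):
    """Guess whether a PDF is text-based from its extracted page texts.

    A page counts as textual if it has at least `min_chars_per_page`
    non-whitespace characters. The document passes if at least
    `min_fraction` of its pages are textual.
    """
    pages = list(page_texts)
    if not pages:
        return False
    textual = sum(
        1 for text in pages
        # `text or ""` tolerates extractors that return None for empty pages.
        if len("".join((text or "").split())) >= min_chars_per_page
    )
    return textual / len(pages) >= min_fraction


if __name__ == "__main__":
    sample_pages = ["Chapter 1. It was a bright cold day in April.", ""]
    print("Text-based:", looks_text_based(sample_pages))
```

If the check fails, the file is likely a scanned image and would need OCR before it can be converted.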
This project is licensed under the Mozilla Public License 2.0 - see the LICENSE file for details.
- Coqui TTS for the text-to-speech technology
- All contributors to this project
Contributions are welcome! Please feel free to submit a Pull Request.
If you encounter any problems, please open an issue with a detailed description.