Speech-to-Text API

This project provides a FastAPI-based web service to convert audio files to text using the SpeechRecognition library.

Features

Convert common audio formats (MP3, WAV, etc.) to text.
Support for multiple languages (the default is currently Thai).
Switch the speech recognition engine and language at runtime via the API.
Accept audio either as a file upload or as Base64-encoded input.

Requirements

Python 3.9+
FastAPI
SpeechRecognition
PyDub
MoviePy

Installation

Clone the repository:

```
git clone https://github.com/PongpreechaSuea/Speech2Text.git
cd Speech2Text
```

Create a virtual environment and activate it:

```
python -m venv venv
source venv/bin/activate  # On Windows use venv\Scripts\activate
```

Install the required packages:
```
pip install -r requirements.txt
```

Configuration

Update the config.py file with your desired settings:

Speech recognition engine
Default language
File paths for temporary storage
Server host and port
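
As a rough illustration of the settings listed above, a config.py could look like the sketch below. Every variable name and the "google" engine value are assumptions made for this example; the real config.py in the repository may use different names and values. Only the Thai default and port 3000 come from this README.

```python
# Hypothetical config.py layout; names are illustrative, not taken from the repository.
from pathlib import Path

ENGINE = "google"        # assumed engine identifier
LANGUAGE = "th-TH"       # language tag for Thai, the default mentioned in Features
TEMP_DIR = Path("temp")  # assumed directory for temporary audio files
HOST = "0.0.0.0"         # assumed bind address (listen on all interfaces)
PORT = 3000              # port used in the URLs in this README
```

Check the actual config.py before editing, and keep the port in sync with the URLs you use in the examples.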

Usage

Start the server: python app.py
The API will be available at http://localhost:3000 (or the port you configured).
Use the following endpoints:

GET /: Get API information
PUT /v1/api/using/engine: Update speech-to-text engine
PUT /v1/api/using/language: Update speech-to-text language
POST /v1/api/using/speech2text: Convert speech to text (file upload)
POST /v1/api/using_base64/speech2text_base64: Convert speech to text (base64 encoded audio)

API Documentation

Once the server is running, you can access the API documentation at http://localhost:3000/docs
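
FastAPI also serves the machine-readable OpenAPI schema at /openapi.json by default, which is a convenient way to confirm the endpoints and request fields from a script. The sketch below assumes the project keeps FastAPI's default schema URL; `list_routes` and `fetch_schema` are helper names invented for this example.

```python
import json
from urllib.request import urlopen

HTTP_METHODS = {"get", "put", "post", "delete", "patch", "head", "options"}


def list_routes(schema: dict) -> list[tuple[str, str]]:
    """Return (METHOD, path) pairs found in an OpenAPI schema dict."""
    routes = []
    for path, operations in schema.get("paths", {}).items():
        for method in operations:
            # Skip non-method keys such as "parameters" or "summary".
            if method.lower() in HTTP_METHODS:
                routes.append((method.upper(), path))
    return sorted(routes, key=lambda route: (route[1], route[0]))


def fetch_schema(base_url: str = "http://localhost:3000") -> dict:
    """Download the schema; assumes the default /openapi.json location."""
    with urlopen(f"{base_url}/openapi.json") as resp:
        return json.load(resp)
```

With the server running, `list_routes(fetch_schema())` should list the endpoints described under Usage.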

Examples

Convert audio file to text

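```python
import requests

url = "http://localhost:3000/v1/api/using/speech2text"

# Open the file in a context manager so the handle is closed after the upload.
with open("audio.mp3", "rb") as audio:
    response = requests.post(url, files={"file": audio})

response.raise_for_status()  # raise an exception on HTTP error responses
print(response.json())
```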
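
Convert Base64-encoded audio to text

The Base64 endpoint listed under Usage can be called in a similar way. This README does not document the request body, so the sketch below assumes a JSON body with a single field; the field name `audio_base64` and the helper `build_base64_request` are hypothetical. Confirm the real field name at http://localhost:3000/docs before relying on it.

```python
import base64
import json
from urllib.request import Request

URL = "http://localhost:3000/v1/api/using_base64/speech2text_base64"


def build_base64_request(audio_bytes: bytes, field: str = "audio_base64") -> Request:
    """Build (but do not send) a POST request carrying Base64-encoded audio.

    The JSON field name is an assumption; the actual API may expect another one.
    """
    encoded = base64.b64encode(audio_bytes).decode("ascii")
    body = json.dumps({field: encoded}).encode("utf-8")
    return Request(
        URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

To send it, read the audio file as bytes, pass them to `build_base64_request`, and hand the result to `urllib.request.urlopen`.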
