This repository contains a FastAPI-based web service for speech transcription and synthesis. The service provides endpoints for converting audio to text (speech-to-text) and text to audio (text-to-speech) using the VocalizeService
.
- Speech-to-Text: Convert audio input into text transcriptions.
- Text-to-Speech: Generate audio output from text input.
- URL:
/create-transcription
- Method: POST
- Parameters:
duration
(int): The duration of the audio to be transcribed.
- Response: Transcription of the provided audio duration.
- URL:
/create-speech
- Method: POST
- Parameters:
text
(str): The text to be converted into speech.
- Response: Generated audio from the provided text.
- FastAPI
- Pydantic
- Uvicorn
- Clone the repository:
git clone https://github.com/your-username/speech-service-api.git cd speech-service-api
- Install the dependencies:
pip install -r requirements.txt
- Run the application:
uvicorn main:app --reload