Combine Text-to-Speech and Speech-to-Text into a conversational agent.
Project codename EmilyAI
The purpose of this demo is to showcase how you can build a Conversational AI application that engages users in natural language interactions, mimicking human conversation through natural language processing using Deepgram.
Examples of where you would see this type of application include: virtual assistants for tasks like answering queries and controlling smart devices, educational tutors for personalized learning, healthcare advisors for medical information, and entertainment chat bots for engaging conversations and games.
These applications aim to enhance user experiences by offering efficient and intuitive interactions, reducing the need for human intervention in various tasks and services.
If you have found a bug or if you have a feature request, please report them at this repository issues section. Please do not report security vulnerabilities on the public GitHub issue tracker.
Check out our KNOWN ISSUES before reporting.
- Capture streaming audio using Deepgram Streaming Speech to Text.
- Natural Language responses using an OpenAI LLM.
- Speech to Text conversion using Deepgram Aura Text to Speech.
Deepgram is a foundational AI company providing speech-to-text and language understanding capabilities to make data readable and actionable by human or machines.
Want to start building using this project? Sign-up now for Deepgram and create an API key.
Follow these steps to get started with this starter application.
Go to GitHub and clone the repository.
Install the project dependencies.
npm install
Copy the code from sample.env.local
and create a new file called .env.local
.
DEEPGRAM_STT_DOMAIN=https://api.deepgram.com
DEEPGRAM_API_KEY=YOUR-DG-API-KEY
OPENAI_API_KEY=YOUR-OPENAI-API-KEY
- For
DEEPGRAM_API_KEY
paste in the key you generated in the Deepgram console. - Set
DEEPGRAM_STT_DOMAIN
to behttps://api.deepgram.com
. OPENAI_API_KEY
should be an OpenAI API Key that can access the chat completions API.
Once running, you can access the application in your browser.
npm run dev
We love to hear from you so if you have questions, comments or find a bug in the project, let us know! You can either:
- Open an issue in this repository
- Join the Deepgram Github Discussions Community
- Join the Deepgram Discord Community
This project is licensed under the MIT license. See the LICENSE file for more info.