AI MLB VTuber and streamer with hands-free voice interaction, visual perception and TTS

Sodiumchloridy/melby-sama


This project is built for the Google Cloud x MLB™ Hackathon – Building with Gemini Models

This project may also serve as my backup plan as a CS grad if the job market goes south. Inspired by the AI VTuber Neuro-sama.

Melby-sama

Melby-sama (MLB... Melby... get it?) is an AI MLB VTuber and streamer powered by Google's multimodal AI. Which brings us to the segue to our sponsor: the Google Cloud x MLB Hackathon, powered by Gemini!

👀 Demo

Video Demo Melby-sama chilling
Melby-sama reacts to YouTube live chat Live Stream Demo

1. Setup

1. Install Poetry

Install Poetry by following the official installation guide, or via pip:

pip install poetry
poetry config virtualenvs.in-project true  # create the virtual environment inside the project directory

2. Set up the environment variables

Create a .env file at the root of the project with the following variables:

GEMINI_API_KEY=
SPEECH_KEY=
SPEECH_REGION=

You can get your own GEMINI_API_KEY at Google AI Studio.

You can get the SPEECH_KEY and SPEECH_REGION by following the steps below:

  1. Sign up for an Azure free account at https://azure.microsoft.com/free/cognitive-services.
  2. Create a Speech Services resource in the Azure Portal.
  3. Get the SPEECH_KEY and SPEECH_REGION from the resource.
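
Before starting the app, it can help to confirm that all three variables are actually filled in. The following is a minimal, standard-library-only sketch and is not part of the project: `parse_env` and `missing_keys` are hypothetical helper names, and the parser only understands plain `KEY=value` lines (no quoting or `export` syntax).

```python
from pathlib import Path

# The three variable names listed in the .env template above.
REQUIRED_KEYS = ("GEMINI_API_KEY", "SPEECH_KEY", "SPEECH_REGION")


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=value lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


def missing_keys(values: dict[str, str]) -> list[str]:
    """Return the required keys that are absent or left empty."""
    return [key for key in REQUIRED_KEYS if not values.get(key)]


if __name__ == "__main__":
    env_path = Path(".env")  # assumed to sit at the project root
    values = parse_env(env_path.read_text()) if env_path.exists() else {}
    missing = missing_keys(values)
    print("Missing: " + ", ".join(missing) if missing else "All variables set.")
```

Run from the project root, this reports any variable that is missing or still empty, which is easier to diagnose than an authentication error at startup.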

3. Run the project

At the root of the project, install the dependencies, then start the app:

poetry install
poetry run python src/main.py

4. Set up VTube Studio

  1. Download and launch VTube Studio.
  2. Optional (advanced): Route the model's output audio into the microphone input of VTube Studio via Voicemeeter Banana and VB-Cable.

5. Set up OBS Studio

  1. Add VTube Studio as a Game Capture source.
  2. Add src/temp/subtitles.txt as a Text source.
  3. If you set up Voicemeeter Banana and VB-Cable in Step 4.2, you'll need to configure the audio routing here too.
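
The subtitle source works because the app writes the current line of speech to a text file that OBS displays. How `src/main.py` actually does this is not shown here; the sketch below is only an illustration under that assumption. `write_subtitle`, `clear_subtitle`, and the 40-column wrap width are hypothetical choices, not the project's API.

```python
import textwrap
from pathlib import Path

# Path taken from the OBS step above, relative to the project root.
SUBTITLE_PATH = Path("src/temp/subtitles.txt")


def write_subtitle(text: str, path: Path = SUBTITLE_PATH, width: int = 40) -> str:
    """Wrap text to `width` columns and overwrite the subtitle file with it."""
    wrapped = textwrap.fill(text.strip(), width=width)
    path.parent.mkdir(parents=True, exist_ok=True)  # create src/temp if needed
    path.write_text(wrapped, encoding="utf-8")
    return wrapped


def clear_subtitle(path: Path = SUBTITLE_PATH) -> None:
    """Blank the file so no stale subtitle stays on screen."""
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text("", encoding="utf-8")
```

Wrapping before writing keeps long replies from running off the edge of the scene; the width would need tuning to the font size chosen in OBS.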
