Skip to content

aarenstade/speech-img-vid-generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

speech-img-vid-generator

This does a few things together

  1. Audio input
  2. Google speech to text API call
  3. Bing image request HTML scrape (because Google hates scraping)
  4. Download images and arrange into collage (1280x720)
  5. Create frames for video following timecode of words
  6. Generate video file w/ ffmpeg

To Run

You need a Google Service Account for the API. https://cloud.google.com/speech-to-text/docs/libraries

Add the credential JSON file to the GOOGLE_APPLICATION_CREDENTIALS path. https://cloud.google.com/docs/authentication/getting-started

Download the Chrome Driver https://chromedriver.chromium.org/

Modify chrome driver path variable in video/make_frame.py

Pip install the libraries you dont have. Make sure ffmpeg is installed.

Then, run python3 handler.py and enter the video title & audio file path.

About

Convert voice recording to a video with images.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages