An AI-powered storytelling video generator that takes user input as a story prompt, generates a story using OpenAI's GPT-3, creates images using OpenAI's DALL-E, adds voiceover using ElevenLabs API, and combines the elements into a video.
To DO: New project can be found at https://github.com/BB31420/loveListLace
These instructions will help you set up the project on your local machine.
- Python 3.6 or higher
- Pip (Python package manager)
- FFmpeg (a command-line program for video processing)
This helps keep packages seperate to avoid conflicts. Use the venv when running the code and before installing the required packages. The code requires openai 0.28, which is specified in the requirements.txt.
- Navigate to the project directory with
cd
ls -la
- Windows:
python -m venv .venv
thenSet-ExecutionPolicy -ExecutionPolicy RemoteSigned -Scope CurrentUser
- Linux:
python3 -m venv .venv
- Activate the venv
- Windows:
venv\Scripts\Activate.ps1
- Linux:
source .venv/bin/activate
- Close venv when finished running main.py, it needs to be active to use the packages
deactivate
- Clone the repository:
git clone https://github.com/BB31420/AI-Auto-Video-Generator.git
- Navigate to the project directory:
cd AI-Auto-Video-Generator
- Install the required Python packages:
pip install -r requirements.txt
- Install FFmpeg:
- On macOS, you can use Homebrew: brew install ffmpeg
- On Linux, you can use your package manager (e.g., apt-get install ffmpeg on Ubuntu)
- On Windows, you can download an installer from the official FFmpeg website
- Install spacy:
python -m spacy download en_core_web_sm
- Edit the file named .env in the project directory and add your API keys:
OPENAI_API_KEY=your_openai_api_key
ELEVENLABS_API_KEY=your_elevenlabs_api_key
- Replace your_openai_api_key and your_elevenlabs_api_key with your actual API keys.
- Edit the file named caption_generator.py with your desired font path, specify the directory and font name.
- Linux fonts can be found at:
"/usr/share/fonts"
- Windows fonts can be found at:
C:\Windows\Fonts
- Linux fonts can be found at:
- Run the autovideo.py script by navigating to the directory the script and .env file are saved. Output will be generated in the same folder:
- Windows:
python main.py
- Linux: 'python3 main.py'
- Follow the prompts to enter a story prompt and generate a video.
- If you encounter errors related to missing dependencies, make sure you have installed the required Python packages by running
pip install -r requirements.txt
- If you encounter errors related to FFmpeg, make sure it is installed on your system and available in your system's PATH.
- https://platform.openai.com/account/api-keys
- https://beta.elevenlabs.io/subscription Click user icon then profile
- Keep your keys safe
This instructable will guide you through modifying the provided code to focus on generating haikus and fact-based videos about bees. We'll cover changing the prompt, the models, text overlay, and background color and positioning.
- Example prompts
- For haikus, enter a prompt related to haikus:
Create a haiku about nature
- For bee facts, use a prompt related to bees:
Tell me 5 interesting facts about bees
- Changing the models
To change the model, replace the engine parameter in the openai.Completion.create() function. For example, use the text-curie-002 model:
response = openai.Completion.create(
engine="text-curie-002",
prompt=prompt,
max_tokens=400,
n=1,
stop=None,
temperature=0.7,
)
- Changing the duration of each image
- To change the duration of each image, modify the set_duration() parameter in the create_video() function. For example, set the duration to 5 seconds per image:
image_clips = [mpy.ImageClip(img).set_duration(5) for img in image_filenames]
4 . Changing the number of images
- To change the number of images, update the num_prompts parameter in the extract_image_prompts() function. For example, to generate 6 images, change the function call as follows:
def extract_image_prompts(story, num_prompts=5):
- Changing the voice used for the voiceover
- To change the voice for the voiceover, modify the generate_voiceover() function. Update the URL used in the requests.post() call with the desired voice ID. For example, to use a different voice, replace the existing voice ID with a new one (e.g., "21m00Tcm4TlvDq8ikWAM", "yoZ06aMxZJJ28mfd3POQ", or "AZnzlk1XvdvUeBnXmlld"):
response = requests.post("https://api.elevenlabs.io/v1/text-to-speech/NEW_VOICE_ID", headers=headers, json=data)
Now, you can modify the code based on the provided instructions to generate haikus or fact-based videos about bees, with the desired duration for each image, the number of images, and the voice used for the voiceover. Remember to customize the prompts, models, text overlay, background color and positioning, image duration, image count, and voiceover based on your requirements.
Additionally, be mindful of the token limits for your API usage, as exceeding these limits may result in additional charges. You can adjust the max_tokens parameter in the openai.Completion.create() function to control the length of the generated text. Consider setting a lower value to stay within your API limits, especially when generating longer stories or facts.