PictureTales allows you to upload an image, and it will generate a short story based on the image's content using image captioning. The generated story is then converted to audio using text-to-speech technology. You can both see the generated story and listen to it.
story.mp4
- Upload an image.
- Generate a story based on the content of the image.
- Listen to the generated story as an audio file.
- Clone this repository to your local machine.
git clone https://github.com/SartajBhuvaji/Image-to-Story-Generator.git
pip install -r requirements.txt
python app.py
- Create a .env file and paste your HUGGINGFACE, OPEN AI API Keys (Check the dummy_env file)
- Open your web browser and navigate to http://localhost:7860 to access the app.
- Upload an image to the app and click "Generate Story." You will see the generated story and be able to listen to it as audio.
- HuggingFace
- Image to Caption model
- Chat GPT 3.5 LLM
- Text-to-speech