Image to Audio Story Generator

Turn images into captivating audio stories with just a click! This innovative project combines image analysis, natural language generation, and text-to-speech conversion to create audio stories from uploaded images.

The Image to Audio Story Generator is a Streamlit-based web application that leverages cutting-edge technologies to transform images into engaging audio narratives. This project aims to bring creativity and storytelling to a whole new level by automatically generating stories based on the content of uploaded images.

Features

Image-to-Text Conversion: Utilizes the Salesforce/blip-image-captioning-base model to extract text content from images.
AI-Generated Stories: Employs the OpenAI GPT-3.5 Turbo model to generate creative and unique short stories based on the extracted text.
Text-to-Speech: Converts the generated story into an audio file (MP3) using gTTS (Google Text-to-Speech).
User-Friendly Interface: A user-friendly web interface built with Streamlit for easy image upload and story presentation.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Diwali-Festival.jpg		Diwali-Festival.jpg
README.md		README.md
app.py		app.py
audio.mp3		audio.mp3
children.jpg		children.jpg
photo.png		photo.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image to Audio Story Generator

Features

About

Releases

Packages

Languages

MuvvaThriveni/Image_To_Story_Teller

Folders and files

Latest commit

History

Repository files navigation

Image to Audio Story Generator

Features

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages