🖼️PixScribe- AI Image-to-Story Generator

Turn images into vivid stories, poems, or captions using cutting-edge AI models!
This application combines vision and language models to let users upload an image, generate a caption using BLIP, and transform it into creative text using TinyLLaMA.

✨ Features

📸 Image Captioning: Uses BLIP to describe uploaded images.
🧠 Text Generation: Uses TinyLLaMA to generate:
- 🎭 Short stories
- 📝 Poems
- 💬 Social media-style captions
🎨 Tone Customization: Choose from 5 tone styles: funny, emotional, dark, creative, and formal.
⚡ Runs on GPU: Optimized for local machines with 6GB+ VRAM (e.g., RTX 4050).

🛠️ Tech Stack

Component	Model / Tool
Image Captioning	`Salesforce/blip-image-captioning-base`
Text Generation	`TinyLlama/TinyLlama-1.1B-Chat-v1.0`
Interface	`Gradio`
Environment	`Python`, `Torch`, `Transformers`

🚀 How It Works

Upload an Image
BLIP generates a caption describing the image
Select the desired Tone and Type (story, poem, caption)
TinyLLaMA creates the output using your custom prompt
Output is displayed with clear formatting

💻 Setup Instructions

Clone this repo

git clone https://github.com/yourusername/image-to-story-generator.git cd image-to-story-generator

Create virtual environment

python -m venv venv source venv/bin/activate # or venv\Scripts\activate on Windows

Install requirements

pip install -r requirements.txt

Run the app

python app.py

🎨 Examples

Image	Caption	Story/Poem Example
🌸 Pink Lotus	“a pink lotus flower with a blue background”	“As the sun shimmers through morning dew...”

📸 Screenshots

📂 Project Structure

. ├── app.py # Gradio UI

├── blip_caption.py # BLIP captioning logic

├── story_generator.py # TinyLLaMA story/poem/caption generator

├── requirements.txt

└── README.md

🧠 Inspiration

This project was built to explore multimodal generative AI — combining computer vision and language models to create meaningful, artistic content from images.

📃 License

MIT License Feel free to fork, modify, and contribute!

Issues

Feel free to add any issues you find

🙋‍♂️ Author

Deshan Senanayake 📧 smddsenyake@gmail.com

🔗 LinkedIn : https://www.linkedin.com/in/deshan-senanayake-7a0695292/

🔗 GitHub : https://github.com/Deshan-Senanayake

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gradio/flagged		.gradio/flagged
.idea		.idea
LICENSE		LICENSE
README.md		README.md
app.py		app.py
blip_caption.py		blip_caption.py
requirements.txt		requirements.txt
story_generator.py		story_generator.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🖼️PixScribe- AI Image-to-Story Generator

✨ Features

🛠️ Tech Stack

🚀 How It Works

💻 Setup Instructions

🎨 Examples

📸 Screenshots

📂 Project Structure

🧠 Inspiration

📃 License

Issues

🙋‍♂️ Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🖼️PixScribe- AI Image-to-Story Generator

✨ Features

🛠️ Tech Stack

🚀 How It Works

💻 Setup Instructions

🎨 Examples

📸 Screenshots

📂 Project Structure

🧠 Inspiration

📃 License

Issues

🙋‍♂️ Author

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages