Vibe Classification Engine

A powerful AI-powered video processing engine that analyzes fashion videos to detect clothing items, match them with products, and classify the overall aesthetic vibe.

Features

Video Processing: Extracts frames and audio from uploaded videos
Fashion Item Detection: Uses YOLOv8 to detect clothing and fashion accessories
Product Matching: Matches detected items with products using CLIP embeddings
Vibe Classification: Classifies the aesthetic style/vibe of the content
Audio Transcription: Transcribes audio using Whisper for additional context
REST API: FastAPI-based API for easy integration

Prerequisites

Python 3.8+
CUDA-capable GPU (recommended)
FFmpeg installed on your system

Installation

Clone the repository:

git clone https://github.com/yourusername/flickd_ai_engine.git
cd flickd_ai_engine

Create and activate a virtual environment:

python -m venv venv
# On Windows
.\venv\Scripts\activate
# On Unix/MacOS
source venv/bin/activate

Install dependencies:

pip install -r requirements.txt

Download required models:

YOLOv8 model will be downloaded automatically on first run
Other models will be downloaded automatically when needed

Project Structure

flickd_ai_engine/
├── data/               # Data files including products.csv
├── frames/            # Temporary frame storage
├── images/            # Product images
├── models/            # Downloaded ML models and stores the embeddings 
├── outputs/           # Processing outputs
├── utils/             # Utility modules
├── main.py            # Main application code
├── requirements.txt   # Python dependencies
└── README.md         # This file

Usage

Starting the API Server

python main.py

The server will start at http://localhost:8000

API Endpoints

POST /process-video: Process a video file
- Input: Video file and optional caption
- Output: JSON with detected items, matched products, and vibe classification
GET /health: Health check endpoint

Example API Usage

import requests

url = "http://localhost:8000/process-video"
files = {"video": open("video.mp4", "rb")}
data = {"caption": "Optional video caption"}

response = requests.post(url, files=files, data=data)
results = response.json()

Features in Detail

Fashion Item Detection

Uses YOLOv8 for object detection
Detects 30+ fashion-related classes
Configurable confidence thresholds

Product Matching

Uses CLIP embeddings for semantic matching
FAISS index for efficient similarity search
Matches detected items with product catalog

Vibe Classification

Classifies content into aesthetic categories:
- Coquette
- Clean Girl
- Cottagecore
- Streetcore
- And more...

Contributing

Fork the repository
Create your feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

YOLOv8 by Ultralytics
CLIP by OpenAI
Whisper by OpenAI
FAISS by Facebook Research

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
data		data
images		images
utils		utils
.gitignore		.gitignore
README.md		README.md
demo.mp4		demo.mp4
main.py		main.py
requirements.txt		requirements.txt
run_pipeline.py		run_pipeline.py
test_pipeline.py		test_pipeline.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vibe Classification Engine

Features

Prerequisites

Installation

Project Structure

Usage

Starting the API Server

API Endpoints

Example API Usage

Features in Detail

Fashion Item Detection

Product Matching

Vibe Classification

Contributing

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Vibe Classification Engine

Features

Prerequisites

Installation

Project Structure

Usage

Starting the API Server

API Endpoints

Example API Usage

Features in Detail

Fashion Item Detection

Product Matching

Vibe Classification

Contributing

License

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages