This project demonstrates a modern AI-powered wine recommendation system that combines vector embeddings, the Qdrant vector database, and a large language model (LLM) to deliver personalized suggestions based on user queries.
The system acts like a virtual sommelier. Deployed on an edge device, such as a digital kiosk on restaurant dining tables, it can make recommendations directly to diners. By capturing a sommelier's knowledge in vector embeddings, this kind of LLM application can increase sales through personalization, enhance the customer experience, differentiate a brand, and move inventory efficiently, all while remaining available 24/7 without requiring human staff.
- Vector Embeddings: Uses the Sentence-Transformers `all-MiniLM-L6-v2` model to encode wine tasting notes into numerical vectors (see the encoding sketch after this list).
- Vector Database: Leverages Qdrant as an in-memory vector database to store and search embeddings efficiently.
- Semantic Search: Finds wines most relevant to a user query, e.g., "Suggest an amazing Malbec from Argentina."
- LLM Integration: Connects search results to an OpenAI GPT-4 Turbo model to generate natural language recommendations.
- Data Handling: Cleans and samples the wine dataset to ensure smooth embedding and indexing.
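
A minimal sketch of the encoding step, assuming the `sentence-transformers` package is installed (the tasting note is illustrative):

```python
from sentence_transformers import SentenceTransformer

# all-MiniLM-L6-v2 maps text to 384-dimensional dense vectors
encoder = SentenceTransformer("all-MiniLM-L6-v2")

# Semantically similar tasting notes end up close together in vector space
vector = encoder.encode("Ripe black plum and violet aromas with a velvety finish")
print(vector.shape)  # (384,)
```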
- Uses a CSV of top-rated wines.
- Filters out entries with missing varieties (`NaN`) to avoid errors during embedding.
- Samples 700 records for efficient indexing and search (see the cleaning sketch after this list).
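
A minimal sketch of the cleaning and sampling step; the CSV file name and the `variety` column name are assumptions for illustration:

```python
import pandas as pd

# Load the CSV of top-rated wines (file name is hypothetical)
df = pd.read_csv("top_rated_wines.csv")

# Drop rows with a missing variety (NaN) so every record can be embedded
df = df[df["variety"].notna()]

# Keep a 700-record sample for fast in-memory indexing and search
df = df.sample(700, random_state=42).reset_index(drop=True)
```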
- Python 3.9+
- Sentence-Transformers for embeddings
- Qdrant vector database
- OpenAI GPT-4 Turbo
- Pandas for data processing
- Load and clean the wine dataset.
- Sample a subset for efficient processing.
- Encode wine tasting notes into embeddings using `SentenceTransformer`.
- Create an in-memory Qdrant collection to store vectors and metadata (a sketch of this indexing step follows the list).
- Perform semantic search to find wines matching the user prompt.
- Pass the search results to an LLM to generate user-friendly recommendations.
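
A minimal sketch of the indexing step, reusing the cleaned dataframe `df` from the cleaning sketch above and assuming its tasting notes live in a hypothetical `notes` column:

```python
from qdrant_client import QdrantClient
from qdrant_client.models import Distance, PointStruct, VectorParams
from sentence_transformers import SentenceTransformer

encoder = SentenceTransformer("all-MiniLM-L6-v2")

# ":memory:" runs Qdrant in-process; nothing is persisted, which suits demos
qdrant = QdrantClient(":memory:")

qdrant.create_collection(
    collection_name="top_wines",
    vectors_config=VectorParams(
        size=encoder.get_sentence_embedding_dimension(),  # 384 for all-MiniLM-L6-v2
        distance=Distance.COSINE,
    ),
)

# Embed each tasting note and store the full row as the point's payload
qdrant.upload_points(
    collection_name="top_wines",
    points=[
        PointStruct(
            id=idx,
            vector=encoder.encode(row["notes"]).tolist(),
            payload=row.to_dict(),
        )
        for idx, row in df.iterrows()
    ],
)
```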
Example query and recommendation flow (`qdrant` and `encoder` come from the indexing step above):

```python
from openai import OpenAI

client = OpenAI()  # expects OPENAI_API_KEY in the environment

user_prompt = "Suggest an amazing Malbec wine from Argentina"

# Perform vector search: embed the prompt and retrieve the 3 closest wines
hits = qdrant.search(
    collection_name="top_wines",
    query_vector=encoder.encode(user_prompt).tolist(),
    limit=3,
)

# Collect the wine metadata stored alongside each matching vector
search_results = [hit.payload for hit in hits]

# Generate a natural-language recommendation with GPT-4 Turbo,
# grounding the model in the retrieved wines
completion = client.chat.completions.create(
    model="gpt-4-turbo",
    messages=[
        {"role": "system", "content": "You are a wine specialist chatbot."},
        {"role": "user", "content": user_prompt},
        {"role": "assistant", "content": str(search_results)},
    ],
)

print(completion.choices[0].message.content)
```
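
Passing the retrieved payloads to the model as an assistant message is a simple retrieval-augmented generation (RAG) pattern: the LLM phrases the recommendation in natural language, but its suggestions stay grounded in the wines the vector search actually returned.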