Recommendation System for Amazon Products with enhanced BERT classifier

Description

We download Amazon product review data from https://amazon-reviews-2023.github.io/, load it into a Postgres database and develop a recommender system. The project covers the following steps:

embed reviews using either a pre-trained BERT embedding model or TF-IDF
build a sentiment classifier (comparing random forests, gradient boosting and BERT)
create an interactive dashboard for data visualization and model performance analysis
implement the recommendation system
wrap up the app with FastAPI routes and dockerize it
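
As a rough illustration of the TF-IDF embedding step, and of how review vectors could feed a similarity-based recommendation, here is a minimal sketch that uses only the Python standard library. The function names (tokenize, tfidf_vectors, cosine, most_similar) and the sample reviews are hypothetical; this is not the project's implementation, which may rely on BERT embeddings or a library vectorizer instead.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a review and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict token -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each token.
    df = Counter(tok for toks in tokenized for tok in set(toks))
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        total = len(toks) or 1
        # Smoothed IDF keeps a positive weight for tokens found in every document.
        vectors.append({
            tok: (cnt / total) * (math.log((1 + n_docs) / (1 + df[tok])) + 1.0)
            for tok, cnt in counts.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * v.get(tok, 0.0) for tok, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def most_similar(query_index, vectors):
    """Index of the document closest to vectors[query_index], excluding itself."""
    scores = [
        (cosine(vectors[query_index], vec), i)
        for i, vec in enumerate(vectors)
        if i != query_index
    ]
    return max(scores)[1]


# Hypothetical sample reviews, for illustration only.
reviews = [
    "Great wireless headphones with clear sound",
    "The sound of these headphones is clear and loud",
    "This blender broke after two weeks",
]
vectors = tfidf_vectors(reviews)
closest = most_similar(0, vectors)  # index of the review most similar to the first one
```

Sparse dictionaries keep the sketch short; at the scale of the full dataset a dedicated vectorizer and an approximate nearest-neighbour index would likely be a better fit.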

Pipeline Snapshot

Wrapped-up workflow with Docker

First, make sure you have Docker installed on your machine. If you prefer to run every step yourself, without Docker, skip to the next section.

Run the command make, which executes:

docker compose build
docker compose up

These commands build three Docker images and start the corresponding containers.

db: an image for the Postgres database
migrator: an image with alembic utilities enabling database versioning
app: an image with Python packages and code for running the FastAPI application and the Dash dashboard

You should see the following appear in your terminal.

[+] Running 4/4
 ✔ Network recommendation_system_default       Created
 ✔ Container recommendation_system-db-1        Healthy
 ✔ Container recommendation_system-migrator-1  Exited
 ✔ Container recommendation_system-app-1       Started

Once this is done, you can access

the Dash dashboard by visiting http://127.0.0.1:8050/
the FastAPI application by visiting http://127.0.0.1:8000/docs
the MLflow tracking service by visiting http://127.0.0.1:5001/
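
To confirm that the three services listed above are answering, a small readiness check along these lines may help. It is only a sketch: the helper names is_up and check_services are hypothetical and not part of the repository, and the URLs are simply the ones given above.

```python
import http.client
import urllib.error
import urllib.request

# URLs taken from the list above.
SERVICES = {
    "dashboard": "http://127.0.0.1:8050/",
    "fastapi": "http://127.0.0.1:8000/docs",
    "mlflow": "http://127.0.0.1:5001/",
}


def is_up(url, timeout=2.0, opener=urllib.request.urlopen):
    """Return True if an HTTP GET on `url` answers with a status below 400."""
    try:
        with opener(url, timeout=timeout) as response:
            return response.status < 400
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # Connection refused, timeout, HTTP error status or malformed reply.
        return False


def check_services(services=SERVICES, opener=urllib.request.urlopen):
    """Map each service name to True (reachable) or False."""
    return {name: is_up(url, opener=opener) for name, url in services.items()}
```

Calling check_services() once the containers are running returns a dictionary mapping each service name to a boolean. The opener parameter exists only so that the check can be exercised without a network.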

Step-by-step workflow

Assume your project directory is ROOT_DIR := PRE_ROOT_DIR / recommendation_system, that is, the repository lives in a folder named recommendation_system inside PRE_ROOT_DIR.

Set up a Postgres database and specify the associated credentials in the config.toml file.
Create a virtual environment named venv and install the project requirements in it.

make venv

Activate your virtual environment.

source venv/bin/activate

Download Amazon datasets into your project (run from PRE_ROOT_DIR).

python recommendation_system download datasets

Load downloaded datasets to your Postgres database (run from PRE_ROOT_DIR).

python recommendation_system load datasets
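
The loading step can be pictured as reading gzip-compressed JSON Lines files and grouping the records into batches for bulk insertion. The sketch below makes several assumptions: the field names in COLUMNS are a subset of what the dataset site documents for reviews, and the helpers read_reviews and batched are hypothetical. It does not reproduce what the load command actually does.

```python
import gzip
import io
import json
from itertools import islice

# Subset of review fields; assumed from the dataset's published schema.
COLUMNS = ("user_id", "parent_asin", "rating", "title", "text", "timestamp")


def read_reviews(fileobj):
    """Yield one tuple per review from a gzip-compressed JSON Lines stream."""
    with gzip.open(fileobj, mode="rt", encoding="utf-8") as lines:
        for line in lines:
            if line.strip():
                record = json.loads(line)
                # Missing fields become None, which database drivers map to NULL.
                yield tuple(record.get(col) for col in COLUMNS)


def batched(rows, size):
    """Group rows into lists of at most `size` items for bulk inserts."""
    iterator = iter(rows)
    while batch := list(islice(iterator, size)):
        yield batch


# In-memory stand-in for a downloaded .jsonl.gz file (made-up records).
sample = "\n".join(json.dumps(r) for r in [
    {"user_id": "u1", "parent_asin": "B001", "rating": 5.0,
     "title": "Great", "text": "Works well", "timestamp": 1},
    {"user_id": "u2", "parent_asin": "B002", "rating": 2.0,
     "text": "Broke quickly", "timestamp": 2},
    {"user_id": "u3", "parent_asin": "B001", "rating": 4.0,
     "title": "Good", "text": "Solid value", "timestamp": 3},
]).encode("utf-8")

batches = list(batched(read_reviews(io.BytesIO(gzip.compress(sample))), size=2))
```

Each batch is a list of tuples in COLUMNS order, a shape that a database driver's bulk-insert call (for example a cursor's executemany) typically accepts.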

Launch the MLflow service by running the following.

mlflow server --port="5001" --host="127.0.0.1"

Launch the FastAPI application, which exposes the routes called by the dashboard (run from ROOT_DIR).

uvicorn src.fastapi_app.main:app

Launch the Dash dashboard and check the result at http://localhost:8050 in your favorite browser (run from ROOT_DIR).

python src/dashboard/dashboard.py

Basic structure

├── LICENSE
│
├── config files (.env, .ini, ...)
│
├── README.md
│
├── docs/
│
├── requirements.txt
│
├── __main__.py
│
├── src/
│     ├── __init__.py
│     └── _version.py
│
└── tests/
