Skip to content

pepenunez/final-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

47 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Grocery recommender

By Pepe Nunez

[final-project] - August 2020 - Data Analytics Bootcamp, Berlin

Content

Project Description

(1) Have you ever moved to a New city and didn't know where to fo your groceries?

(2) Do you struggle to find some ingredients-products in the supermarket?

(3) Are you bored of always buying the same ingredients or are one of those that always forget about some crucial ingredient?

In any case, I'll have you covered with this Grocery Recommender. I've decided to build an tool where you can provide a list of ingredients and will compare 6 differents supermarkets (REWE, Aldi north, Aldi south, Lidl, Edeka, Kaufland) and tell you where you'll be able to find the products that you are looking and the prices. Additionally, it will recommend you additional ingredients that you might be interested in buying based on the rating of other users.

I'd like to build a tool that can help new visitors to Berlin to optimize their grocery shopping process. To do so, the tool must:

  • Find the supermarket with highest product availability and lower prices
  • Translate the ingredients to the product reference in each supermarket
  • Recommend other products that the user might be interested in buying based on their current cart.
  • Allrecipes (Kaggle) - https://www.kaggle.com/elisaxxygao/foodrecsysv1 - 52,821 recipes from 27 categories posted between 2000 and 2018. To ensure the quality of the data, we filter recipes that do not have images and that contain repeated ingredients and zero reviews. Then we obtain raw_data, including 1,160,267 users, 49,698 recipes with 38,131 ingredients and 3,794,003 interactions

  • Supermarkets (Ironhack former student) - https://github.com/kklichowski/Third-Project - A former Student from the Data Analytics Bootcamp successfuly scraped https://www.supermarktcheck.de/ to obtain information about the products offered in REWE, Aldi north, Aldi south, Lidl, Edeka, Kaufland. The data includes:

  • Name of ingredient (German)

  • Price

  • Unit

  • Pack size

  • Supermarket

  • Comparable price

Workflow

  • 00_demo.ipynb
  • 01_allrecipes-cleaning.ipynb - In this notebook I explore, clean and prepare the data from allrecipes.
  • 02_supermarkets-cleaning.ipynb - In this notebook I explore, clean and prepare the data from the supermarkets.
  • 03_allrecipes-modeling.ipynb
  • 04_supermarkets-modeling.ipynb
  • 10_supermarkets-translation-dynamic.ipynb
  • data-insights.ipynb
  • sql_connection.ipynb

About

Work in progress

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors