Project Goals:

We would be analyzing, exploring the different attributes affecting red and white wine quality. We would be creating a model based on to predict the quality of the wine. This data would not be used on future and/or for real life prediction.

Project description:

For this project we would exploring the different factors that affect wine quality. Some of these factors are: fixed acidity, volatile acidity, citric acid, residual sugar, pH, alcohol levels, etc. Focusing on these factors, it would help us identify and predict wine quality.

Project planning:

Planning: During this process we asked ourselves important questions about the project, and the division of task among team members. Data planning will be shown in this readme.
Acquisition: Data was acquired from data.world published by food.Raw data would be downloaded and a csv for red and white wine data set has been created which would be use to pull the data during this project.
Preparation:The red and wine datas would be combines into one dataframe, clean and prepared for exploration.Nulls were handled accordingly and quality assurance was practiced to ensure the validity of each attribute. A column to identify red and white one was created and encoded for moedeling purposes. Outliers were dropped and handled accordingly, and columns were renamed for better identification.
Exploration:
Evaluation and Modeling:
Delivery:

Initial hypotheses and/or questions you have of the data, ideas:

Does Residual Sugar Affect Wine Quality?

2.Does Chlorides Affect Wine Quality?

3.Does Total Sulfur Dioxide Affect Wine Quality?

4.Does Citric Acid Affect Red Wine Quality?

Data dictionary:

Data Set Citation: P. Cortez, A. Cerdeira, F. Almeida, T. Matos and J. Reis. Modeling wine preferences by data mining from physicochemical properties. In Decision Support Systems, Elsevier, 47(4):547-553. ISSN: 0167-9236.

Instructions on how someone else can reproduce your project and findings:

For an user to succesfully reproduce this project, they must have red and white wine data set downloaded.
The wrangle.py, explore.py and evaluate.py files must be downloaded in the same repository/folder as the final_report to run it successfully.
Once all files are download, user may run the final_report notebook.

Key findings, Recommendations, and Takeaways:

The average amount of alcohol in high quality wine is more than the average amount of alcohol in low quality wine.
The average amount of chlorides in low quality wine is greater than the average amount of chlorides in high quality wine.
The average amount of total sulfur dioxide in low quality wine is greater than the average amount of total sulfur dioxide in high quality wine.
The average amount of residual sugar in low quality wine is greater than the average amount of residual sugar in high quality wine.
We would recommend to use our current model to predict the quality of the wine, prioritzing the features selected above.

Given more time: 6.we would like to explore other features and incorporate other clusters into our model. 7. Explore other hyperparameters and different models to improve accuracy. 8. Explore binning the target variable differently or use it as a continous variable to look at regression models. 9. Collect more data such as: location of winery,type barrels used, and year of when wine grapes where harvested.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
chelly_folder		chelly_folder
keila_folder		keila_folder
.gitignore.py		.gitignore.py
Data Dictionary.png		Data Dictionary.png
README.md		README.md
explore.py		explore.py
final_report.ipynb		final_report.ipynb
model.py		model.py
wrangle.py		wrangle.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Goals:

Project description:

Project planning:

Initial hypotheses and/or questions you have of the data, ideas:

Data dictionary:

Instructions on how someone else can reproduce your project and findings:

Key findings, Recommendations, and Takeaways:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Project Goals:

Project description:

Project planning:

Initial hypotheses and/or questions you have of the data, ideas:

Data dictionary:

Instructions on how someone else can reproduce your project and findings:

Key findings, Recommendations, and Takeaways:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages