IronKaggle

One day competition of Ironhack's Data Analytics bootcamp. Goal was to build a predicting model for sales that then was to be verified. Cleaned a raw dataset with data of sales from different stores, used feature engineering for feature selection and then applied two diferente models and compared the scores on both: xgboost and Random Forest Regressor. Weighted the bias / variance to decide on which to choose: chose the second.

Model later verified by the teacher on a new dataset and ended being the winner.

Technical Requirements

Data Cleaning and Manipulation: checking and dropping null values / rows / columns, dealing with duplicates, formatting and filtering data;
Combining and Structuring Data:
Data Aggregation and Filtering;
Libraries imported:
- Pandas: import, export the shark_attack.csv - baseline for the project - and manipulate data;
- matplotlib: plotting histograms to verify hypothesis;
- Numpy;
- Seaborn;
- sklearn: metrics, ensemble and model_selection.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
requirements.txt		requirements.txt
salesprediction.ipynb		salesprediction.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

IronKaggle

Technical Requirements

Resources

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

IronKaggle

Technical Requirements

Resources

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages