This repo contains the full materials for my TopTal screening assignment submission.
The README page displays the directory of all the files used and created by this assignment.
toptal_assignment
│ README.md
│ requirements.txt
└───code
│ │ EDA.ipynb
│ │ Modeling.ipynb
│ │ prediction_pipeline.py
│ │ utils.py
└───data
│ │ dataset.json
│ │ processed_data.csv
│ │ result.csv
│ │ sites_encoded.csv
│ │ verify.json
└───model
│ │ Random_Forest_model.pkl
│ │ countvec.joblib
│ │ feature_engineer_pipeline.pkl
│ │ feature_selector.pkl
│ │ sites_vectorizer.pkl
Clone the project
https://github.com/GeorgeMcIntire/toptal_assignment.gitGo to the project directory
cd toptal_assignmentStart up virtual environment
virtualenv -p python3 envnameActivate virtual environment
source envname/bin/activateRun the following command in terminal to install the required packages
pip3 install -r requirements.txtMake predictions on the verify.json data.
python3 prediction_pipeline.py