You can run the code with the following file: run_performance_comparison.py

Clone the repository:
git clone https://github.com/auringonnousu/performance_comparison_ML_models.git

Navigate to the cloned directory:
cd performance_comparison_ML_models

Run the Python script:
python run_performance_comparison.py

Or click on this Binder badge:
This project compares classification performance and run-time for the Decision Tree, Random Forest, and Gradient Boosting classifiers.
RandomOverSampler is used to balance the training set.
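As a minimal sketch, the oversampling step might look like the following, using imbalanced-learn's RandomOverSampler; the synthetic 90/10 data and variable names are illustrative, not taken from the script:

```python
from collections import Counter

from imblearn.over_sampling import RandomOverSampler
from sklearn.datasets import make_classification

# Synthetic imbalanced data stands in for the project's training split.
X_train, y_train = make_classification(
    n_samples=1000, weights=[0.9, 0.1], random_state=42
)

# Oversample the minority class so both classes are equally represented.
ros = RandomOverSampler(random_state=42)
X_balanced, y_balanced = ros.fit_resample(X_train, y_train)
print(Counter(y_balanced))  # both classes now have the same count
```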
GridSearchCV is used to find the best parameters for each model, with 5-fold cross-validation.
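For illustration, a grid search with 5-fold cross-validation could be set up as below; the Random Forest, the parameter grid, and the scoring metric are placeholder assumptions, and the actual grids live in run_performance_comparison.py:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

X, y = make_classification(n_samples=500, random_state=42)

# Illustrative grid; the grids used in the project may differ.
param_grid = {"n_estimators": [100, 200], "max_depth": [None, 10]}

search = GridSearchCV(
    estimator=RandomForestClassifier(random_state=42),
    param_grid=param_grid,
    cv=5,          # 5-fold cross-validation
    scoring="f1",
    n_jobs=-1,
)
search.fit(X, y)
print(search.best_params_, search.best_score_)
```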
The performance is evaluated on the test set.
The built-in feature importance is used to find the most important features for each model.
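A sketch of reading the built-in importances from a fitted tree-based model, assuming synthetic data and an illustrative feature count:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier

X, y = make_classification(n_samples=500, n_features=8, random_state=42)
model = GradientBoostingClassifier(random_state=42).fit(X, y)

# Rank feature indices by the model's impurity-based importances.
ranking = np.argsort(model.feature_importances_)[::-1]
for idx in ranking:
    print(f"feature {idx}: {model.feature_importances_[idx]:.3f}")
```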
The ROC AUC score is computed for each model.
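For example, the ROC AUC of a fitted classifier can be computed from its predicted probabilities on a held-out test set; the data and model below are illustrative:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

model = RandomForestClassifier(random_state=42).fit(X_train, y_train)

# ROC AUC is computed from class probabilities, not hard predictions.
proba = model.predict_proba(X_test)[:, 1]
print("ROC AUC:", roc_auc_score(y_test, proba))
```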
The models are then retrained with only the most important features and the best parameters.
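One possible way to retrain on a reduced feature set, assuming the top features are selected by built-in importance (the choice of five features and the synthetic data are illustrative):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=10, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Fit on all features first to obtain the importances.
full_model = RandomForestClassifier(random_state=42).fit(X_train, y_train)

# Keep only the top-5 features by built-in importance.
top_k = np.argsort(full_model.feature_importances_)[::-1][:5]

# Retrain on the reduced feature set and evaluate on the test split.
reduced_model = RandomForestClassifier(random_state=42).fit(X_train[:, top_k], y_train)
print("accuracy on top-5 features:", reduced_model.score(X_test[:, top_k], y_test))
```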
Steps:
- Encoding categorical features with OneHotEncoder()
- Applying RandomOverSampler() to balance the classes
- Running a pipeline per model with cross-validation
- Performing GridSearchCV() within the pipeline for each model (see the sketch after this list)
- Training each model
- Performing cross-validation on each model
- Writing the results to a DataFrame
- Visualizing the metrics for the current models
- Training each model with the best parameters and the most important features
- Evaluating each model
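The sketch below shows how these steps could fit together: an imbalanced-learn Pipeline combining OneHotEncoder and RandomOverSampler with a classifier, wrapped in GridSearchCV so that encoding and oversampling are applied only within the training folds. The data, column names, classifier, and parameter grid are invented for illustration and do not reflect the project's dataset:

```python
import pandas as pd
from imblearn.over_sampling import RandomOverSampler
from imblearn.pipeline import Pipeline
from sklearn.compose import ColumnTransformer
from sklearn.model_selection import GridSearchCV
from sklearn.preprocessing import OneHotEncoder
from sklearn.tree import DecisionTreeClassifier

# Tiny synthetic frame with one categorical and one numeric column.
df = pd.DataFrame({
    "color": ["red", "blue", "red", "green"] * 25,
    "value": range(100),
    "target": [0] * 80 + [1] * 20,
})
X, y = df[["color", "value"]], df["target"]

# One-hot encode the categorical column, pass the numeric column through.
preprocess = ColumnTransformer(
    [("onehot", OneHotEncoder(handle_unknown="ignore"), ["color"])],
    remainder="passthrough",
)

# The sampler sits inside the pipeline, so it only touches training folds.
pipe = Pipeline([
    ("encode", preprocess),
    ("oversample", RandomOverSampler(random_state=42)),
    ("model", DecisionTreeClassifier(random_state=42)),
])

search = GridSearchCV(pipe, {"model__max_depth": [3, 5, None]}, cv=5)
search.fit(X, y)
print(search.best_params_)
```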