DEFCON

This repository contains a full machine-learning pipeline for training a model, generating predictions, and producing a submission.csv file for Kaggle.

1. Requirements

Python 3.9+
Git
A terminal (Linux, macOS, or Windows PowerShell)

2. Clone the repository

git clone https://github.com/simy46/DEFCON.git
cd DEFCON

3. Venv

Linux / macOS

python3 -m venv venv
source venv/bin/activate

Windows

python -m venv venv
venv\Scripts\Activate

4. Dependencies

pip install -r requirements.txt

5. Config file structure

Each config file has three parts: model, preprocessing, and hyperoptimization.

5.1. model

Defines models base hyperparameters.
When running hyperopt:

parameters set to None fall back to the values here
the final *_best.yaml overwrites this section with optimized values

5.2. preprocessing

Controls all preprocessing steps.
Each step has enabled: true/false.
Only steps with enabled: true are applied.
This structure is identical for every model.

5.3. hyperoptimization

Enables hyperparameter search (grid, random, or optuna).
Only parameters defined here are searched.
Parameters set to None keep their value from the model section.

1. Running the main pipeline

The pipeline is controlled through a YAML config file.

python main.py --config config/model_rt_best.yaml

Other available configs are listed on /config folder. The command that we gave is to run the same configs as our submission.

2. What the script does

Loads the YAML config
Loads training and test data
Applies preprocessing (scaling, PCA, etc.)
Trains the model defined in the config
Generates predictions on the test set
Saves a submission file in submissions/submission_.csv

3. Output

submissions/submission_2025-01-04_14-32-10.csv

1. Run hyperparameter search

This script searches for the best hyperparameters and generates a new optimized config file.

python find_best_params.py --config config/model_<name>_best.yaml

Other available configs are listed on /config folder.

2. What the script does

Loads the YAML config
Loads training and test data
Applies preprocessing (same pipeline as the main script)
Builds the model and its search space
Runs hyperparameter search (Grid Search, Random Search, or Optuna)
Logs the best score and best parameters
Writes a new file: config/model_rt_best.yaml with the updated hyperparameters

3. Output

config/model_<name>_best.yaml

After generating this file, use it on the main.py pipeline

python main.py --config config/model_<name>_best.yaml

Name		Name	Last commit message	Last commit date
Latest commit History 120 Commits
config		config
data		data
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
data_exploration.py		data_exploration.py
final.ipynb		final.ipynb
find_best_params.py		find_best_params.py
main.py		main.py
requirements.txt		requirements.txt
submission.csv		submission.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DEFCON

1. Requirements

2. Clone the repository

3. Venv

Linux / macOS

Windows

4. Dependencies

5. Config file structure

5.1. model

5.2. preprocessing

5.3. hyperoptimization

1. Running the main pipeline

2. What the script does

3. Output

1. Run hyperparameter search

2. What the script does

3. Output

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DEFCON

1. Requirements

2. Clone the repository

3. Venv

Linux / macOS

Windows

4. Dependencies

5. Config file structure

5.1. model

5.2. preprocessing

5.3. hyperoptimization

1. Running the main pipeline

2. What the script does

3. Output

1. Run hyperparameter search

2. What the script does

3. Output

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages