Add exploratory data analysis, more data preprocessing and features, and more models#9
Open
schance995 wants to merge 18 commits intodorahacksglobal:mainfrom
Open
Add exploratory data analysis, more data preprocessing and features, and more models#9schance995 wants to merge 18 commits intodorahacksglobal:mainfrom
schance995 wants to merge 18 commits intodorahacksglobal:mainfrom
Conversation
- Cast data types to lower precision - Use HistGradientBoostingClassifier for faster classification - Printing the tree itself is no longer useful
It should never be included since the .pyc may be different between computers
- mean normalization - try all randomness tests - confusion matrices
|
currently sitting at 75-77% accuracy |
e1e6749 to
8ef37b7
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Here's @0mWh's and my solution so far. We plan to attempt all 3 UnitaryHack challenges. There are several changes, we look forward to any questions and feedback.
Notebooks
QRNG_ Classification_Main_UnitaryHack windowed.ipynbfor models with more training data fromdata/QRNG_ Classification_Main_UnitaryHack windowed_preprocessed_df_1717557318.csv.zst(generated in same notebook)QRNG_ Classification_Main_UnitaryHack.ipynbfor more models, preprocessing, and exploratory data analysis.process_logical_reduction.ipynbfor distribution analysis and statistical testingChanges so far
Best performance so far
We got 67% on one of our models, but we caution against further interpretation until we implement more robust model testing. A limitation is that we don't have a held-out test set for a fair comparison against other project submissions.
Next steps