Open
Conversation
…n models classes, refactoring ModelAssessment and fixing minor issues.
…ining and forecasting strategies. Other improvements.
…adilla/3W into eduardo/refactor_data_operations
…lti-class inputs with label binarization (one-hot)
…tions Refactor of data operations, trainers and models.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Overview
This PR introduces a new architecture and updates notebooks across the 3W Toolkit. It replaces the core structure with a simpler, more developer-friendly architecture, making it easier for contributors to create and integrate new components.
Given the scope of these changes, we suggest releasing this as version 3.0.0.
Key points:
dataset→preprocessing→feature_extraction→models→training→assessment→data_visualizationThis change defines version 3.0.0, making the toolkit more flexible, modular, and easier to develop with.
All notebooks in the collaboration scope were updated to reflect and demonstrate examples using the new architecture.
Summary of Changes
1. Updated Toolkit Structure
Updated package structure to improve separation of responsibilities:
assessment/clustering/core/data_loader/data_visualization/dataset/feature_extraction/metrics/models/preprocessing/reports/trainer/utils/Unified structure references across documentation
2. Unified Architecture for Contributors
One clear architecture model:
Standard way to add new components (configs + base classes)
Cleaner docstrings with explicit typing to support auto-generated documentation
3. Consistency Across Documentation and Notebooks
Final Notes
This update sets a clear and modern architecture for the 3W Toolkit:
Version 3.0.0 marks a cleaner, more scalable foundation for the toolkit.
Thank you for reviewing this delivery.
🚀🥳🎊✨🎉
By creating this pull request, I confirm that I have read and fully accept and agree with one of the Petrobras' Contributor License Agreements (CLAs):
Our CLAs are based on the Apache Software Foundation's CLAs: