Add comprehensive model support with XGBoost, LightGBM, and advanced … by dgenio · Pull Request #18 · dgenio/skdr-eval

dgenio · 2025-09-14T10:18:35Z

…features

Add ModelFactory for creating classifiers and regressors
Add ModelEvaluator for cross-validation and performance assessment
Add ModelSelector for hyperparameter tuning (grid/random search)
Add support for XGBoost, LightGBM, and HistGradientBoosting
Add get_model_recommendations for intelligent model selection
Add create_model_ensemble for ensemble methods
Include comprehensive error handling and validation
Add automatic model recommendations based on problem characteristics
Update init.py to export model functions
Add comprehensive test suite for all model functionality

Pull Request

📋 Description

Type of Change

🐛 Bug fix (non-breaking change which fixes an issue)
✨ New feature (non-breaking change which adds functionality)
💥 Breaking change (fix or feature that would cause existing functionality to not work as expected)
📚 Documentation update
🧹 Code refactoring (no functional changes)
⚡ Performance improvement
🧪 Test improvements
🔧 Build/CI improvements

🔗 Related Issues

Fixes #
Related to #

🧪 Testing

Test Coverage

I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes
I have added integration tests if applicable

Manual Testing

Tested locally with make check
Tested with example scripts
Tested edge cases

Test commands run:

# List the commands you used to test
make check
python examples/quickstart.py

📝 Changes Made

Code Changes

API Changes

No API changes
Backward compatible API additions
Breaking API changes (requires major version bump)

API changes:

📚 Documentation

I have updated the documentation accordingly
I have updated docstrings for new/modified functions
I have added examples if applicable
I have updated the CHANGELOG.md

✅ Checklist

Code Quality

My code follows the project's style guidelines
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
My changes generate no new warnings
I have run ruff check and ruff format
I have run mypy type checking

Testing & CI

I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes
All CI checks pass
Code coverage is maintained or improved

Documentation & Communication

I have made corresponding changes to the documentation
My commit messages follow the conventional commit format
I have updated the CHANGELOG.md if applicable

🔍 Review Notes

Focus Areas

Questions for Reviewers

📸 Screenshots/Examples

# Example usage of new feature
import skdr_eval

# Your example here

🚀 Deployment Notes

No special deployment considerations
Requires database migrations
Requires environment variable changes
Requires dependency updates

Additional Context:

…features - Add ModelFactory for creating classifiers and regressors - Add ModelEvaluator for cross-validation and performance assessment - Add ModelSelector for hyperparameter tuning (grid/random search) - Add support for XGBoost, LightGBM, and HistGradientBoosting - Add get_model_recommendations for intelligent model selection - Add create_model_ensemble for ensemble methods - Include comprehensive error handling and validation - Add automatic model recommendations based on problem characteristics - Update __init__.py to export model functions - Add comprehensive test suite for all model functionality

Copilot

Pull Request Overview

This PR adds comprehensive model support to the skdr-eval library by introducing new classes and functions for machine learning model creation, evaluation, and selection. The changes include support for additional algorithms like XGBoost, LightGBM, and HistGradientBoosting along with intelligent model recommendations and ensemble methods.

Key changes:

Added ModelFactory, ModelEvaluator, and ModelSelector classes for comprehensive model management
Implemented support for advanced models (XGBoost, LightGBM, HistGradientBoosting) with graceful fallbacks
Added intelligent model recommendation system and ensemble creation capabilities

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File	Description
src/skdr_eval/models.py	New module containing core model functionality including factory patterns, evaluation, and selection
tests/test_models.py	Comprehensive test suite covering all model functionality with error handling and integration tests
src/skdr_eval/init.py	Updated exports to include new model functions and classes

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

Copilot · 2025-09-14T10:19:07Z

src/skdr_eval/__init__.py

    "DRResult",
    "Design",
    "PairwiseDesign",
-    "__version__",
+    "assess_propensity_calibration",
+    "assess_propensity_discrimination",
    "block_bootstrap_ci",
+    "bootstrap_confidence_interval",
    "build_design",
+    "check_propensity_balance",
+    "check_propensity_overlap",
+    "chi_square_test",
+    "comprehensive_propensity_diagnostics",
+    "compute_balance_statistics",
+    "compute_propensity_log_loss",
+    "compute_propensity_statistics",
+    "create_model_ensemble",
    "dr_value_with_clip",
    "evaluate_pairwise_models",
+    "evaluate_propensity_diagnostics",
    "evaluate_sklearn_models",
    "fit_outcome_crossfit",
    "fit_propensity_timecal",
+    "generate_propensity_report",
+    "get_model_recommendations",
    "induce_policy_from_sklearn",
+    "kolmogorov_smirnov_test",
    "make_pairwise_synth",
    "make_synth_logs",
+    "mann_whitney_u_test",
+    "ModelEvaluator",
+    "ModelFactory",
+    "ModelSelector",
+    "multiple_comparison_correction",
+    "permutation_test",
+    "power_analysis",
+    "sample_size_calculation",
+    "t_test",
+    "PropensityDiagnostics",
+    "BootstrapError",
+    "ConfigurationError",
+    "ConvergenceError",
+    "DataValidationError",
+    "EstimationError",
+    "InsufficientDataError",
+    "MemoryError",
+    "ModelValidationError",
+    "OutcomeModelError",
+    "PairwiseEvaluationError",
+    "PolicyInductionError",
+    "PropensityScoreError",
+    "SkdrEvalError",
+    "VersionError",


The all list contains many imports that are not present in the import statements at the top of the file. Only the functions and classes that are actually imported should be included in all to avoid import errors.

dgenio requested a review from Copilot September 14, 2025 10:18

Copilot AI reviewed Sep 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add comprehensive model support with XGBoost, LightGBM, and advanced …#18

Add comprehensive model support with XGBoost, LightGBM, and advanced …#18
dgenio wants to merge 1 commit intodevelopfrom
feature/model-support

dgenio commented Sep 14, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Sep 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

dgenio commented Sep 14, 2025

Pull Request

📋 Description

Type of Change

🔗 Related Issues

🧪 Testing

Test Coverage

Manual Testing

📝 Changes Made

Code Changes

API Changes

API changes:

📚 Documentation

✅ Checklist

Code Quality

Testing & CI

Documentation & Communication

🔍 Review Notes

Focus Areas

Questions for Reviewers

📸 Screenshots/Examples

🚀 Deployment Notes

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Sep 14, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants