New notebook series: ValidMind for model validation #348

Merged

validbeck merged 75 commits into main from beck/sc-9378/create-validmind-for-model-validation-series on Apr 9, 2025

Conversation

validbeck (Collaborator) commented Apr 8, 2025

Internal Notes for Reviewers

9378

ValidMind for model validation

A brand new series combining the strategies I learned while editing Validate an application scorecard model from @MichaelIngvarRoenning with the model that you "build" in the ValidMind for model development series. This series mirrors the structure of the existing development series and is built to be incorporated into our updated validation training path:

[Screenshots: Model development series (Screenshot 2025-04-08 at 2 58 23 PM) · Updated validation training path (updated-validator-training-paths)]

🚨 THESE NOTEBOOKS BUILD ON EACH OTHER, BUT EACH ONE IS DESIGNED TO RUN INDEPENDENTLY. 🚨

Because Jupyter Notebooks are closed environments, the "Setting up" section of each notebook needs to be repeated: later cells rely on variables and outputs set up by the initial cells. I edited these sections down to be as streamlined as possible, but this is the same strategy we had to employ in the development series notebooks due to the limitations of notebooks.
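
For context, that repeated setup block boils down to a few lines like the following (a minimal sketch with placeholder credentials, not the notebooks' exact cells):

```python
# Sketch of the repeated setup cells; in practice you paste the snippet from
# your model's "Getting Started" page in the ValidMind Platform.
import validmind as vm

vm.init(
    api_host="https://api.prod.validmind.ai/api/v1/tracking",  # placeholder host
    api_key="...",     # placeholder
    api_secret="...",  # placeholder
    model="...",       # placeholder model identifier
)
```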

1 — Set up ValidMind for validation

LIVE PREVIEW

A quick conceptual overview: this introduces users to ValidMind as a validator and walks them through setting up a model for validation, previewing templates, reports, and so on.
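
As a taste of what this notebook covers, previewing the model's assigned report template is a one-liner once vm.init() has run (per the setup sketch above):

```python
# Renders the structure of the model's assigned template inline in the
# notebook, so a validator can see which sections the report expects
# before running any tests.
vm.preview_template()
```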

2 — Start the validation process

LIVE PREVIEW

  • Here the validator performs some data quality tests, same as in the development series — the only difference is that they also learn to run some comparison tests (sketched after this list).

  • SOMEONE SHOULD DOUBLE CHECK:

    • Do the comparison tests make sense / are they accurate examples?
    • Does the validation report section that the user is directed to insert the ClassImbalance test results into make sense?
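
For anyone double-checking, a minimal sketch of the comparison pattern (the dataset input IDs are my assumptions, not necessarily the notebook's exact IDs):

```python
import validmind as vm

# Running the same data quality test across two datasets via input_grid
# produces a side-by-side comparison instead of a single-dataset result.
result = vm.tests.run_test(
    "validmind.data_validation.ClassImbalance",
    input_grid={
        "dataset": ["raw_dataset", "preprocessed_dataset"],  # assumed input IDs
    },
)
result.log()  # sends the comparison output to the validation report
```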

3 — Develop a potential challenger model

LIVE PREVIEW

  • Here is where the notebooks diverge from the development series: instead of building a model, we import the champion logistic regression model created by the development series and evaluate it against a challenger (see the sketch after this list).

  • SOMEONE SHOULD DOUBLE CHECK:

    • Is a random forest classification model a reasonable challenger here?
    • Do the model validation tests make sense here, and are they accurate?
    • LOGGING A FINDING: I noticed that the MinimumAccuracy test doesn't actually pass for our champion, which makes it a neat inclusion point for being introduced to findings! I don't know if that's just an inaccurate assumption on my part, so SOMEONE PLEASE RUN THIS NOTEBOOK AND LET ME KNOW IF YOU GET THE SAME RESULTS / IF THIS IS A REASONABLE EXAMPLE.
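
To make the champion/challenger setup concrete, here's a self-contained sketch of the pattern. The notebooks use the Bank Customer Churn data and the pickled lr_model_champion.pkl; the synthetic data and inline logistic regression below are stand-ins so the snippet runs on its own, and the input IDs are assumed:

```python
import pandas as pd
import validmind as vm
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the notebooks' churn data and pickled champion.
X, y = make_classification(n_samples=1000, n_features=10, random_state=42)
x_train, x_test, y_train, y_test = train_test_split(X, y, random_state=42)

champion = LogisticRegression(max_iter=1000).fit(x_train, y_train)
challenger = RandomForestClassifier(random_state=42).fit(x_train, y_train)

vm_champion = vm.init_model(champion, input_id="log_reg_model_champion")
vm_challenger = vm.init_model(challenger, input_id="rf_model_challenger")

# Wrap the test split as a ValidMind dataset and attach both models' predictions.
test_df = pd.DataFrame(x_test, columns=[f"f{i}" for i in range(10)])
test_df["target"] = y_test
vm_test_ds = vm.init_dataset(dataset=test_df, input_id="test_dataset", target_column="target")
vm_test_ds.assign_predictions(model=vm_champion)
vm_test_ds.assign_predictions(model=vm_challenger)

# The MinimumAccuracy test called out above: running it for both models makes
# the champion's failure (the finding) visible next to the challenger's result.
vm.tests.run_test(
    "validmind.model_validation.sklearn.MinimumAccuracy",
    input_grid={"dataset": [vm_test_ds], "model": [vm_champion, vm_challenger]},
).log()
```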

4 — Finalize validation and reporting

LIVE PREVIEW

  • Here we include the same custom test the development series did, just to walk the user through the process — the only difference is that the custom test is run for both models instead of just the champion (sketched after this list).

  • SOMEONE SHOULD DOUBLE CHECK:

    • Is the way the custom test is run for both models accurate / are these still good examples?
    • The "Verify test runs" section I just stole from Michael's notebook and modified with our datasets/models — do these tests still make sense as examples / are they accurate? 🙏🏻
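
A hypothetical version of the two-model custom test pattern, reusing names from the sketch above. The test ID, function body, and plotting details are illustrative, following ValidMind's @vm.test decorator convention rather than the notebook's exact code:

```python
import matplotlib.pyplot as plt
import validmind as vm
from sklearn.metrics import ConfusionMatrixDisplay, confusion_matrix


@vm.test("my_custom_tests.ConfusionMatrix")
def custom_confusion_matrix(dataset, model):
    """Confusion matrix comparing labels against a model's predictions."""
    cm = confusion_matrix(dataset.y, dataset.y_pred(model))
    fig, ax = plt.subplots()
    ConfusionMatrixDisplay(cm).plot(ax=ax)
    plt.close(fig)  # return the figure for logging rather than displaying it
    return fig


# Passing both models in the input grid runs the custom test once per model,
# which is the only difference from the development series version.
vm.tests.run_test(
    "my_custom_tests.ConfusionMatrix",
    input_grid={"dataset": [vm_test_ds], "model": [vm_champion, vm_challenger]},
).log()
```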

ValidMind for model development

  • Upon suggestion by @nrichers, I've renumbered these notebooks to be less "opaque", as not everyone worldwide is familiar with the university-course numbering structure.

  • Quick qualitative edits as I "validated" the model built in this series. Most notably, the "Train simple logistic regression model" section in notebook 2 got a quick edit: there was some repetition in the code, and the way the dataset split was performed was being flagged during validation for an incompatible dataset structure. Refer to the Slack conversation for context.

[Screenshots: Before (Screenshot 2025-04-08 at 2 54 26 PM) · After (Screenshot 2025-04-08 at 2 55 15 PM)]

@cachafla helped me out with this adjustment, so it should pass muster, but flagging it here just in case.
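
For readers without access to the Slack thread, my read of the general shape of the fix (an assumption on my part, not the exact diff from the screenshots) is to split the DataFrame once with the target column left in place, so vm.init_dataset sees a consistent structure for both splits:

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Toy frame standing in for the Bank Customer Churn data.
df = pd.DataFrame({"CreditScore": [600, 700, 650, 720], "Exited": [1, 0, 0, 1]})

# Split the DataFrame once, keeping the target column intact, so both splits
# share one structure when passed to vm.init_dataset(..., target_column="Exited").
train_df, test_df = train_test_split(df, test_size=0.25, random_state=42)
```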

External Release Notes

Check out our new introductory series of notebooks tailored to model validators — ValidMind for model validation:

  • 1 — Set up ValidMind for validation
  • 2 — Start the validation process
  • 3 — Develop a potential challenger model
  • 4 — Finalize validation and reporting

These new notebooks break down the end-to-end model validation process with ValidMind, based on common scenarios. Designed to introduce you to basic ValidMind concepts, these interactive notebooks get you familiar with tasks such as running and logging data quality, performance, comparison, and other types of tests; developing potential challenger models; and working with validation report tools. After you've completed your learning journey with these notebooks, you'll have a sample validation report ready to go.

@validbeck validbeck added the documentation (Improvements or additions to documentation) and highlight (Feature to be curated in the release notes) labels Apr 8, 2025
@validbeck validbeck self-assigned this Apr 8, 2025
@validbeck validbeck requested a review from cachafla April 8, 2025 22:12
@validbeck validbeck requested a review from juanmleng April 8, 2025 22:12
nrichers (Collaborator) left a comment

@validbeck thank you, I think this new numbering scheme is much simpler and it's great to see these validation notebooks in the tutorials/ folder! 🚀

I will let others address your questions related to SOMEONE SHOULD DOUBLE CHECK, but the format changes and the content I read through — and quite enjoyed reading through — LGTM.

MichaelIngvarRoenning left a comment

Personally, I would merge the first and second notebooks. I don't think it makes sense to have a separate notebook for setting up the library; I think it can be merged with notebook 2.

The rest looks great!

validbeck (Collaborator, Author)

> Personally, I would merge the first and second notebooks. I don't think it makes sense to have a separate notebook for setting up the library; I think it can be merged with notebook 2.

Thanks Michael! We're aiming to align this with the developer notebook series that we build on for training (which has 4 modules and one in-depth platform introductory module), so I think it makes more sense to keep the 1st notebook light — but good to know the rest looks good!

github-actions bot (Contributor) commented Apr 9, 2025

PR Summary

This pull request introduces several enhancements to the ValidMind model validation notebooks. The key changes include:

  1. New Model Validation Notebooks: A new series of four introductory notebooks for model validation has been added. These notebooks guide users through setting up the ValidMind Library for validation, starting the model validation process, developing potential challenger models, and finalizing validation and reporting.

  2. Custom Test Implementation: The notebooks now include sections on implementing custom tests, including creating confusion matrix plots and using external test providers. This allows users to extend the default tests provided by ValidMind with their own custom tests (see the provider sketch after this list).

  3. Improved Documentation and Instructions: The notebooks have been updated with detailed instructions and explanations, including how to log test results, add findings, and assess compliance within the ValidMind Platform.

  4. Sample Data and Models: The notebooks utilize a sample dataset (Bank Customer Churn Prediction) and a sample logistic regression model (lr_model_champion.pkl) to demonstrate the validation process.

  5. Version Update: The version of the ValidMind Library has been updated from 2.8.17 to 2.8.18 in the pyproject.toml and __version__.py files.
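
A minimal sketch of the external test provider registration pattern (the my_tests folder name comes from the test suggestions below; the rest is assumed):

```python
import validmind as vm
from validmind.tests import LocalTestProvider

# Point ValidMind at a local folder of custom test modules and register it
# under a namespace so its tests are addressable by ID.
provider = LocalTestProvider("my_tests")
vm.tests.register_test_provider(namespace="my_tests", test_provider=provider)

# Tests in that folder can then be run by their namespaced ID, e.g.:
# vm.tests.run_test("my_tests.ConfusionMatrix", inputs={...})
```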

Test Suggestions

  • Run the new model validation notebooks to ensure they execute without errors.
  • Verify that the custom test implementation correctly logs results to the ValidMind Platform.
  • Check that the instructions for setting up and using the ValidMind Library are clear and accurate.
  • Test the functionality of the external test provider by running tests from the my_tests directory.
  • Ensure that the sample dataset and model are correctly loaded and used in the notebooks.

@validbeck validbeck merged commit 6120a89 into main Apr 9, 2025
6 checks passed
@validbeck validbeck deleted the beck/sc-9378/create-validmind-for-model-validation-series branch April 9, 2025 16:37

Labels

documentation (Improvements or additions to documentation), highlight (Feature to be curated in the release notes)



4 participants