notebooks: Quickstart for model validation by validbeck · Pull Request #376 · validmind/validmind-library

validbeck · 2025-05-22T17:49:48Z

Pull Request Description

sc-10339

What

There is a net-new notebook under notebooks/quickstart/: Quickstart for model validation

This notebook is a companion notebook to our existing "Quickstart for model documentation," and goes over the basics of getting started with model validation with the ValidMind Library with the idea that the model you're validating was created using the model documentation quickstart.

Why

Our validator resources are currently very sparse. This is also a step towards retooling our undeveloped "Get Started" section in the documentation.

How to Test

Pull down this PR: gh pr checkout 376
Open the "Quickstart for model validation" notebook: /notebooks/quickstart/quickstart_model_validation.ipynb
Follow the instructions in the notebook to make sure everything runs correctly, and content is accurate as described.

Pull Request Dependencies

Tip

Refer to the deployment notes below.

External Release Notes

Want to get started with validating models with the ValidMind Library? Check out our brand new Quickstart for model validation notebook:

Learn the basics of using ValidMind to validate models as part of a model validation workflow.
Set up the ValidMind Library in your environment, and independently audit data quality adjustments and a proposed champion model using ValidMind tests for a binary classification model.

Deployment Notes

Changes to the notebooks will be cherry-picked into the documentation repo with this branch when changes are approved validmind-library side via this PR: validmind/documentation#731

Breaking Changes

n/a

Screenshots/Videos (Frontend Only)

n/a

Checklist

Areas Needing Special Review

Important

I took some creative license with the following sections, so someone should check if the examples are relevant, accurate, and properly described:

Running data quality tests > Run data comparison tests
Running model evaluation tests > Run model performance tests, Run diagnostic tests, & Run feature importance tests

Run data comparison tests

Check if the explanatory comments on why we compare the two different sets of paired datasets is accurate, and if these two comparisons are in fact relevant and demonstrative.

Run model performance tests

Check if the lead-in text for why we use the testing dataset for our performance tests is relevant and accurate.

Run diagnostic tests

Check if the lead-in text for why we use the training and testing datasets for our diagnostic tests is relevant and accurate.

Run feature importance tests

Check if the lead-in text for why we use the testing dataset for our feature importance tests is relevant and accurate.

Additional Notes

I also adjusted the following sections in these notebooks as I noticed they were incomplete/out of date:

Validate an application scorecard model

Validate an application scorecard model: Setting up > Assign validator credentials

This was missing the update where you also have to remove yourself as a model owner. Remedied:

Finalize testing and documentation (ValidMind for model development)

Finalize testing and documentation

The next steps section needed some TLC in comparison to the newer validation series, so I spruced it up:

Developing challenger models (ValidMind for model validation)

Developing a potential challenger model: Running model evaluation tests

Since I added more context to why we use certain datasets in the quickstart, I added the same explanations in this introductory notebook as well under the Running model evaluation tests sub-sections:

Run model performance tests
Run diagnostic tests
Run feature importance tests

Note

Refer also to the "Areas Needing Special Review" section above.

FInalize testing and reporting (ValidMind for model validation)

FInalize testing and reporting

Tidied up the Next steps section as well here:

Co-authored-by: Lois Ansah <133300328+LoiAnsah@users.noreply.github.com>

github-actions · 2025-05-27T17:04:53Z

PR Summary

This pull request introduces several enhancements and additions to the model validation documentation and quickstart guides within the project. Key changes include:

.gitignore Update: Added a new entry to ensure that the xgboost_model_champion.pkl file in the notebooks/quickstart directory is not ignored, allowing it to be included in version control.
Notebook Enhancements:
- Model Validation Quickstart: A comprehensive new notebook (quickstart_model_validation.ipynb) has been added. This notebook provides a step-by-step guide for using the ValidMind Library to validate models, including setting up the environment, importing datasets, running data quality tests, and evaluating model performance.
- Documentation Improvements: Several existing notebooks have been updated to improve clarity and guidance on using the ValidMind Platform. This includes more detailed instructions on removing oneself as a model owner and developer, and adding oneself as a validator.
- Model Development and Validation Tutorials: Enhanced the documentation with additional guidance on running and logging tests, inserting test results, and collaborating with stakeholders using the ValidMind Platform.
Code and Textual Corrections:
- Corrected minor textual errors and improved the clarity of instructions across various notebooks.
- Enhanced explanations of concepts such as overfitting, robustness, and stability in model evaluation.

These changes aim to improve the user experience and provide clearer guidance for users working with the ValidMind Library and Platform.

Test Suggestions

Run the new quickstart notebook to ensure all steps execute without errors.
Verify that the xgboost_model_champion.pkl file is correctly included in version control and accessible in the quickstart notebook.
Test the updated instructions for removing and adding roles in the model validation process to ensure they are clear and accurate.
Check the enhanced documentation for clarity and completeness, especially the new sections on collaboration and test result logging.
Ensure that all links to external resources and documentation are valid and lead to the correct pages.

Co-authored-by: Lois Ansah <133300328+LoiAnsah@users.noreply.github.com>

validbeck · 2025-05-27T17:22:25Z

@LoiAnsah For the record, this is what happens when the JSON is incorrect and the notebook is corrupted... ;)

Can you think about how if this were your notebook, how you would go about fixing it?

LoiAnsah · 2025-05-27T17:31:59Z

@LoiAnsah For the record, this is what happens when the JSON is incorrect and the notebook is corrupted... ;)

Can you think about how if this were your notebook, how you would go about fixing it?

validbeck · 2025-05-27T17:33:41Z

@LoiAnsah, why did you close this PR? In general, you shouldn't be closing or merging other people's PRs on their behalf, especially without communication.

LoiAnsah · 2025-05-27T17:34:59Z

@validbeck I’d check my logs and switch back to the one just before the current one.

LoiAnsah · 2025-05-27T17:36:04Z

@validbeck Apoligies, I closed it by mistake. I was trying to reply to your comment. I select quote reply.

validbeck · 2025-05-27T17:39:24Z

I’d check my logs and switch back to the one just before the current one.

In this case, this is not the right approach — you want to apply the suggested changes. The "roll-back" method is only if you don't want to retain the changes and want to revert to known working version and start fresh. You will encounter many situations like this, where you will need to evaluate on a case-by-case basis how to best approach fixing things. Rolling back is not the only answer, especially if you want to retain later work.

What I did was:

Reopen the notebook in the text view
Locate the incorrect syntax lines (with a help of a formatter like I showed you yesterday)
Fix the incorrect syntax lines

We can go over this together in a session because I want you to interact with Jupyter Notebooks under the hood and what it looks like. In preparation, please:

Pull down the latest version of this PR branch
Roll back to this commit, with the corruption: 4a8e33fbad34628b07bcb50c0d0897bbffe1f3d4

validbeck · 2025-05-27T18:08:21Z

@juanmleng @LoiAnsah I've either committed the suggestions and edited the surrounding context to match the changes, or left explanations via a comment as to why the suggestions weren't applied. Can either of you please take another look, and approve if it looks good enough? 🙏🏻

notebooks/quickstart/quickstart_model_validation.ipynb

github-actions · 2025-05-27T19:50:57Z

PR Summary

This pull request introduces several enhancements and bug fixes to the model validation and documentation notebooks within the project. Key changes include:

.gitignore Update: Added a new entry to ensure that the xgboost_model_champion.pkl file in the notebooks/quickstart directory is not ignored, allowing it to be tracked by Git.
Notebook Enhancements:
- Model Validation Notebooks: Added detailed instructions and clarifications on the steps involved in model validation, including setting up the environment, running tests, and logging results to the ValidMind Platform. The changes improve the clarity and usability of the notebooks for users new to the ValidMind Library.
- Model Documentation Notebooks: Expanded the guidance on working with model documentation, including running additional tests, inserting test results, and collaborating with stakeholders. These enhancements aim to streamline the documentation process and improve collaboration.
New Quickstart Notebook: Introduced a new notebook quickstart_model_validation.ipynb that provides a comprehensive guide to using the ValidMind Library for model validation. This notebook covers importing datasets, running data quality tests, importing and initializing models, and conducting various validation tests.
Textual Improvements: Made several textual improvements across multiple notebooks to enhance readability and provide more context to the users. This includes rephrasing instructions, adding explanations for key concepts, and improving the flow of the content.

These changes collectively aim to improve the user experience and effectiveness of the model validation and documentation process using the ValidMind Library.

Test Suggestions

Verify that the new entry in .gitignore correctly tracks the xgboost_model_champion.pkl file.
Run the updated model validation notebooks to ensure all steps execute without errors.
Check that the new quickstart_model_validation.ipynb notebook provides a clear and comprehensive guide for new users.
Test the logging of test results to the ValidMind Platform to ensure it functions as expected.
Review the textual changes for clarity and accuracy in conveying the intended instructions.

validbeck added 25 commits May 21, 2025 10:17

Draft notebook for model validation quickstart

95bd0fd

Validator intro

e8a46eb

Headings

a1e1746

Champion model export for validation quickstart

bc75cc5

Verify data quality WIP

f535c26

Save point

06923fd

Save point

a6438b0

Save point

2482178

Edit

9863af1

Editing test example

d81a049

Data comparison tests

8214ace

Data comparison tests edit

9eda2cb

Import champion wip

b75ae8b

Modified validator credentials for app scorecard

8d0549a

Performance tests WIP

388ae20

Performance tests edit

4e3ad3b

Performance tests edit2

83e4743

Diagnostic test WIP

f94f06d

Feature importance tests WIP

f6ab1bd

Editing...

2868b51

More editing

15f12bf

ToC

b65ab59

Editing

47a66ad

More context to validation series

1a80c53

Added extended Next steps to model development series

e3967b6

validbeck self-assigned this May 22, 2025

validbeck added the highlight Feature to be curated in the release notes label May 22, 2025

validbeck marked this pull request as ready for review May 22, 2025 18:47

validbeck requested review from LoiAnsah and juanmleng May 22, 2025 18:48

validbeck and others added 6 commits May 27, 2025 10:01

Update notebooks/quickstart/quickstart_model_validation.ipynb

5c2586a

Co-authored-by: Lois Ansah <133300328+LoiAnsah@users.noreply.github.com>

Update notebooks/quickstart/quickstart_model_validation.ipynb

e614fb5

Co-authored-by: Lois Ansah <133300328+LoiAnsah@users.noreply.github.com>

Update notebooks/quickstart/quickstart_model_validation.ipynb

3107dd4

Co-authored-by: Lois Ansah <133300328+LoiAnsah@users.noreply.github.com>

Update notebooks/quickstart/quickstart_model_validation.ipynb

fbb4aee

Co-authored-by: Lois Ansah <133300328+LoiAnsah@users.noreply.github.com>

Update notebooks/quickstart/quickstart_model_validation.ipynb

339d696

Co-authored-by: Lois Ansah <133300328+LoiAnsah@users.noreply.github.com>

Update notebooks/quickstart/quickstart_model_validation.ipynb

b397956

Co-authored-by: Lois Ansah <133300328+LoiAnsah@users.noreply.github.com>

Update notebooks/quickstart/quickstart_model_validation.ipynb

4a8e33f

Co-authored-by: Lois Ansah <133300328+LoiAnsah@users.noreply.github.com>

Fixing JSON errors from suggestions

f5b4a85

LoiAnsah closed this May 27, 2025

validbeck reopened this May 27, 2025

validbeck added 3 commits May 27, 2025 10:46

Fixing context around Juan's suggestions

e5d98af

Removing unneeded validation dataset initialization

88396e9

Proofreading Ama's suggestions

6248385

validbeck requested review from LoiAnsah and juanmleng May 27, 2025 18:07

juanmleng reviewed May 27, 2025

View reviewed changes

notebooks/quickstart/quickstart_model_validation.ipynb Show resolved Hide resolved

Readding the validation dataset lol oops

4bf5266

validbeck requested a review from juanmleng May 27, 2025 19:50

juanmleng approved these changes May 27, 2025

View reviewed changes

validbeck merged commit d596838 into main May 28, 2025
7 checks passed

validbeck deleted the beck/sc-10339/create-code-samples-notebook-quickstart-for branch May 28, 2025 17:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

notebooks: Quickstart for model validation#376

notebooks: Quickstart for model validation#376
validbeck merged 44 commits intomainfrom
beck/sc-10339/create-code-samples-notebook-quickstart-for

validbeck commented May 22, 2025 •

edited

Loading

Uh oh!

github-actions bot commented May 27, 2025

Uh oh!

validbeck commented May 27, 2025

Uh oh!

LoiAnsah commented May 27, 2025

Uh oh!

validbeck commented May 27, 2025

Uh oh!

LoiAnsah commented May 27, 2025

Uh oh!

LoiAnsah commented May 27, 2025

Uh oh!

validbeck commented May 27, 2025

Uh oh!

validbeck commented May 27, 2025

Uh oh!

Uh oh!

github-actions bot commented May 27, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

validbeck commented May 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Description

What

Why

How to Test

Pull Request Dependencies

External Release Notes

Deployment Notes

Breaking Changes

Screenshots/Videos (Frontend Only)

Checklist

Areas Needing Special Review

Run data comparison tests

Run model performance tests

Run diagnostic tests

Run feature importance tests

Additional Notes

Validate an application scorecard model

Finalize testing and documentation (ValidMind for model development)

Developing challenger models (ValidMind for model validation)

FInalize testing and reporting (ValidMind for model validation)

Uh oh!

github-actions bot commented May 27, 2025

PR Summary

Test Suggestions

Uh oh!

validbeck commented May 27, 2025

Uh oh!

LoiAnsah commented May 27, 2025

Uh oh!

validbeck commented May 27, 2025

Uh oh!

LoiAnsah commented May 27, 2025

Uh oh!

LoiAnsah commented May 27, 2025

Uh oh!

validbeck commented May 27, 2025

Uh oh!

validbeck commented May 27, 2025

Uh oh!

Uh oh!

github-actions bot commented May 27, 2025

PR Summary

Test Suggestions

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

validbeck commented May 22, 2025 •

edited

Loading