
[SC-10275] Add Satisfactory vs. Needs Attention Flags to Ongoing Monitoring Tests#375

Merged
juanmleng merged 3 commits into main from
juan/sc-10275/add-satisfactory-vs-needs-attention-flags-to-ongoing-monitoring-tests
May 21, 2025

Conversation

Contributor

@juanmleng juanmleng commented May 20, 2025

Pull Request Description

What

Added an acceptable-performance flag to the log_metric() function that visually indicates metric status with color-coded badges. Setting passed=True shows a green "Satisfactory" badge on the chart, while passed=False shows a yellow "Requires Attention" badge. This gives an immediate visual indication of whether metric values meet predefined performance standards, beyond showing raw values relative to thresholds.

Why

Monitoring metric values against thresholds is valuable, but users often need an indication of whether a metric is acceptable or problematic according to business rules. This enhancement addresses that need by:

  • Enabling users to set custom acceptance criteria beyond simple threshold comparisons
  • Making it immediately obvious which metrics require attention
  • Supporting compliance and laying the foundation for workflow-triggered escalations when metrics breach acceptable-performance rules

How to Test

  1. Initialize the ValidMind library with your API credentials and enable monitoring
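A minimal initialization sketch for step 1 (credential values are placeholders; the exact parameter names, including the monitoring flag, and the log_metric import path should be confirmed against the library docs; the later steps assume these imports):

   import validmind as vm
   from datetime import datetime
   from validmind.api_client import log_metric  # import path may vary by library version

   vm.init(
       api_host="https://api.your-environment.validmind.ai/api/v1/tracking",  # placeholder host
       api_key="...",
       api_secret="...",
       model="<model identifier>",
       monitoring=True,  # assumed flag for the ongoing monitoring context
   )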
  2. Log a metric with passed=True using the following code:
   log_metric(
       key="Test Metric",
       value=0.75,
       recorded_at=datetime.now(),
       thresholds={"medium_risk": 0.6},
       passed=True
   )
  3. Log another metric with passed=False:
   log_metric(
       key="Test Metric",
       value=0.55,
       recorded_at=datetime.now(),
       thresholds={"medium_risk": 0.6},
       passed=False
   )
  4. Create a Metrics Over Time block in the ValidMind Platform for your model
  5. Verify that the metrics show the appropriate badges (green "Satisfactory" for passed=True, yellow "Requires Attention" for passed=False)
  6. Test with a custom evaluation function:
   def custom_evaluator(value):
       return value > 0.6
   
   log_metric(
       key="Test Metric",
       value=0.65,
       recorded_at=datetime.now(),
       thresholds={"medium_risk": 0.6},
       passed=custom_evaluator(0.65)
   )

Pull Request Dependencies

https://github.com/validmind/backend/pull/1501
https://github.com/validmind/frontend/pull/1403

External Release Notes

The ValidMind Library now supports visual status indicators for ongoing monitoring metrics. When using the log_metric() function, you can specify the passed parameter to add status badges to Metrics Over Time blocks.

This new parameter accepts a boolean value:

  • passed=True: Displays a green "Satisfactory" badge
  • passed=False: Displays a yellow "Requires Attention" badge

This feature enables straightforward visual assessment of metric performance against defined business rules. Users can either manually set the status or programmatically determine it using custom evaluation functions that implement specific acceptance criteria.
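For example, a reusable acceptance rule can drive the passed value for several monitoring metrics at once (a sketch only; the metric names and cut-offs are illustrative, and the log_metric import path may differ by library version):

   from datetime import datetime
   from validmind.api_client import log_metric  # import path may vary by library version

   # Illustrative acceptance rules: minimum acceptable value per metric
   acceptance_rules = {"AUC": 0.70, "Precision": 0.60}
   observed = {"AUC": 0.74, "Precision": 0.55}

   for name, value in observed.items():
       log_metric(
           key=name,
           value=value,
           recorded_at=datetime.now(),
           thresholds={"medium_risk": acceptance_rules[name]},
           passed=value >= acceptance_rules[name],  # Satisfactory only if the rule is met
       )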

The status indicators are especially useful for:

  • Quickly identifying metrics that require attention
  • Supporting compliance documentation with clear visual indicators
  • Enabling more targeted alerting based on metric status

Deployment Notes

Breaking Changes

Screenshots/Videos (Frontend Only)

[Screenshots: log_metric_satisfactory, log_metric_attention]

Checklist

  • PR body describes what, why, and how to test
  • Release notes written
  • Deployment notes written
  • Breaking changes identified
  • Labels applied
  • PR linked to Shortcut
  • Screenshots/videos added (Frontend)
  • Unit tests added (Backend)
  • Tested locally
  • Documentation updated (if required)

Areas Needing Special Review

Additional Notes

@juanmleng juanmleng self-assigned this May 20, 2025
@juanmleng juanmleng added the enhancement New feature or request label May 20, 2025
@juanmleng juanmleng changed the title from "Add acceptable performance flag" to "[SC-10275] Add Satisfactory vs. Needs Attention Flags to Ongoing Monitoring Tests" May 20, 2025
Contributor

@cachafla cachafla left a comment


Awesome 👌

@github-actions
Contributor

PR Summary

This pull request introduces a new feature to the log_metric and alog_metric functions in the validmind library. The enhancement involves adding a passed parameter, which allows users to explicitly mark whether a specific metric value should be considered "Satisfactory" or "Requires Attention". This is achieved by:

  • Adding the passed parameter to both log_metric and alog_metric functions.
  • Updating the JSON payload to include the passed parameter when logging metrics.
  • Modifying the Jupyter notebook log_metrics_over_time.ipynb to demonstrate the usage of the passed parameter with examples and visualizations.

The notebook now includes sections that explain how to use the passed parameter to add visual badges to metric visualizations, indicating whether the metric meets the acceptance criteria. Examples are provided to show how to use the parameter directly or through a custom function to evaluate metric performance programmatically.
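For the asynchronous variant mentioned above, usage would mirror the synchronous call (a sketch assuming alog_metric shares log_metric's signature; the import path is an assumption):

   import asyncio
   from datetime import datetime
   from validmind.api_client import alog_metric  # import path may vary by library version

   async def log_monitoring_metric():
       # Same arguments as log_metric, awaited instead of called synchronously
       await alog_metric(
           key="Test Metric",
           value=0.75,
           recorded_at=datetime.now(),
           thresholds={"medium_risk": 0.6},
           passed=True,
       )

   asyncio.run(log_monitoring_metric())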

Additionally, the notebook's table of contents and metadata have been updated to reflect these changes.

Test Suggestions

  • Test the log_metric function with passed=True and verify that the correct badge is displayed in the visualization.
  • Test the log_metric function with passed=False and verify that the correct badge is displayed in the visualization.
  • Test the alog_metric function with passed=True and ensure it behaves as expected asynchronously.
  • Test the alog_metric function with passed=False and ensure it behaves as expected asynchronously.
  • Test the log_metric function with a custom passed_fn to ensure it correctly evaluates and applies the badge based on the function's logic.
  • Verify that the notebook examples execute without errors and produce the expected visual output.

@juanmleng juanmleng merged commit 02fb639 into main May 21, 2025
7 checks passed
@juanmleng juanmleng deleted the juan/sc-10275/add-satisfactory-vs-needs-attention-flags-to-ongoing-monitoring-tests branch May 21, 2025 05:45