[SC-8568] Add input ids to RawData for enhanced test comparison post processing by juanmleng · Pull Request #317 · validmind/validmind-library

juanmleng · 2025-02-18T14:59:08Z

Internal Notes for Reviewers

Updated vm-library tests to include model and dataset input IDs in RawData objects for comparison tests. This update enables post-processing functions to track and reference which specific models and datasets generated each test result, improving support for comparison test scenarios.
Added a section in how_to/understand_utilize_rawdata.ipynb for illustrating the comparison use case scenario.

External Release Notes

cachafla

Awesome 🫡

AnilSorathiya

nice!

…r-enhanced-test-comparison-post-processing

github-actions · 2025-02-28T12:59:46Z

PR Summary

This pull request introduces significant enhancements to the ValidMind library by improving the utilization of the RawData object across various test functions. The changes include:

Enhancements to RawData Utilization:
- The RawData object now includes additional metadata such as model.input_id and dataset.input_id where applicable. This allows for better tracking and customization of test results.
- Functions across multiple files have been updated to return RawData with enriched information, ensuring that all relevant data is captured for post-processing.
Updates to Unit Tests:
- Unit tests have been updated to accommodate the changes in function signatures, particularly the inclusion of RawData in return values.
- Tests now include assertions to verify that RawData objects are correctly instantiated and contain expected data.
Documentation and Code Quality:
- Inline comments and docstrings have been updated to reflect the new functionality and provide clarity on the usage of RawData.
- Minor formatting adjustments have been made for consistency and readability.

These changes aim to enhance the robustness and flexibility of the ValidMind library, particularly in handling raw data for machine learning model validation and testing.

Test Suggestions

Verify that all updated functions return the expected RawData objects with correct metadata.
Ensure that unit tests correctly assert the presence and structure of RawData in function outputs.
Test the integration of RawData with post-processing functions to confirm that additional metadata is utilized effectively.
Run end-to-end tests to validate that the changes do not break existing functionality.
Check for any performance impacts due to the additional data being processed and returned.

juanmleng added 7 commits February 18, 2025 13:29

Add input ids to NLP tests

53965c0

Update RawData for data validation tests

75f000c

Fix lint

0c05a90

Update RawData for model validation tests

151184c

Update RawData for ongoing monitoring tests

eb84b9c

Update RawData for prompt validation tests

f4922ee

Add section for comparison tests in rawdata notebook

c4db885

juanmleng added internal Not to be externalized in the release notes chore Chore tasks that aren't bugs or new features labels Feb 18, 2025

juanmleng self-assigned this Feb 18, 2025

juanmleng added 2 commits February 18, 2025 16:44

Fix unit tests

c84c679

Remove RawData from RobustnessDiagnosis

18c9a0d

juanmleng requested review from AnilSorathiya, cachafla and johnwalz97 February 18, 2025 16:05

cachafla approved these changes Feb 28, 2025

View reviewed changes

AnilSorathiya approved these changes Feb 28, 2025

View reviewed changes

juanmleng added 2 commits February 28, 2025 13:39

Merge branch 'main' into juan5508/sc-8568/add-input-ids-to-rawdata-fo…

95585d6

…r-enhanced-test-comparison-post-processing

2.8.12

6233b22

juanmleng merged commit 90d2c77 into main Feb 28, 2025
6 checks passed

johnwalz97 deleted the juan5508/sc-8568/add-input-ids-to-rawdata-for-enhanced-test-comparison-post-processing branch August 20, 2025 17:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SC-8568] Add input ids to RawData for enhanced test comparison post processing#317

[SC-8568] Add input ids to RawData for enhanced test comparison post processing#317
juanmleng merged 11 commits intomainfrom
juan5508/sc-8568/add-input-ids-to-rawdata-for-enhanced-test-comparison-post-processing

juanmleng commented Feb 18, 2025

Uh oh!

cachafla left a comment

Uh oh!

AnilSorathiya left a comment

Uh oh!

github-actions bot commented Feb 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

juanmleng commented Feb 18, 2025

Internal Notes for Reviewers

External Release Notes

Uh oh!

cachafla left a comment

Choose a reason for hiding this comment

Uh oh!

AnilSorathiya left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Feb 28, 2025

PR Summary

Test Suggestions

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants