Skip to content

Notebook to show Raw Data Feature#309

Merged
johnwalz97 merged 8 commits intomainfrom
john6797/sc-8282/documentation-add-raw-data-to-validmind-library
Feb 5, 2025
Merged

Notebook to show Raw Data Feature#309
johnwalz97 merged 8 commits intomainfrom
john6797/sc-8282/documentation-add-raw-data-to-validmind-library

Conversation

@johnwalz97
Copy link
Contributor

@johnwalz97 johnwalz97 commented Feb 4, 2025

Internal Notes for Reviewers

Adding a notebook to directly demo the new Raw Data feature as well as a quick fix to better display raw data via the inspect() function.

External Release Notes

@johnwalz97 johnwalz97 added documentation Improvements or additions to documentation highlight Feature to be curated in the release notes chore Chore tasks that aren't bugs or new features labels Feb 4, 2025
@johnwalz97 johnwalz97 requested a review from validbeck February 4, 2025 18:42
@johnwalz97 johnwalz97 added internal Not to be externalized in the release notes and removed highlight Feature to be curated in the release notes labels Feb 4, 2025
@johnwalz97 johnwalz97 requested a review from juanmleng February 4, 2025 18:52
@github-actions
Copy link
Contributor

github-actions bot commented Feb 4, 2025

PR Summary

This pull request introduces a new Jupyter notebook, understand_utilize_rawdata.ipynb, which provides a comprehensive guide on how to understand and utilize the RawData feature in ValidMind tests. The notebook includes setup instructions, usage examples, and custom test creation using RawData. It demonstrates how to access, inspect, and utilize RawData for creating custom outputs such as ROC curves, Pearson Correlation Matrices, and Precision-Recall Curves.

Additionally, the PR enhances the validmind/utils.py module by improving the handling of generic objects. The handle_generic_object method now includes a try-except block to safely return the class name of an object, and the default method has been added to provide a preview of DataFrame objects, including their type, a preview of the first five rows, and their shape. This enhancement improves the robustness and readability of object handling within the ValidMind library.

Test Suggestions

  • Run the Jupyter notebook to ensure all cells execute without errors and produce the expected outputs.
  • Test the RawData feature by creating custom tests and verifying the output customization using the provided examples.
  • Verify the new default method in validmind/utils.py correctly handles DataFrame objects by returning the expected preview and shape information.
  • Test the handle_generic_object method to ensure it correctly returns the class name of various object types, including those without a __str__ method.

Copy link
Collaborator

@validbeck validbeck left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks John! I cleaned up the notebook, applied our style guidelines, and moved it into how_to. Once this PR is merged I'll also wrap the notebook into our docs site.

@johnwalz97 johnwalz97 merged commit 06175b6 into main Feb 5, 2025
6 checks passed
@johnwalz97 johnwalz97 deleted the john6797/sc-8282/documentation-add-raw-data-to-validmind-library branch February 5, 2025 15:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

chore Chore tasks that aren't bugs or new features documentation Improvements or additions to documentation internal Not to be externalized in the release notes

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants