
Extend the predict_fn in the init_model to support multiple output columns #394

Merged
AnilSorathiya merged 22 commits into main from anilsorathiya/sc-11324/extend-the-predict-fn-in-the-init-model-to
Jul 23, 2025

Conversation

Contributor

@AnilSorathiya AnilSorathiya commented Jul 17, 2025

Pull Request Description

This change extends the predict_fn callable parameter in init_model to support multiple output columns:

  • The function can return either a single value or a dictionary.
  • If it returns a dictionary, the value under the prediction key is captured as the prediction column.
  • The remaining dictionary keys are added as extra columns in the dataset object.

What and why?

Currently, the predict_fn call only stores its output in a single prediction column, so intermediate outputs from the function call can't be stored for traceability.
This PR implements the following:

  • The function can return either a single value or a dictionary.
  • If it returns a dictionary, the value under the prediction key is captured as the prediction column.
  • The remaining dictionary keys are added as extra columns in the dataset object.
    This allows intermediate outputs from LangGraph/workflow runs to be included when invoking assign_predictions. The additional columns are stored in the VM dataset object for enhanced traceability and further analysis in LLM use cases (see the usage sketch below).
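
A minimal usage sketch of the new behavior, assuming the usual vm.init_dataset / vm.init_model / assign_predictions entry points and a per-row predict_fn signature; the input_id values, column names, and the stand-in agent logic are illustrative and not taken from this PR:

```python
import pandas as pd
import validmind as vm

# Hypothetical predict_fn: returns a dictionary instead of a single value.
# The value under the "prediction" key becomes the prediction column; the
# remaining keys are added as extra columns in the dataset object.
def predict_fn(row):
    answer = f"echo: {row['question']}"  # stand-in for a real LangGraph/agent call
    return {
        "prediction": answer,
        "raw_output": {"answer": answer},
        "tools_used": ["search_tool"],
    }

df = pd.DataFrame({"question": ["What is model risk?"], "target": ["definition"]})

# input_id values below are illustrative
vm_dataset = vm.init_dataset(dataset=df, input_id="demo_dataset", target_column="target")
vm_model = vm.init_model(input_id="demo_model", predict_fn=predict_fn)

# The extra dictionary keys ("raw_output", "tools_used") should appear as
# additional columns alongside the prediction column.
vm_dataset.assign_predictions(model=vm_model)
```

Plain (non-dictionary) return values keep the existing behavior of a single prediction column.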

How to test

Run the following notebooks:

  • notebooks/agents/langchain_agent_simple_demo.ipynb
  • notebooks/agents/langgraph_agent_demo.ipynb
  • notebooks/agents/langgraph_agent_simple_demo.ipynb
  • notebooks/code_samples/nlp_and_llm/rag_documentation_demo.ipynb
  • notebooks/code_samples/nlp_and_llm/rag_benchmark_demo.ipynb
  • notebooks/quickstart/quickstart_model_documentation.ipynb
  • notebooks/code_samples/credit_risk/application_scorecard_full_suite.ipynb

What needs special review?

Dependencies, breaking changes, and deployment notes

Release notes

Checklist

  • What and why
  • Screenshots or videos (Frontend)
  • How to test
  • What needs special review
  • Dependencies, breaking changes, and deployment notes
  • Labels applied
  • PR linked to Shortcut
  • Unit tests added (Backend)
  • Tested locally
  • Documentation updated (if required)
  • Environment variable additions/changes documented (if required)

@AnilSorathiya AnilSorathiya added the documentation and enhancement labels on Jul 17, 2025
@AnilSorathiya AnilSorathiya removed the enhancement label on Jul 18, 2025
@AnilSorathiya AnilSorathiya marked this pull request as ready for review July 18, 2025 18:15
Contributor

@cachafla cachafla left a comment


Looks good 🙌. I suggest testing traditional notebooks (quickstart model doc, quickstart regression, etc.) to ensure the existing code continues to work without issues.

Contributor

@juanmleng juanmleng left a comment


LGTM! One of the key notebooks to test is the credit scorecard notebook application_scorecard_full_suite.ipynb.

@github-actions
Contributor

PR Summary

This PR introduces several key enhancements and refactors across the project:

  1. LLM Agent and Notebook Updates:

    • The notebooks now demonstrate an improved LLM-powered tool selection router with enhanced markdown documentation explaining the benefits of intelligent tool routing.
    • The agent functions have been updated to return a structured dictionary that includes the final prediction, raw output, and a list of tools used. Minor formatting changes and refined docstrings make the demo clearer and more instructive.
  2. Utility Module Refactoring:

    • The langchain utility module has been streamlined by removing redundant helper functions (such as extra tool extraction and formatting routines) and consolidating functionality to capture only essential tool output data.
    • Import paths have been updated to reflect the new structure, improving code clarity and reducing unused imports.
  3. Enhanced Dataset Prediction Assignment:

    • The dataset module now supports diverse prediction outputs, including simple values as well as dictionary responses. New helper functions (_handle_deprecated_parameters, _handle_dictionary_predictions, etc.) modularize the process of adding prediction and probability columns (a standalone sketch of the dictionary handling follows this summary).
    • The prediction assignment workflow now properly distinguishes between computed predictions and precomputed prediction values, ensuring correct column naming and consistent data assignment.
    • Comprehensive new unit tests in tests/test_dataset.py cover scenarios for classification, regression, complex dictionary outputs, multiple models, and error-handling (e.g. invalid predict_fn), ensuring robustness of the new functionality.

Overall, these changes enhance the robustness, clarity, and flexibility of both the LLM agent routing demos and the dataset prediction functionality without altering core business logic.
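
A standalone sketch of the dictionary-splitting idea from item 3 above, using plain pandas; the helper name split_prediction_outputs and the default column name are illustrative and not the library's internal API:

```python
from typing import Any, Dict, List, Union
import pandas as pd

def split_prediction_outputs(
    outputs: List[Union[Any, Dict[str, Any]]],
    prediction_column: str = "model_prediction",
) -> pd.DataFrame:
    """Split predict_fn outputs into a prediction column plus extra columns.

    Plain values map straight to the prediction column; dictionaries must
    contain a "prediction" key, and the remaining keys become extra columns.
    """
    rows = []
    for out in outputs:
        if isinstance(out, dict):
            if "prediction" not in out:
                raise ValueError("dictionary outputs must include a 'prediction' key")
            row = {prediction_column: out["prediction"]}
            row.update({k: v for k, v in out.items() if k != "prediction"})
        else:
            row = {prediction_column: out}
        rows.append(row)
    return pd.DataFrame(rows)

# Example: a dictionary output yields a prediction column plus extra columns
outputs = [{"prediction": 1, "tools_used": ["search"], "raw_output": "..."}]
print(split_prediction_outputs(outputs))
```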

Test Suggestions

  • Run all new and existing unit tests in tests/test_dataset.py to ensure all prediction scenarios (classification, regression, complex outputs) are correctly handled (a sketch of such a test follows this list).
  • Perform integration testing of the LLM agent flow to verify that the tool routing correctly identifies and logs the correct tool usage.
  • Simulate edge cases including empty tool outputs and invalid prediction functions to check for proper error handling and warnings.
  • Validate that the structured return values (including the 'tools_used' list) are correctly propagated to downstream processing.
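
A hedged sketch of the kind of unit test suggested in the first bullet, assuming the same entry points as above; the .df accessor and the expectation that extra keys surface under their own column names are assumptions, and the test is not copied from tests/test_dataset.py:

```python
import pandas as pd
import validmind as vm

def dict_predict_fn(row):
    # Illustrative classification output plus extra traceability fields
    return {"prediction": 1, "confidence": 0.9, "tools_used": ["rules"]}

def test_assign_predictions_with_dictionary_output():
    df = pd.DataFrame({"feature": [0.1, 0.2], "target": [0, 1]})
    vm_dataset = vm.init_dataset(dataset=df, input_id="test_dataset", target_column="target")
    vm_model = vm.init_model(input_id="dict_model", predict_fn=dict_predict_fn)

    vm_dataset.assign_predictions(model=vm_model)

    # Extra dictionary keys are expected to surface as additional dataset columns
    assert "confidence" in vm_dataset.df.columns
    assert "tools_used" in vm_dataset.df.columns
```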

@AnilSorathiya AnilSorathiya merged commit 3454dda into main Jul 23, 2025
7 checks passed
@AnilSorathiya AnilSorathiya deleted the anilsorathiya/sc-11324/extend-the-predict-fn-in-the-init-model-to branch July 23, 2025 12:23
