Hook post-hoc binary metrics and plots into standard evaluation framework by smcolby · Pull Request #492 · OpenADMET/openadmet-models

smcolby · 2026-02-26T21:02:34Z

Description

This PR integrates post-hoc binary classification into the standard openadmet evaluation and orchestration framework. By mirroring the established regression pattern, PosthocBinaryMetrics and the newly created PosthocBinaryPlots can now be dynamically instantiated and called by the pipeline.

Key Changes

API Standardization: Updated PosthocBinaryMetrics.evaluate to accept y_true, y_pred, and cutoff. It now returns the standard nested dictionary format expected by the workflow ({"task_0": {"precision": {"value": ...}}}).
New Plotting Class: Implemented PosthocBinaryPlots to generate post-hoc classification scatter plots and confusion matrices. It returns a dictionary of matplotlib.figure.Figure objects.
Dynamic Registration: Registered both classes via the @evaluators.register decorator.
Rigorous Unit Tests: Added test_posthoc_binary_metrics_evaluate and test_posthoc_binary_plots_evaluate to test_eval.py. These tests strictly verify mathematical outputs and object instantiation, adhering to the project's rule against tautological or lazy (assert True) testing.
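
For readers unfamiliar with the nested return format, here is a minimal sketch of what a post-hoc binary evaluation could look like. It is not the code in this PR: the function name `posthoc_binary_metrics`, the `>=` threshold direction, and the choice of precision and recall are assumptions for illustration. Only the argument names (`y_true`, `y_pred`, `cutoff`) and the `{"task_0": {"precision": {"value": ...}}}` shape come from the description above.

```python
import numpy as np


def posthoc_binary_metrics(y_true, y_pred, cutoff):
    """Hypothetical helper: binarize continuous values at `cutoff`, then score."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)

    # Assumption: values at or above the cutoff count as the positive class.
    true_pos_label = y_true >= cutoff
    pred_pos_label = y_pred >= cutoff

    tp = int(np.sum(true_pos_label & pred_pos_label))
    fp = int(np.sum(~true_pos_label & pred_pos_label))
    fn = int(np.sum(true_pos_label & ~pred_pos_label))

    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0

    # Nested format described in this PR: task -> metric -> {"value": ...}
    return {
        "task_0": {
            "precision": {"value": precision},
            "recall": {"value": recall},
        }
    }


result = posthoc_binary_metrics(
    y_true=[5.0, 6.5, 7.2, 4.1],
    y_pred=[5.5, 6.8, 5.9, 6.3],
    cutoff=6.0,
)
print(result)
```

With these inputs there is one true positive, one false positive, and one false negative, so both metrics evaluate to 0.5.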

Status

Ready to go

Developers certificate of origin

I certify that this contribution is covered by the MIT License here and the Developer Certificate of Origin at https://developercertificate.org/.

Added an index filtering step to FeatureConcatenator. Previously, if different featurizers dropped different molecules, the raw arrays were still concatenated, resulting in shape mismatches or mismatched rows. The features are now strictly masked to the common indices prior to concatenation.
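
The masking step can be sketched roughly as follows. This is an illustration under assumptions, not the `FeatureConcatenator` code itself: the helper name `concat_on_common_indices` and the `(features, kept_indices)` tuple layout are hypothetical, and indices are assumed to be unique within each featurizer.

```python
import numpy as np


def concat_on_common_indices(feature_sets):
    """Hypothetical helper: keep only molecules that every featurizer retained.

    feature_sets: list of (features, kept_indices) pairs, where row i of
    `features` belongs to the molecule with original index kept_indices[i].
    """
    # Intersect the surviving indices across all featurizers (sorted, unique).
    common = np.unique(feature_sets[0][1])
    for _, idx in feature_sets[1:]:
        common = np.intersect1d(common, idx)

    blocks = []
    for feats, idx in feature_sets:
        feats = np.asarray(feats)
        idx = np.asarray(idx)
        mask = np.isin(idx, common)
        # Sort the retained rows by original index so rows align across blocks.
        order = np.argsort(idx[mask])
        blocks.append(feats[mask][order])

    return np.concatenate(blocks, axis=1), common


# Featurizer A dropped molecule 2; featurizer B dropped molecule 1.
a = (np.array([[1.0], [2.0], [3.0]]), np.array([0, 1, 3]))
b = (np.array([[10.0, 11.0], [20.0, 21.0], [30.0, 31.0]]), np.array([0, 2, 3]))

combined, kept = concat_on_common_indices([a, b])
print(kept)      # indices shared by both featurizers
print(combined)  # one row per shared molecule, columns from A then B
```

Without the mask, both arrays have three rows and would concatenate without error, but rows 1 and 2 would pair features from different molecules.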

This overhaul replaces slow, high-dependency integration tests with true unit tests utilizing pytest-mock and synthetic data fixtures. Key changes include swapping tautological file-writing mocks for internal state assertions, enforcing strict disjoint set validation for chemical splitters, and implementing rigorous mathematical validation for uncertainty quantification and evaluation metrics. These updates significantly improve execution speed and cross-platform stability by replacing fragile floating-point equality with robust approximate comparisons and isolating testing boundaries for featurizers, inference orchestration, and CLI logic.
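
Two of the patterns mentioned above, approximate floating-point comparison and disjoint split validation, might look like the sketch below. The test names and data are made up for illustration. `math.isclose` from the standard library is used so the snippet runs without extra dependencies; in a pytest suite, `pytest.approx` plays the same role.

```python
import math


def test_sum_uses_approximate_comparison():
    total = 0.1 + 0.2
    # Exact equality is fragile: 0.1 + 0.2 is not exactly 0.3 in binary floats.
    assert total != 0.3
    # A tolerance-based comparison is stable across platforms.
    assert math.isclose(total, 0.3, rel_tol=1e-9)


def test_split_indices_are_disjoint():
    # Hypothetical output of a splitter: index sets for each partition.
    train, val, test = {0, 1, 2, 3}, {4, 5}, {6, 7}
    assert train.isdisjoint(val)
    assert train.isdisjoint(test)
    assert val.isdisjoint(test)
    # Every molecule is assigned to exactly one partition.
    assert len(train | val | test) == len(train) + len(val) + len(test)


test_sum_uses_approximate_comparison()
test_split_indices_are_disjoint()
```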

Updated PosthocBinaryMetrics and created PosthocBinaryPlots to conform to the standard evaluate API, returning nested metric dictionaries and matplotlib objects. Registered both classes and added strict unit tests to verify mathematical accuracy and figure generation.

smcolby · 2026-02-26T21:29:55Z

FYI, we did this just to demo Copilot functionality. It's probably still worth merging for posterity, but in general we're less interested in binary classification workflows.

codecov-commenter · 2026-02-27T00:23:47Z

Codecov Report

❌ Patch coverage is 77.27273% with 25 lines in your changes missing coverage. Please review.

smcolby · 2026-02-28T00:08:05Z

Will resolve #143

smcolby and others added 11 commits February 25, 2026 16:19

Implement minor fixes to dummy regressor to enable reuse in committee ensemble tests

c7c4687

Refactor tests such that they are "unit" rather than "integration"

611b644

Add additional tests for active learning modules

d5a9e8f

Add pytest-mock dependency

28f083b

Add instructions to avoid common testing pitfalls

518aa1e

[pre-commit.ci] auto fixes from pre-commit.com hooks

1df313a

for more information, see https://pre-commit.ci

Add documentation to unit tests

45ef511

Add testing for different splits and ensemble

5505f25

smcolby self-assigned this Feb 26, 2026

[pre-commit.ci] auto fixes from pre-commit.com hooks

c9a3666

for more information, see https://pre-commit.ci

smcolby wants to merge 12 commits into main from copilot-test
