Mining high loss images for data cleaning. #1292
bw4sz
started this conversation in
Show and tell
Replies: 1 comment
-
|
This script is intended as a standalone utility / example, not a core integration. Even without full Label Studio integration, this helps surface label noise that can otherwise appear as poor model performance. Happy to: "move this to "simplify it further" "or just share it as a reference example if that’s preferable" |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Very often we have incomplete labels in both training and validation, where the results suggest that predictions are incorrect, but its actually the ground truth that is wrong. To help find those images, we can loop through and sort images by their loss and look at the high loss to relabel them. While the label studio integration isn't ready yet, this script will help motivate this idea and worth sharing with the community.
Beta Was this translation helpful? Give feedback.
All reactions