Description
You've probably seen the powerful general-purpose AI models that are out there now, such as Google's Gemma 3, which can describe images.
It would still be nice to make the dataset we built over the years available as:
(a) our contribution to the general data lake. General AI models are often accused of "stealing" data, so the more voluntarily contributed data we can point to, the better.
(b) a benchmark: we could run other vision models and see how well they predict our labels.
There are also good models for generating vector embeddings from text (see e.g. the Hugging Face text embedding comparison). These could sift through similar labels, and we could train vision nets that map each pixel straight into that embedding space.
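To make the "sift through similar labels" idea a bit more concrete, here is a rough sketch. It assumes each label already has an embedding vector from some text embedding model (which one is left open); the `similar_labels` helper, the threshold and the toy vectors are made up purely for illustration and are not real model output.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def similar_labels(query, embeddings, threshold=0.8):
    """Return the other labels whose embedding is close to the query's, best first.

    `embeddings` maps label text -> vector; the vectors are assumed to come
    from whatever text embedding model is chosen.
    """
    q = embeddings[query]
    scored = [
        (cosine(q, vec), label)
        for label, vec in embeddings.items()
        if label != query
    ]
    return [label for score, label in sorted(scored, reverse=True) if score >= threshold]


# Toy 3-d vectors, hand-written for the example (NOT real embeddings).
toy = {
    "tabletop": [1.0, 0.1, 0.0],
    "wooden_tabletop": [0.9, 0.2, 0.0],
    "fuselage": [0.0, 0.1, 1.0],
}
print(similar_labels("tabletop", toy))
```

With real embeddings the threshold would need tuning, and near-duplicate labels found this way would probably still want a human look before being merged.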
Q1
Would you be able to 'make productive' (add to your official label list) some of the most-used label suggestions? It would let more people see what got done here, and browse and download it. Maybe you could add the 100 most-used labels, or all the labels with more than 100 annotations.
Off the top of my head, some of these would be good:
window
left/cat right/cat left/dog right/dog left/car right/car
handle
head/insect thorax/insect abdomen/insect wing/insect
tabletop wooden_tabletop
fuselage
cockpit
various parts without their object prefix, e.g. head, foot, hand
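The two selection rules suggested in Q1 (the 100 most-used labels, or every label with more than 100 annotations) could be sketched roughly like this. I don't know how the annotations are actually stored, so the record shape (a dict with a `"label"` key) and the `popular_labels` helper are assumptions for illustration only.

```python
from collections import Counter


def popular_labels(annotations, top_n=None, min_count=0):
    """Rank labels by how often they were annotated.

    annotations: iterable of records with a "label" key (assumed shape).
    top_n:       keep only the N most-used labels (None = no limit).
    min_count:   keep only labels with MORE than this many annotations.
    """
    counts = Counter(a["label"] for a in annotations)
    return [(label, n) for label, n in counts.most_common(top_n) if n > min_count]


# Tiny made-up sample, just to show the call shape.
sample = [{"label": "window"}] * 3 + [{"label": "handle"}] * 2 + [{"label": "cockpit"}]

print(popular_labels(sample, top_n=1))      # the single most-used label
print(popular_labels(sample, min_count=1))  # labels with more than 1 annotation
```

On the real data the calls would be `popular_labels(annotations, top_n=100)` or `popular_labels(annotations, min_count=100)`.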
Q2
Do you have an export of this dataset in 'LabelMe' format? It would be great to release it in a form that goes straight into other labelling tools.
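For reference, a minimal sketch of what such an export could look like, assuming 'LabelMe' here means the per-image JSON files read by the open-source labelme desktop tool (not the older MIT LabelMe XML). The field names follow that JSON layout as I understand it and should be checked against the tool version you target; the `to_labelme` helper, its input shape and the version string are placeholders of mine.

```python
import json


def to_labelme(image_path, width, height, polygons):
    """Build one labelme-style JSON document for a single image.

    polygons: list of (label, [(x, y), ...]) pairs (assumed input shape).
    """
    shapes = [
        {
            "label": label,
            "points": [[float(x), float(y)] for x, y in points],
            "group_id": None,
            "shape_type": "polygon",
            "flags": {},
        }
        for label, points in polygons
    ]
    return {
        "version": "5.0.1",   # placeholder; use the version of the target tool
        "flags": {},
        "shapes": shapes,
        "imagePath": image_path,
        "imageData": None,    # image is referenced by path, not embedded
        "imageHeight": height,
        "imageWidth": width,
    }


# Made-up example annotation.
doc = to_labelme("cat.jpg", 640, 480, [("left/cat", [(10, 20), (30, 40), (50, 20)])])
print(json.dumps(doc, indent=2))
```

Writing one such file next to each image is usually enough for the tool to pick the annotations up when the image is opened.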
It might all seem like a drop in the ocean, but every drop counts.
Thanks for keeping this project going and your server running for so long!