Skip to content

Hi again .. minor request? #322

@dobkeratops

Description

@dobkeratops

you've probably seen all the super powerful general purpose AI models out there now.. things like google gemma3 which can describe images.

it would still be nice to have this dataset we built over the years as ..

(a) our contribution to the general data lake . There is the accusation that general AI models "steal" data. the more we can show voluntary data the better.
(b) could be used for benchmarks (we could use other vision models and see how well they predict out labels)
There's good models to generate vector embeddings from text aswell eg huggingface text embedding comparison. these could sift through similar labels .. we could train vision nets that go straight into that embedding space per pixel.

Q1
would you be able to 'make productive' (add to your official label list) some of the most used label suggestions .. it would let more people see what got done here, & browse & download Maybe you could add the most 100 used labels.. or all the labels with more than 100 annotations
off the top of my head some of these would be good

window
left/cat right/cat left/dog right/dog left/car right/car
handle
head/insect thorax/insect abdomen/insect wing/insect
tabletop wooden_tabletop
fuselage
cockpit

various parts without their object prefix e.g. head foot hand

Q2
Did you have an export of this dataset into 'LabelMe' format.. it would be great to release it in a form that goes straight into other labelling tools

It might all seem like a drop in the ocean .. but every drop counts

Thanks for keeping this project going and your server running for so long !

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions