Description
Hi, I have been trying out different things with EfficientSAM3, and most of the time it works great. Thanks for the great work.
However, I have noticed that when using the distilled image encoder + text encoder, there are sometimes very low confidence-score masks even for simple prompts and images.
Using the sam3/efficientsam3_examples/efficientsam3_image_predictor_example.ipynb notebook with the provided example image and prompt ("a shoe"), the results are quite good.
But with a different image of a dog and the text prompt "dog", I had to lower the confidence threshold by a lot to get any meaningful masks.
from PIL import Image

dog_image = "dog6/01.jpg"
image = Image.open(dog_image)
width, height = image.size

# Confidence threshold lowered well below the default to get any masks at all
processor = Sam3Processor(model, confidence_threshold=0.02)
inference_state = processor.set_image(image)
processor.reset_all_prompts(inference_state)
inference_state = processor.set_text_prompt(state=inference_state, prompt="dog")

plot_results(image, inference_state)
After taking a closer look, I see that this is mostly due to presence_logit_dec being too low, even for simple cases like this one.
Prompt: 'dog'
Presence score (single value): 0.0320
Top-5 classification probs: [0.7695, 0.4902, 0.3438, 0.332, 0.2734]
Top-5 final probs (class × presence): [0.0247, 0.0156, 0.011, 0.0106, 0.0087]
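For clarity, here is how the numbers above relate, as I understand the scoring: the presence logit goes through a sigmoid, and the resulting presence probability multiplicatively down-weights every per-query classification probability. This is a minimal sketch; the logit value of -3.41 is back-computed from the reported presence score of 0.0320, not read from the model, and the actual internals of Sam3Processor may differ.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

# Hypothetical logit back-computed from the reported presence score of 0.0320
presence_logit_dec = -3.41
class_probs = [0.7695, 0.4902, 0.3438, 0.3320, 0.2734]  # top-5 from the run above

presence_prob = sigmoid(presence_logit_dec)             # ~0.032
final_probs = [p * presence_prob for p in class_probs]  # ~[0.0246, 0.0157, ...]
```

So even a strong 0.77 classification probability ends up near the 0.02 threshold once multiplied by the presence score.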
Questions/Discussion:
- Is this expected, and is it an effect of the distillation itself or of the dataset used for distillation?
- Perhaps EfficientSAM3 needs a less aggressive function applied to presence_logit_dec. Currently it is a sigmoid, which pushes the presence probability toward either extreme. Have you already considered or run experiments around this?
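To make the second point concrete, one option would be a temperature-scaled sigmoid. This is purely an illustrative sketch of the idea, not something the repo implements; the temperature value 3.0 is an arbitrary example, and the logit -3.41 is the back-computed value from my run above.

```python
import math

def tempered_sigmoid(x, temperature=1.0):
    # Temperature > 1 flattens the curve, keeping the output
    # away from the extremes 0 and 1.
    return 1.0 / (1.0 + math.exp(-x / temperature))

logit = -3.41                              # presence logit from the example above
standard = tempered_sigmoid(logit)         # ~0.032, strongly suppresses all masks
softened = tempered_sigmoid(logit, 3.0)    # ~0.243, a much milder down-weighting
```

With a softer mapping like this, a mildly negative presence logit would dampen the final scores without pushing them below any reasonable confidence threshold.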