
Conversation

dbschmigelski
Member

Description

In LiteLLMModel we currently attempt to perform structured_output using the model's native capabilities. This is checked using the supports_response_schema method.

This has two edge cases that are causing customer failures.

  1. The first, and more obvious, is that if a model does not support structured output we fail hard. This means that if a customer changes the model id, code that previously worked will now fail with ValueError("Model does not support response_format").

  2. The second is what appears to be a bug in LiteLLM: supports_response_schema does not work with proxies. Looking at their code, they do not pass along proxy information, so there is no way to perform a runtime check for proxy-routed models.

To fix this, when supports_response_schema returns false we now fall back to the tool-based approach used by other model providers. A fix in LiteLLM will be explored, but in the immediate term we need to unblock customers.
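For illustration, a minimal sketch of the branching this introduces. It assumes litellm's supports_response_schema helper and wraps the decision in an illustrative standalone function; the real dispatch in LiteLLMModel is async and yields events, but the condition is the same.

import litellm

def pick_structured_output_path(model_id: str) -> str:
    """Return which structured output path would be used for the given model id (sketch only)."""
    try:
        native = litellm.supports_response_schema(model=model_id)
    except Exception:
        # Unknown or proxy-routed model ids may not be resolvable; treat as unsupported.
        native = False
    # Fall back to the tool-based approach instead of raising ValueError.
    return "response_schema" if native else "tool_fallback"

print(pick_structured_output_path("openai/gpt-4o"))          # likely "response_schema"
print(pick_structured_output_path("my-proxy/custom-model"))  # likely "tool_fallback"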

Related Issues

#862
#909

Type of Change

New feature

Testing

How have you tested the change? Verify that the changes do not break functionality or introduce warnings in consuming repositories: agents-docs, agents-tools, agents-cli

  • I ran hatch run prepare

Checklist

  • I have read the CONTRIBUTING document
  • I have added any necessary tests that prove my fix is effective or my feature works
  • I have updated the documentation accordingly
  • I have added an appropriate example to the documentation to outline the feature, or no new docs are needed
  • My changes generate no new warnings
  • Any dependent changes have been merged and published

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

if len(response.choices) > 1:
    raise ValueError("Multiple choices found in the response.")
if not response.choices or response.choices[0].finish_reason != "tool_calls":
Member Author

A good point was raised in #483 proposing that we can remove this check, since we are not even using the tool function arguments and are instead using the message.

But that change is left to be completed in #483 and is considered out of scope for this PR.

Contributor

Wouldn't this break the logic here? Are structured output responses from all models returned as tool_calls?

Member Author

According to #483, there are some models that don't return structured output responses with tool_calls. So the claim the contributor makes is that we are currently blocking the models that do not.

But that is something that is currently broken and not directly related to this PR.

This PR addresses the case where supports_response_schema is false. #483 covers the case where supports_response_schema is true but there are no tool_calls.
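For context only, a rough sketch of the relaxed check #483 is arguing for (not part of this PR; variable names follow the excerpt above):

if len(response.choices) > 1:
    raise ValueError("Multiple choices found in the response.")
if not response.choices:
    raise ValueError("No choices found in the response.")
# Rather than requiring finish_reason == "tool_calls", parse the message content
# directly, since the tool function arguments are not used anyway.
output_data = response.choices[0].message.content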



def test_structured_output_unsupported_model(model, nested_weather):
    # Mock supports_response_schema to return False to test fallback mechanism
Member

Is there a real model we can use so we can avoid mocking?

Member Author
dbschmigelski Oct 7, 2025

We could create a proxy, but I wanted to avoid the case where litellm updates and then our test fails.

Member

Why do we need a proxy? Can we just provide an API key to litellm and choose a model that would work with this? Otherwise, this is more of a unit test.

Member Author

It's still verifying that the tool extraction works.

Member Author

The models that we have access to are, I believe, all statically defined to support structured output.
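For reference, a minimal sketch of the mocked test being discussed; the patch target, fixture shapes, and event format are assumptions rather than the actual test code:

from unittest import mock

import pytest

@pytest.mark.asyncio
async def test_structured_output_unsupported_model(model, nested_weather):
    # Force the fallback path by pretending the model has no native response_format support.
    prompt = [{"role": "user", "content": [{"text": "What is the weather in Seattle?"}]}]
    with mock.patch("litellm.supports_response_schema", return_value=False):
        events = [event async for event in model.structured_output(nested_weather, prompt)]
    # The tool-based fallback should still produce a parsed output event at the end.
    assert "output" in events[-1]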


yield {"output": result}

async def _structured_output_using_response_schema(
Member

Can we add debug logging for the different paths, so we know which is used?
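A small sketch of what that could look like; the flag name and logger setup are assumptions:

import logging

logger = logging.getLogger(__name__)

# Inside structured_output, just before dispatching:
if supports_schema:
    logger.debug("using native response_format for structured output")
else:
    logger.debug("model does not support response_format, falling back to tool-based structured output")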




dbschmigelski changed the title from "feat(models): use tool for structured_output when supports_response_schema=false" to "feat(models): use tool for litellm structured_output when supports_response_schema=false" on Oct 8, 2025