Pass `s3://` file URLs directly to API in `BedrockConverseModel` #3663

mochow13 · 2025-12-07T22:11:06Z

DouweM · 2025-12-09T23:42:11Z

pydantic_ai_slim/pydantic_ai/models/bedrock.py

                        format = item.media_type.split('/')[1]
                        assert format in ('jpeg', 'png', 'gif', 'webp'), f'Unsupported image format: {format}'
-                        image: ImageBlockTypeDef = {'format': format, 'source': {'bytes': downloaded_item['data']}}
+                        image: ImageBlockTypeDef = {'format': format, 'source': cast(Any, source)}


Instead of casting this to Any, can we fix the source type hint to be DocumentSourceTypeDef?

DouweM · 2025-12-09T23:42:19Z

pydantic_ai_slim/pydantic_ai/models/bedrock.py

                            'name': name,
                            'format': item.format,
-                            'source': {'bytes': downloaded_item['data']},
+                            'source': cast(Any, source),


tests/models/test_bedrock.py

DouweM · 2025-12-09T23:45:11Z

pydantic_ai_slim/pydantic_ai/models/bedrock.py

-                    format = downloaded_item['data_type']
+                    source: dict[str, Any]
+                    if item.url.startswith('s3://'):
+                        source = {'s3Location': {'uri': item.url}}


There's also a bucketOwner field that users may want to set. Maybe we can tell them to encode it as a query param on the URL, and parse it out here?

Do you mean something like s3://my-bucket/key?bucketOwner=owner?

Yep that's what I was thinking

DouweM · 2025-12-09T23:45:37Z

pydantic_ai_slim/pydantic_ai/models/bedrock.py

+                    if item.url.startswith('s3://'):
+                        source = {'s3Location': {'uri': item.url}}
+                    else:
+                        downloaded_item = await download_item(item, data_format='bytes', type_format='extension')


download_item currently has logic gating for gs:// URLs; let's check s3:// URLs there as well

It seems the existing code in download_item checks for gs:// and youtube URLs:

if item.url.startswith('gs://'): raise UserError('Downloading from protocol "gs://" is not supported.') elif isinstance(item, VideoUrl) and item.is_youtube: raise UserError('Downloading YouTube videos is not supported.')

What check do you mean for s3:// here?

Same check raising an error saying that download_item does not support s3:// URLs

Ah ok need to stop supporting download altogether. Updated.
Kept the check pretty simple. Not sure whether we should go for a proper url parsing here since the expectation is just bucketOwner param.

If it's not drastically more code, I'd prefer proper URL parsing

DouweM · 2025-12-09T23:47:06Z

pydantic_ai_slim/pydantic_ai/models/bedrock.py

We should document this feature in input.md. At the bottom there's already a section on uploaded files to Google, can you mention S3 files + BedrockConverseModel there as well please?

@DouweM I have updated the doc. Added a paragraph for S3 + BedrockConverseModel. Also updated the section above to mention that S3 files will not be downloaded.

Looking at the doc, I am a bit confused. Given that we have updated download_item to skip downloading s3:// URLs altogether, won't it apply to all models? For example, if we pass a s3:// URL to a model that doesn't support downloading itself, because of our check, Pydantic AI will also stop downloading for that particular model, right?

Not sure if I question is clear enough. Basically, in this MR, we are passing the s3:// URL directly to BedrockConverseModel but we have updated downloaed_item function to raise an error if s3:// is passed. This would mean we will stop downloading from s3://URLs for other models, no?

…seModel

DouweM · 2025-12-12T21:47:31Z

pydantic_ai_slim/pydantic_ai/models/bedrock.py

                        format = item.media_type.split('/')[1]
                        assert format in ('jpeg', 'png', 'gif', 'webp'), f'Unsupported image format: {format}'
-                        image: ImageBlockTypeDef = {'format': format, 'source': {'bytes': downloaded_item['data']}}
+                        image: ImageBlockTypeDef = {'format': format, 'source': cast(DocumentSourceTypeDef, source)}


We shouldn't need to cast if hint the type of source to be source: DocumentSourceTypeDef

DouweM changed the title ~~#3621 - Pass s3:// file URLs directly to API in BedrockConverseModel~~ Pass s3:// file URLs directly to API in BedrockConverseModel Dec 9, 2025

DouweM requested changes Dec 9, 2025

View reviewed changes

DouweM self-assigned this Dec 9, 2025

DouweM added the awaiting author revision label Dec 9, 2025

mochow13 force-pushed the issue-3621 branch from 44ca407 to 3d59b7f Compare December 10, 2025 18:32

Motta Kin and others added 3 commits December 12, 2025 21:11

pydantic#3621 - Pass s3:// file URLs directly to API in BedrockConver…

6e6f667

…seModel

Cast source to specific type; update tests to use _map_messages

9631781

Add support for bucketOwner; update tests

fb584c4

mochow13 force-pushed the issue-3621 branch from 3d59b7f to fb584c4 Compare December 12, 2025 20:31

Avoid supporing download item from s3

87cea32

DouweM requested changes Dec 12, 2025

View reviewed changes

Update input.md

a77eeaa

mochow13 force-pushed the issue-3621 branch from 534f130 to a77eeaa Compare December 12, 2025 22:11

mochow13 added 2 commits December 12, 2025 23:22

Fix sentence structure for doc update

076635e

Merge branch 'main' into issue-3621

4d8b111

Pass s3:// file URLs directly to API in BedrockConverseModel #3663

Are you sure you want to change the base?

Pass s3:// file URLs directly to API in BedrockConverseModel #3663

Conversation

mochow13 commented Dec 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mochow13 Dec 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Pass `s3://` file URLs directly to API in `BedrockConverseModel` #3663

Pass `s3://` file URLs directly to API in `BedrockConverseModel` #3663

mochow13 commented Dec 7, 2025 •

edited

Loading

mochow13 Dec 12, 2025 •

edited

Loading