Updates OpenAI to use Responses API by JasonTheAdams · Pull Request #161 · WordPress/php-ai-client

JasonTheAdams · 2025-12-30T22:17:04Z

Resolves #96

This PR migrates the OpenAI provider implementation from the legacy Chat Completions API (/v1/chat/completions) to the newer Responses API (/v1/responses) for text generation, and uses the Images API (/v1/images/generations) for image generation. The OpenAiTextGenerationModel and OpenAiImageGenerationModel classes have been completely rewritten to extend AbstractApiBasedModel directly, following the same implementation pattern used by the Google and Anthropic providers. The OpenAiCompatible helper classes remain available for other providers that use OpenAI-compatible APIs.
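
To make the endpoint change concrete, here is a minimal sketch (in Python for brevity, not the library's PHP) of how the same conversation could be shaped for each endpoint. It assumes the publicly documented request fields (`messages` for Chat Completions, `input` and `instructions` for Responses). The function names are hypothetical, and moving system text into `instructions` is only one possible mapping, not necessarily what this PR does.

```python
from typing import Dict, List

Message = Dict[str, str]


def chat_completions_body(model: str, messages: List[Message]) -> dict:
    """Legacy shape: every turn, including the system prompt, lives in `messages`."""
    return {"model": model, "messages": list(messages)}


def responses_body(model: str, messages: List[Message]) -> dict:
    """Responses shape (assumed mapping): system text goes to `instructions`,
    all other turns go to `input`."""
    system_parts = [m["content"] for m in messages if m["role"] == "system"]
    body = {
        "model": model,
        "input": [m for m in messages if m["role"] != "system"],
    }
    if system_parts:
        body["instructions"] = "\n".join(system_parts)
    return body
```

For example, a system message plus one user message yields a single-item `input` list and an `instructions` string in the Responses shape, whereas the Chat Completions shape keeps both entries in `messages`.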

What's New

Built-in web search tool: GPT models can now use OpenAI's native web_search tool via the webSearch config option
Built-in code interpreter: access the code_interpreter tool via customOptions for code execution
Continuing a previous chat: pass an earlier response's ID as previous_response_id to continue the conversation
Responses API for image generation: gpt-image-* models now use the Responses API with the image_generation tool, providing a consistent API surface
Structured output support: JSON schema output via the text.format parameter
Function calling: full support for custom function declarations with the new output format
Comprehensive test coverage: 39 new unit tests covering both text and image generation models
Document support: text generation models can now receive documents (e.g. PDFs)
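
As a rough illustration of the list above, the following Python sketch assembles a Responses API request body with the web search tool, function declarations, and JSON schema output. The field layout (`tools`, flat function tool entries, `text.format` with `type: json_schema`) follows my understanding of OpenAI's public documentation and should be checked against it. `build_responses_request` and its parameters are hypothetical and do not mirror the PHP classes in this PR.

```python
from typing import Any, Dict, List, Optional


def build_responses_request(
    model: str,
    prompt: str,
    web_search: bool = False,
    functions: Optional[List[Dict[str, Any]]] = None,
    json_schema: Optional[Dict[str, Any]] = None,
    schema_name: str = "result",  # hypothetical default name for the schema
) -> Dict[str, Any]:
    """Assemble a request body; only options that were asked for are included."""
    body: Dict[str, Any] = {"model": model, "input": prompt}

    tools: List[Dict[str, Any]] = []
    if web_search:
        tools.append({"type": "web_search"})
    for declaration in functions or []:
        # Assumed shape: name, description, and parameters sit next to "type".
        tools.append({"type": "function", **declaration})
    if tools:
        body["tools"] = tools

    if json_schema is not None:
        body["text"] = {
            "format": {
                "type": "json_schema",
                "name": schema_name,
                "schema": json_schema,
            }
        }
    return body
```

Calling `build_responses_request("gpt-5", "Latest WordPress release?", web_search=True)` produces a body whose `tools` list holds a single `web_search` entry and no `text` key.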

Because the image generation tool makes the output multimodal, I'm not adding support for it for now. We're discussing a solution in #160 and can then circle back to adding support.

Bonus

I updated cli.php to support stdin and file references. This was useful for testing multimodal input, such as having it examine an image and describe it to me.

Testing

I tested out the text generation as well as image generation, web search tool, code interpretation, and making an accurate picture of my crazy dog jumping out of a helicopter:

github-actions · 2025-12-30T22:17:14Z

The following accounts have interacted with this PR and/or linked issues. I will continue to update these lists as activity occurs. You can also manually ask me to refresh this list by adding the props-bot label.

If you're merging code through a pull request on GitHub, copy and paste the following into the bottom of the merge commit message.

Co-authored-by: JasonTheAdams <jason_the_adams@git.wordpress.org>
Co-authored-by: felixarntz <flixos90@git.wordpress.org>
Co-authored-by: raftaar1191 <raftaar1191@git.wordpress.org>

To understand the WordPress project's expectations around crediting contributors, please review the Contributor Attribution page in the Core Handbook.

felixarntz

@JasonTheAdams Thanks for working on this!

I haven't done a full review yet, but my early feedback focuses mostly on concerns with the image generation implementation. I'm not convinced that it needs to use the Responses API.

felixarntz · 2025-12-31T01:40:47Z

src/ProviderImplementations/OpenAi/OpenAiImageGenerationModel.php

+ * Class for an OpenAI image generation model using the Responses API.
 *
- * @since 0.1.0
+ * This uses the Responses API with the built-in image_generation tool.


I didn't expect to see the Responses API being used for image generation. Given that you did, and given your example, I assume it's possible, but is that the case for all image generation models? Have you tried the implementation with older models like dall-e-3? I just want to make sure they can also be used with the Responses API.

My original expectation was that we would only need to update the OpenAiTextGenerationModel to use the Responses API.

JasonTheAdams · 2025-12-31T19:40:10Z

@felixarntz It's completely reasonable to wonder why I'm using the Responses API and not the Images API. After much research, I came out the other side wondering if OpenAI could have made it any more convoluted. 😂 😭

I'm going to write this out so we're on the same page, though I'm sure you know a good bit of it.

Images can be generated using both the Images API and the Responses API. The Responses API is, as a whole, the more capable API, providing things like message history, caching (to help with cost), stateful history, multimodal output, and so forth. The only thing it can't currently do is use gpt-image-1.5 (only gpt-image-1). But it has trade-offs:

Image generation is a tool of the base model. So you can't actually use gpt-image-1 directly; you have to use something like gpt-5 which then calls the image model. This means you effectively need to specify two models.
The image prompt is adjusted by the base model. So if you put "a king charles spaniel" the base model will attempt to rewrite that into a "better" image prompt. This may be useful, but it may also adjust it in a surprising way.
As mentioned, you can't use the gpt-image-1.5 model yet.
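
A small Python sketch may help show what "image generation is a tool of the base model" means in practice. It assumes, based on my reading of OpenAI's public documentation, that the request lists an `image_generation` tool and that the response contains `image_generation_call` output items carrying a base64 `result` and a `revised_prompt`. Both helper names are hypothetical, and the response here is a hand-written stand-in, not real API output.

```python
import base64
from typing import Any, Dict, List, Tuple


def image_tool_request(base_model: str, prompt: str) -> Dict[str, Any]:
    """The base (text) model is named in `model`; the image model is reached via the tool."""
    return {
        "model": base_model,
        "input": prompt,
        "tools": [{"type": "image_generation"}],
    }


def extract_images(response: Dict[str, Any]) -> List[Tuple[bytes, str]]:
    """Collect (image bytes, revised prompt) pairs from the mixed output items."""
    images: List[Tuple[bytes, str]] = []
    for item in response.get("output", []):
        if item.get("type") == "image_generation_call" and item.get("result"):
            image_bytes = base64.b64decode(item["result"])
            images.append((image_bytes, item.get("revised_prompt", "")))
    return images
```

Because the output list can mix text messages and image calls, a caller has to filter by item type. The revised prompt is returned alongside the image so the rewrite done by the base model can at least be inspected.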

It's weird. I don't love my first pass, but it does work. I'm thinking of making a decision matrix so it conditionally uses the most "optimal" API. Something like:

flowchart TD
  A([Start: intent = generateImage]) --> B{Has message history}
  B -- Yes --> R[Use Responses API]
  B -- No --> C{Has previous_response_id}
  C -- Yes --> R
  C -- No --> D{Specified model is non-image}
  D -- Yes --> R
  D -- No --> E{Needs multimodal output}
  E -- Yes --> R
  E -- No --> F{Needs pre-render reasoning}
  F -- Yes --> R
  F -- No --> I[Use Images API]

I'd also consider adding a custom option so someone can specify the base and/or image model if the Responses API is used.

What do you think?

felixarntz · 2026-01-01T19:19:01Z

@JasonTheAdams Thanks for outlining this in so much detail! Your rationale makes a lot of sense to me.

Two follow-up questions:

"Has previous_response_id" - what does this mean? What is a "previous response ID"?
"Specified model is non-image" - unless we intentionally make that possible, right now it wouldn't be. We wouldn't assign a non-image model the image generation capability, at least not now. And I'm not sure why that would make sense. If we want to enable someone to use the Responses API for image generation, I think that should rather be a conscious decision where you can specify the image generation model as usual, but then through some other more advanced option the "host" model for the Responses API.

For now, for this PR, I would suggest keeping things simple for image generation and sticking with only the Images API. While the Responses API is better in many ways, it's also a lot more complex to think through and properly implement for the image generation use case, and while most branches of the decision tree you shared end up in the Responses API, I think 95% of actual usage will end up in the one branch that goes to the Images API.

The most valuable benefit of the Responses API for image generation IMO is that it can deal with messages history.

But we can add support for that separately, let's open an issue. The purpose of this PR is primarily to migrate chat/completions usage to the Responses API, since for that the latter is clearly recommended, so I'd say let's leave image generation (almost?) untouched.

JasonTheAdams · 2026-01-05T23:03:41Z

"Has previous_response_id" - what does this mean? What is a "previous response ID"?

The Responses API supports stateful prompting. You can connect it with the Conversations API or use this previous response ID.

In our system, the response ID would be sent back as "additional data" in the GenerativeAiResult, which could then be passed as a custom config option.
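
A minimal Python sketch of that round trip, assuming the response body exposes its identifier as `id` and the next request accepts it as `previous_response_id` (both per OpenAI's public documentation as I understand it). `follow_up_request` is a hypothetical helper, and the response dictionary is a hand-written stand-in rather than real output.

```python
from typing import Any, Dict, Optional


def follow_up_request(
    model: str,
    prompt: str,
    previous_response: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Build a request that continues from an earlier response, if one is given."""
    body: Dict[str, Any] = {"model": model, "input": prompt}
    if previous_response is not None:
        # The server-side state is referenced by ID instead of resending history.
        body["previous_response_id"] = previous_response["id"]
    return body


# Stand-in for a response returned by an earlier call.
first_response = {"id": "resp_example", "output": []}
second_request = follow_up_request("gpt-5", "And in French?", first_response)
```

The second request carries only the new prompt plus the ID, so the earlier turns do not need to be included again.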

"Specified model is non-image"

I see what you mean. I'd be fine with that for now. 👍

I like the idea of keeping this simple for images and opening an Issue afterwards. I'll update this to go that route!

JasonTheAdams · 2026-01-05T23:34:55Z

Back to you, @felixarntz! I switched the image generation portion to use the Images API. Here are some fun tests of "A happy Belgian Malinois dog playing in a sunny field":

Model: gpt-image-1

Model: dall-e-3
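
Since the same endpoint serves both model families, the request parameters may need per-model adjustment. The Python sketch below assumes that `output_format` is an option for gpt-image-* models that DALL-E models do not accept, which is my reading of OpenAI's documentation and of the commit title "adds unsetting output format back for DALL-E models". It is not the PR's actual code, and `images_api_params` is a hypothetical helper.

```python
from typing import Any, Dict


def images_api_params(model: str, prompt: str, **options: Any) -> Dict[str, Any]:
    """Build /v1/images/generations parameters, dropping options a model family rejects."""
    params: Dict[str, Any] = {"model": model, "prompt": prompt, **options}
    if model.startswith("dall-e"):
        # Assumption: DALL-E models do not support the `output_format` option.
        params.pop("output_format", None)
    return params
```

With this sketch, passing `output_format="png"` is preserved for gpt-image-1 and silently removed for dall-e-3.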

JasonTheAdams · 2026-01-05T23:55:37Z

I did add support for previous_response_id to the text generation because it was very simple to add and could certainly be useful for some folks.

felixarntz

@JasonTheAdams The Responses API implementation looks solid, a few small points there.

For image generation, I'm now confused about why you're not reusing the existing abstract OpenAI-compatible implementation, which already relies on that same endpoint.

src/ProviderImplementations/OpenAi/OpenAiImageGenerationModel.php

src/ProviderImplementations/OpenAi/OpenAiTextGenerationModel.php

src/Tools/DTO/CodeExecution.php

felixarntz

@JasonTheAdams LGTM - awesome work! One small last nit-pick, but good to go.

src/ProviderImplementations/OpenAi/OpenAiImageGenerationModel.php

Co-authored-by: Felix Arntz <flixos90@gmail.com>

feat: updates OpenAI to use Responses API

6e1ae13

JasonTheAdams requested a review from felixarntz December 30, 2025 22:17

JasonTheAdams self-assigned this Dec 30, 2025

JasonTheAdams added this to the 0.4.0 milestone Dec 30, 2025

JasonTheAdams added the [Type] Enhancement A suggestion for improvement. label Dec 30, 2025

JasonTheAdams added 4 commits December 30, 2025 15:56

fix: corrects custom options

d19795c

feat: adds cli support for stin or file referencing

e24f55f

fix: corrects always false return

8a86119

test: corrects incorrect namespace

6f21cb6

felixarntz requested changes Dec 31, 2025

feat: switches to using Images API for image generation model

00caa6e

JasonTheAdams requested a review from felixarntz January 5, 2026 23:32

refactor: removes image generation from text model

aac8cbe

feat: adds support for document inputs

437d0d9

felixarntz requested changes Jan 8, 2026

JasonTheAdams added 3 commits January 8, 2026 16:15

refactor: uses OpenAi compatible image model class

32291e8

refactor: simplifies redundancies

932f26d

fix: corrects function calling in input array

bf04bc6

JasonTheAdams requested a review from felixarntz January 9, 2026 05:57

felixarntz reviewed Jan 9, 2026

JasonTheAdams force-pushed the add/proper-openai-provider-implementation branch from 1f41f2b to bf04bc6 Compare January 13, 2026 00:09

JasonTheAdams added 3 commits January 12, 2026 17:38

refactor: cleans up function call/response handling

037e873

refactor: uses parent method

97a5d6a

chore: corrects @since tags

dfb63a7

JasonTheAdams added 3 commits January 14, 2026 15:37

fix: adds unsetting output format back for DALL-E models

da89e4d

refactor: removes code interpretation for now

603b98b

refactor: moves function call/response part validation to OpenAI

8a563be

JasonTheAdams requested a review from felixarntz January 14, 2026 23:03

test: fixes linting

725b868

felixarntz approved these changes Jan 16, 2026

JasonTheAdams and others added 5 commits January 16, 2026 10:24

refactor: uses $params instead of bypassing data

25fbc4e

Co-authored-by: Felix Arntz <flixos90@gmail.com>

refactor: improves image generation parms typing

18aa79b

Merge branch 'trunk' into add/proper-openai-provider-implementation

896ed95

refactor: removes stream method as part of broader removal

d7b7629

test: removes streaming test

70a097d

JasonTheAdams merged commit b2257c2 into trunk Jan 16, 2026
5 checks passed

JasonTheAdams deleted the add/proper-openai-provider-implementation branch January 16, 2026 18:06
