OpenAI did it
https://openai.com/index/introducing-codex/
They presented Codex, a cloud-based software engineering agent powered by codex-1 (a version of o3 specialized for software engineering). It should not be confused with the 2021 Codex model or with Codex CLI, the agentic coding tool released last month.
Seriously: I recently wrote that orchestrating the tasks of AI programming agents is one of the most important problems to solve right now, and judging by the video presentation, they have done just that. It's not yet available on the standard Plus plan, only on Pro ($200/month), so not everyone will be able to try it.
Codex handles small, well-defined tasks well, but according to user feedback, so far it struggles with follow-up requests in the chat. This means you first need to break the work down into a set of tasks that will not change afterward.
Codex is not intended for "vibe coding" and is best suited for experienced engineers working with stable repositories: adding features or fixing bugs. It has a simple interface, similar to the familiar ChatGPT, with a text field for describing the task and "Ask" and "Code" buttons.
https://www.youtube.com/watch?v=utujQfglbk8
There's a "play"-style button that sends the task to the agent, which runs in the cloud in the background. The task is queued, and then a detailed execution log is shown. Judging by the video presentation, this looks like a significant achievement for the field of AI programming agents.
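To make the "queue the task, then show a log" flow concrete, here is a minimal local sketch in Python. It is not Codex's API or implementation: the `Task` class, the `worker` function, and `run_in_background` are hypothetical names, and the "work" is a placeholder. It only illustrates the pattern of handing fixed, well-defined tasks to a background worker and reading their logs afterward.

```python
import queue
import threading
from dataclasses import dataclass, field


@dataclass
class Task:
    """A small, self-contained unit of work (hypothetical structure)."""
    description: str
    status: str = "queued"
    log: list[str] = field(default_factory=list)


def worker(tasks: "queue.Queue[Task | None]") -> None:
    """Take tasks off the queue one at a time and record an execution log."""
    while True:
        task = tasks.get()
        if task is None:  # sentinel: no more work
            tasks.task_done()
            break
        task.status = "running"
        task.log.append(f"started: {task.description}")
        # A real agent would edit code and run tests here; this is a placeholder.
        task.log.append("placeholder step finished")
        task.status = "done"
        task.log.append("finished")
        tasks.task_done()


def run_in_background(descriptions: list[str]) -> list[Task]:
    """Queue every task up front, process them in a background thread, return them."""
    tasks: queue.Queue = queue.Queue()
    submitted = [Task(d) for d in descriptions]
    thread = threading.Thread(target=worker, args=(tasks,), daemon=True)
    thread.start()
    for task in submitted:
        tasks.put(task)
    tasks.put(None)  # tell the worker to stop after the last task
    tasks.join()     # wait until every queued item has been processed
    thread.join()
    return submitted


if __name__ == "__main__":
    for task in run_in_background(["fix typo in README", "add unit test"]):
        print(task.status, task.log)
```

Note that the task list is fixed before anything runs, which mirrors the advice above: split the work into tasks that will not change once submitted.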
By the way, Cursor also added a preview of the background agent feature for a limited number of users in the new version 0.50.
#openaicodex