GPT-5

OpenAI improves Codex iOS experience with turn completion alerts, new commands, more

Last week, OpenAI updated ChatGPT for iPhone and iPad with access to Codex, its agentic coding tool for Mac. Since launching, the Codex iOS experience has improved in a few key ways. Here’s what’s new ...

OpenAI gives Codex for Mac eyes, a remote control and long-term goals

Appshots lets Codex for Mac understand what’s on your screen without having to feed screenshots or paste code.

OpenAI Codex Mobile App: AI Coding Agent Now Available on iOS and Android via ChatGPT

OpenAI's Codex app is now on mobile via ChatGPT for iOS and Android. Here's what it does, how to set it up, how it compares to Claude Code mobile.

OpenAI turns its sold-out GPT-5.5 party into a monthlong Codex giveaway for 8,000 developers

OpenAI gave more than 8,000 GPT-5.5 party applicants 10x Codex rate limits through June 5, escalating its AI coding rivalry with Anthropic.

OpenAI upgrades Codex with Appshots, goal mode and more developer-focused tools

OpenAI has introduced a major update to its Codex platform, adding new features aimed at helping developers work faster and ...

OpenAI’s Codex is now smart enough to control your Mac even when it’s locked

Macworld AI development is new and exciting, but while AI agents are doing their thing, you can end up waiting for a good ...

Codex for Mac updated with new Appshots feature that instantly gives chat context

Appshots is now available for Codex for Mac, adding a new way to instantly provide context to your chat. “On your Mac, press ...

OpenAI's Codex Can Now Use Your Mac Even When It's Locked

OpenAI has rolled out Computer Use for its Codex desktop app on macOS, and its latest trick is that your Mac doesn't even have to be unlocked for the coding agent to use your apps while you're away. In a post on X, OpenAI Developers said users can now send Codex tasks from their phone and have it operate apps on their Mac "even when the screen is off and locked".

Show HN: SoMatic – Vision-based OS automation framework for AI agents

Hi HN, I'm Smyan and I enjoy building agents. Modern multimodal LLMs are great at vision and perception but are quite poor at localization. This naturally creates a massive problem when we try to take our RPA frameworks and give them to agents to perform computer use tasks.For browsers, we have been able to solve this by using the DOM tree to supply the LLM with structural hints and now more recently modern browser use frameworks use Set-Of-Marks prompting which take the structural informat

Show HN: Agent-estimate, how long a coding task takes, at agent speed

I have used Codex & Claude Code for coding for a while, but how long a coding task will actually take? When I ask Claude Code to estimate, the result is often from training data, which is based on human speed. That’s why I built this tool, to estimate effort in ai agent speed. I run it every morning before I dispatch coding tasks to my agents.What's in it: task sizing: auto-classifies XS to XL from the description, then runs PERT on that tier human-equivalent comparison: a per-task-type

Why does it look like LLMs consistently overestimate implementation time?

I have my suspicion: they estimate how long people would have taken to implement some feature, becasue they were trained on such data. I consistently see estimates of 2 week/3 weeks or 5 days, etc. But then implementation takes a day or 2 max using agents within Claude/GPT. Unless I am missing something? Anybody else notice this?

Show HN: OpenRig – a control plane for multi-agent coding topologies

Hi HN, I’m Mike, the founder of OpenRig.I built this because my Claude Code + Codex setup kept forming little "topologies" of long-lived agents that worked well together, but the terminal sprawl was intense. So I built a primitive the agents could intuitively reach for to save and recreate these setups on the fly. This then led to more agent-first primitives like coordination, declarative workflow patterns, workspaces, etc.Several months in and these "rigs" I manage with open

Tell HN: Google slightly changed its wordmark logo

Google is doing some A&#x2F;B testing where in their main site,[0] it will show a slightly modified version of the 2015 logo, as shown in the following Wikipedia article.[1]<p>You can try this for yourself by opening a private&#x2F;incognito window, then visiting the main Google search home page.[0] Close and reopen it again until the logo changes.<p>[0]: https:&#x2F;&#x2F;www.google.com&#x2F;<p>[1]: https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Google_logo

Show HN: CoreMem – Portable context for AI agents

CoreMem lets you build collections of context, called a mem, and share it with any AI agent via URL, a Chrome extension, MCP, Cursor&#x2F;VS Code plugins, a skill, and more. Instead of re-explaining your project or goal when you switch agents or start new sessions, CoreMem keeps your context centrally organized so that any AI tool can read it.This originally started as a CLI I built that kept pieces of context (Project A&#x2F;B&#x2F;C details, my writing style, preferred tech stacks, coding styl

Ask HN: Do people lie about why they hate AI writing on social media?

I think the following might be the case:* people do not distinguish between AI writing based on the human&#x27;s ideas vs AI writing based on the AI&#x27;s ideas* they do this intentionally as a way to denigrate all AI writing — even when the content is good&#x2F;interesting* and the reason they do all this is to delay their unemployment as a result of AISo in the end, they use social media as a way to make people think all AI writing is bad because this is how they are trying to delay their une

OpenAI built a plugin for their biggest competitor's coding tool Claude Code

OpenAI built a plugin for their biggest competitor's coding tool 👀 The Codex plugin drops OpenAI's agent directly inside Claude Code: Claude writes, Codex reviews, and an adversarial mode literally tries to break your logic. Two AI agents checking each other in real time. Setup is 4 commands 👇 #ClaudeCode #Codex #AICoding #VibeCoding #AIEngineering

Show HN: Free One-shot cloud agents with OpenCode and Daytona and Cloudflare

Hi HN! Outside of the hackernews bubble we often find engineers who are barely using AI (aka using microsoft copilot) and we needed an easy way to show the latest capabilities in a non confusing UI.So we dumbed down our product to a simple text box UI where you one-shot your feature and you get an email with a link to a PR in github. The backend is hosted in Cloudflare, spinning sandboxes in Daytona that run the Opencode harness.Feel free to give it a try or share it with people who are skeptica

Codex got better, codex might be built with Claude Opus

Very suspicious with openai codex getting better, I wonder if codex teams use Claude opus to build codex. Anyone engineer from openai who can confirm…

Anthropic Just Bought a Developer Tool Used by OpenAI, Google

Anthropic acquired SDK startup Stainless, signaling a deeper push into developer tooling as AI labs compete beyond model ...

Anthropic Buys Stainless To Cut Off OpenAI And Google SDK Access

Anthropic acquired Stainless, the SDK toolmaker behind OpenAI and Google, then shut the hosted products down for rivals. Inside the agentic AI infrastructure play.