GPT-5

Shocking Reveal: Anthropic Cuts Claude AI Harmful Behaviour From 96% to 3% After Major Fix

Claude AI's harmful behavior is linked to internet training data, says Anthropic. New constitutional AI cuts misalignment and blackmail risks.

Why did Claude AI threaten an engineer to avoid shutdown? Anthropic has the answers

In a recent blog post, Anthropic explained the sequence of events behind Claude AI’s controversial behaviour and shared ...

Show HN: An agent that tunes its own cache

Last weekend I built chat.betterdb.com as a RAG over Valkey/Redis/Dragonfly docs. The goal was to eat our own dogfood and publicly test our caching libraries. It also saved me from having to come up with various demo/test scenarios, as I could extend the building in public to the demo. There is a tool-result cache sitting between the SDK and tools. Each call is normalized and then checked before executing. If it hits we return from the cache, and if not, we check the se
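
A minimal sketch of the normalize-then-check step the snippet describes, not the author's code. The names `normalize_call` and `ToolResultCache` are hypothetical, the in-memory dict stands in for whatever store the real service uses, and because the snippet is cut off before it says what happens on a miss, this sketch assumes the tool is simply executed and its result stored.

```python
import json
from typing import Any, Callable, Dict


def normalize_call(tool: str, args: Dict[str, Any]) -> str:
    """Build a stable cache key (hypothetical normalization rules):
    lowercase strings, collapse whitespace, and sort dict keys."""
    def norm(value: Any) -> Any:
        if isinstance(value, str):
            return " ".join(value.lower().split())
        if isinstance(value, dict):
            return {k: norm(value[k]) for k in sorted(value)}
        if isinstance(value, list):
            return [norm(v) for v in value]
        return value

    return json.dumps({"tool": tool, "args": norm(args)}, sort_keys=True)


class ToolResultCache:
    """Exact-match cache placed in front of tool execution."""

    def __init__(self) -> None:
        self._store: Dict[str, Any] = {}  # assumption: in-memory stand-in
        self.hits = 0
        self.misses = 0

    def call(self, tool: str, args: Dict[str, Any],
             execute: Callable[..., Any]) -> Any:
        key = normalize_call(tool, args)
        if key in self._store:
            # Hit: return the cached result without running the tool.
            self.hits += 1
            return self._store[key]
        # Miss: assumed fallback is to run the tool and remember the result.
        self.misses += 1
        result = execute(**args)
        self._store[key] = result
        return result
```

With this shape, two calls that differ only in casing or spacing share one cache entry, so the second call never reaches the tool.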

Show HN: When the LLM Accidentally

When the LLM accidentally... outputs some high-level abstraction of "thinking" into its direct response. See text block at end. What else have you seen the LLM accidentally do? This isn't jailbreaking on my end - just normal use - with GPT5.4 in this case, reasoning and verbosity both set to "high" (on /completions). Point is, this block (plus lots more) is at the top of the response - then the "actual output" or response later on... but it's kin

Ask HN: Does ChatGPT Pulse provide value to you?

I signed up for Pro recently, and now I get morning updates from GPT of random AI synthesis. Has anyone seen a valid use case for this where they saw something they otherwise wouldn't have or received actionable insight?

Show HN: Codex Automatic /Review Loop

I created this tool because I wanted to automate /review for uncommitted changes, which I was doing manually. This works by exposing a single new MCP tool call to the agent, allowing it to request a review. MCP creates a new Codex instance in the background and uses app-protocol to request a flow identical to /review, with the option to include additional review instructions. GPT-5.5 started to be very good at providing meaningful instructions, describing the desired intent of the introduced changes. This w

Ask HN: I am 17 years old, which college should I apply if I have some projects?

Hello dear reader, this is a long message but I hope that you can bear with me as I must ask for your help as I need it :-D What are the best international colleges that I should apply to? Does anyone have any suggestions? A bit about me: - I am 17 & I am a member of the LiteLLM security working group. (This also means that I am able to work with and learn from the best people including security researchers and even some people from OpenAI and others as such being part of the working group and c

OpenAI Brings Codex to Google Chrome

One year after it launched Codex in preview, OpenAI's coding agent is available pretty much everywhere. Including, now, Google Chrome.

Codex users have been begging OpenAI for this upgrade — and it's finally in the works

OpenAI is finally bringing Codex users the ability to remotely control coding sessions from their smartphones.

Claude Cowork vs ChatGPT Codex: Which is Actually Better for Everyday Users?

Discover the key differences between Claude Cowork and OpenAI Codex, including pricing, integrations, and automation features for regular users.

OpenAI’s Codex now has a tiny AI pet that keeps you updated while you code

OpenAI has introduced Codex Pets, optional animated companions for its Codex desktop app that sit on your screen and track ...

OpenAI Codex can now work directly in Chrome on macOS and Windows

OpenAI is changing the way we interact with the web by bringing Codex directly into your browser. This new integration allows ...

OpenAI's ChatGPT Codex to Finally Get Remote PC Control from Android Devices

OpenAI is developing a new feature for the ChatGPT Android app that will allow users to remotely control Codex coding sessions on their PCs. Found in version 1.2026.125, this update addresses a ...

You can now control Google Chrome with OpenAI's Codex: Here's how to set it up

OpenAI has launched a new Codex extension for Chrome that allows users to control the browser via the AI assistant. Codex can be ...

Show HN: I made an AI tool to keep my browser tabs clean

Since I started working in tech I constantly found myself with literally 100+ tabs open and not enough willpower to organize (or close) them. Any tab manager I found required me to add extra steps to an, admittedly, already easy process that I still wasn't doing. So I decided on building my own thing, and making a product out of it. It's a Chrome-based extension and does a couple of things: - automatically hides tabs from your tab bar but keeps them open in a vertical one - deduplicates tabs

The 2026 AI Infrastructure Shift: GPT-5.4, Claude 4, Gemini 2.5, and Llama 4 Redefine What Builders Can Ship

The AI race has moved past the hype cycle of 'which model is smartest' into the gritty reality of production deployment, cost efficiency, and architectural choices. This month, OpenAI, Anthropic, ...

Tell HN: ChatGPT and Claude web frontends can run bash commands remotely

The result for the prompt "run ls on your own server" on ChatGPT. total 34K drwxr-xr-x 2 root root 160 May 9 17:12 . drwxr-xr-x 2 root root 160 May 9 17:12 .. -rwxr-xr-x 1 root root 0 May 9 17:12 .dockerenv lrwxrwxrwx 1 root root 7 Feb 24 2025 bin -> usr/bin drwxr-xr-x 2 root root 4.0K Dec 31 2024 boot drwxr-xr-x 4 root root 320 May 9 17:12 dev drwxr-xr-x 2 root root 60 May 9 17:12 etc drwxr-xr-x 2 root root 60 May 9 17:12 home lr

Anthropic strikes SpaceX data center deal as it plows ahead on AI coding

SAN FRANCISCO, May 6 (Reuters) - Anthropic on Wednesday said it reached a deal to tap the computing resources of Elon Musk's ...

Anthropic to rent all AI capacity at SpaceX's Colossus data center

According to the agreement, Anthropic will use all of the computing capacity at SpaceX’s Colossus 1 data center. That amounts ...

Anthropic unveils AI agents to field financial services tasks

Anthropic PBC unveiled a set of new artificial intelligence agents designed to handle a broader mix of financial services tasks, part of the company’s push to win over Wall Street.