GPT-5

Show HN: Assign tasks to 7 AI agents with -mentions, autonomous mode, OpenClaw

I posted Mysti here a couple months ago and got a lot of feedback that shaped where I took it. Quick recap: it's a VS Code extension that lets you use multiple AI coding agents through one interface, including having them collaborate on problems.Three things I want to highlight in this release.@-mentions for task delegation. You can now assign work to specific agents inline. Something like: @claude write the API handlers then @gemini review them for security issues then @claude fix what gem

Show HN: AgentDX – Open-source linter and LLM benchmark for MCP servers

MCP servers are proliferating fast, but most have vague tool descriptions and incomplete schemas that make LLMs pick the wrong tool or fill parameters incorrectly.AgentDX is a CLI that measures this. Two commands:- `npx agentdx lint` — static analysis of tool descriptions, schemas, and naming. 18 rules, zero config, no API key. Produces a lint score.- `npx agentdx bench` — sends your tool definitions to an LLM (Anthropic, OpenAI, or Ollama) and evaluates tool selection accuracy, parameter correc

Show HN: Axon – Agentic AI with mandatory user approval and audit logging

Hey HN,I built AXON because I wanted AI agents that can actually do things — but with real security controls.Every tool call (file ops, web search, shell commands, email, code execution) requires explicit user approval before execution. Parameters and risk level are shown, you approve or deny. Everything is logged.Key features: - Multi-agent system (different roles, models, permissions per agent) - Multi-LLM: Ollama (fully local), Claude, OpenAI, Gemini, Groq, OpenRouter - 100% on-premise, no cl

Show HN: Mock any HTTP request from DevTools, with AI-generation and zero setup

Hi HN,I built this after using Requestly, Mokku, Mockiato, Tweak, and Mockoon. Each one either paywalled the features I actually needed, required a separate server running on my machine, or just didn't fit the way I work.The browser is already open. DevTools is already open. I wanted the mocking to live there too, not in a separate app I have to remember to start.So roughly a month ago, I started building my own tools. It intercepts network requests at the browser level and lets you swap th

Show HN: Trust Protocols for Anthropic/OpenAI/Gemini

Much of my work right now involves complex, long-running, multi-agentic teams of agents. I kept running into the same problem: “How do I keep these guys in line?” Rules weren’t cutting it, and we needed a scalable, agentic-native STANDARD I could count on. There wasn’t one. So I built one.Here are two open-source protocols that extend A2A, granting AI agents behavioral contracts and runtime integrity monitoring:- Agent Alignment Protocol (AAP): What an agent can do / has done. - Agent In

Palantir is caught in the middle of a brewing fight between Anthropic and the Pentagon

The Defense Department is threatening to blacklist Anthropic over limits on military use, potentially putting one of its top ...

As AI jitters rattle IT stocks, Infosys partners with Anthropic to build ‘enterprise-grade’ AI agents

Under the partnership, Infosys plans to integrate Anthropic's Claude models into its Topaz AI platform to build so-called "agentic" systems.

Pentagon Weighs Axing $200M Anthropic Deal in Moral Standoff Over AI Safeguards

The Pentagon may cut a $200 million Anthropic deal after the AI firm refused to lift moral safeguards on surveillance and autonomous weapons use.

Anthropic Launches Claude Sonnet 4.6 as Default Model for Free and Paid Users

Anthropic rolls out Claude Sonnet 4.6 as its new default model, bringing stronger reasoning and coding power to free and paid users alike.

How Anthropic Sweetens the Deal for Its Cloud Providers

Anthropic expects to pay Amazon, Google and Microsoft at least $80 billion to run its Claude AI on their cloud servers through 2029, according to the startup’s most optimistic recent forecasts. That’s ...

Anthropic has another new model—software stocks are going to hate it

Anthropic is leading the charge in user development of autonomous agentic systems, and its Tuesday debut of Claude Sonnet 4.6 ...

Hegseth ‘close’ to blacklisting AI firm Anthropic as heated negotiations hit boiling point: report

Defense Secretary Pete Hegseth is allegedly “close” to cutting off the Pentagon’s ties to AI firm Anthropic and placing it on ...

Anthropic’s Claude Commercial Against ChatGPT Ads Delivers Real Results

Anthropic’s Super Bowl ads mocking ChatGPT ads delivered the biggest user boost of any AI company. including an 11% jump in daily users and 6.5% more website visits.

Anthropic releases Claude Sonnet 4.6, continuing breakneck pace of AI model releases

Claude Sonnet 4.6 is more consistent with coding and is better at following coding instructions, Anthropic said.

Anthropic was supposed to be a ‘safe’ alternative to OpenAI, but CEO Dario Amodei admits his company struggles to balance safety with profits

Anthropic’s CEO says intense commercial pressure is testing the company’s safety-first AI mission.

OpenAI Retires GPT-4o Again, Stirring Backlash From Loyal ChatGPT Users

OpenAI retires GPT-4o despite fierce user loyalty. This only triggered debate over AI attachment, safety concerns, and the future of ChatGPT models.

OpenAI removes GPT-4o and GPT-4.1 models from ChatGPT starting today: Here’s why

OpenAI has retired GPT-4o and other older models from ChatGPT today, citing a shift to newer GPT-5 systems amid controversy and legal scrutiny.

OpenAI removes access to sycophancy-prone ChatGPT-4o model

The model is known for its overly sycophantic nature and its role in several lawsuits involving users' unhealthy relationships to the chatbot.

Can GPT-5.2 solve a complex physics problem? AI achieves a path-breaking scientific breakthrough after solving a decade-long mystery

An advanced AI system has solved a decade-old theoretical physics puzzle, proposing a new formula for gluon interactions. The AI, GPT-5.2 Pro, spent 12 hours developing a mathematical proof, revealing ...

Farewell, GPT-4o: I tested it one last time against GPT-5.2 — here’s what we’re losing

ChatGPT-4o is gone. We ran 9 final tests against GPT-5.2 to see what changed — and what users may miss.