GPT-5
Show HN: Assign tasks to 7 AI agents with -mentions, autonomous mode, OpenClaw
I posted Mysti here a couple months ago and got a lot of feedback that shaped where I took it. Quick recap: it's a VS Code extension that lets you use multiple AI coding agents through one interface, including having them collaborate on problems.Three things I want to highlight in this release.@-mentions for task delegation. You can now assign work to specific agents inline. Something like: @claude write the API handlers then @gemini review them for security issues then @claude fix what gem
Show HN: AgentDX – Open-source linter and LLM benchmark for MCP servers
MCP servers are proliferating fast, but most have vague tool descriptions and incomplete schemas that make LLMs pick the wrong tool or fill parameters incorrectly.AgentDX is a CLI that measures this. Two commands:- `npx agentdx lint` — static analysis of tool descriptions, schemas, and naming. 18 rules, zero config, no API key. Produces a lint score.- `npx agentdx bench` — sends your tool definitions to an LLM (Anthropic, OpenAI, or Ollama) and evaluates tool selection accuracy, parameter correc
Show HN: Axon – Agentic AI with mandatory user approval and audit logging
Hey HN,I built AXON because I wanted AI agents that can actually do things — but with real security controls.Every tool call (file ops, web search, shell commands, email, code execution) requires explicit user approval before execution. Parameters and risk level are shown, you approve or deny. Everything is logged.Key features:
- Multi-agent system (different roles, models, permissions per agent)
- Multi-LLM: Ollama (fully local), Claude, OpenAI, Gemini, Groq, OpenRouter
- 100% on-premise, no cl
Show HN: Mock any HTTP request from DevTools, with AI-generation and zero setup
Hi HN,I built this after using Requestly, Mokku, Mockiato, Tweak, and Mockoon. Each one either paywalled the features I actually needed, required a separate server running on my machine, or just didn't fit the way I work.The browser is already open. DevTools is already open. I wanted the mocking to live there too, not in a separate app I have to remember to start.So roughly a month ago, I started building my own tools. It intercepts network requests at the browser level and lets you swap th
Show HN: Trust Protocols for Anthropic/OpenAI/Gemini
Much of my work right now involves complex, long-running, multi-agentic teams of agents. I kept running into the same problem: “How do I keep these guys in line?” Rules weren’t cutting it, and we needed a scalable, agentic-native STANDARD I could count on. There wasn’t one. So I built one.Here are two open-source protocols that extend A2A, granting AI agents behavioral contracts and runtime integrity monitoring:- Agent Alignment Protocol (AAP): What an agent can do / has done.
- Agent In
Palantir is caught in the middle of a brewing fight between Anthropic and the Pentagon
The Defense Department is threatening to blacklist Anthropic over limits on military use, potentially putting one of its top ...
As AI jitters rattle IT stocks, Infosys partners with Anthropic to build ‘enterprise-grade’ AI agents
Under the partnership, Infosys plans to integrate Anthropic's Claude models into its Topaz AI platform to build so-called "agentic" systems.
Pentagon Weighs Axing $200M Anthropic Deal in Moral Standoff Over AI Safeguards
The Pentagon may cut a $200 million Anthropic deal after the AI firm refused to lift moral safeguards on surveillance and autonomous weapons use.
Anthropic Launches Claude Sonnet 4.6 as Default Model for Free and Paid Users
Anthropic rolls out Claude Sonnet 4.6 as its new default model, bringing stronger reasoning and coding power to free and paid users alike.
How Anthropic Sweetens the Deal for Its Cloud Providers
Anthropic expects to pay Amazon, Google and Microsoft at least $80 billion to run its Claude AI on their cloud servers through 2029, according to the startup’s most optimistic recent forecasts. That’s ...
Anthropic has another new model—software stocks are going to hate it
Anthropic is leading the charge in user development of autonomous agentic systems, and its Tuesday debut of Claude Sonnet 4.6 ...
Hegseth ‘close’ to blacklisting AI firm Anthropic as heated negotiations hit boiling point: report
Defense Secretary Pete Hegseth is allegedly “close” to cutting off the Pentagon’s ties to AI firm Anthropic and placing it on ...
Anthropic’s Claude Commercial Against ChatGPT Ads Delivers Real Results
Anthropic’s Super Bowl ads mocking ChatGPT ads delivered the biggest user boost of any AI company. including an 11% jump in daily users and 6.5% more website visits.
Anthropic releases Claude Sonnet 4.6, continuing breakneck pace of AI model releases
Claude Sonnet 4.6 is more consistent with coding and is better at following coding instructions, Anthropic said.
Anthropic was supposed to be a ‘safe’ alternative to OpenAI, but CEO Dario Amodei admits his company struggles to balance safety with profits
Anthropic’s CEO says intense commercial pressure is testing the company’s safety-first AI mission.
OpenAI Retires GPT-4o Again, Stirring Backlash From Loyal ChatGPT Users
OpenAI retires GPT-4o despite fierce user loyalty. This only triggered debate over AI attachment, safety concerns, and the future of ChatGPT models.
OpenAI removes GPT-4o and GPT-4.1 models from ChatGPT starting today: Here’s why
OpenAI has retired GPT-4o and other older models from ChatGPT today, citing a shift to newer GPT-5 systems amid controversy and legal scrutiny.
OpenAI removes access to sycophancy-prone ChatGPT-4o model
The model is known for its overly sycophantic nature and its role in several lawsuits involving users' unhealthy relationships to the chatbot.
Can GPT-5.2 solve a complex physics problem? AI achieves a path-breaking scientific breakthrough after solving a decade-long mystery
An advanced AI system has solved a decade-old theoretical physics puzzle, proposing a new formula for gluon interactions. The AI, GPT-5.2 Pro, spent 12 hours developing a mathematical proof, revealing ...
Farewell, GPT-4o: I tested it one last time against GPT-5.2 — here’s what we’re losing
ChatGPT-4o is gone. We ran 9 final tests against GPT-5.2 to see what changed — and what users may miss.