OpenAI follows Anthropic's lead in limited release of GPT‑5.4‑Cyber
OpenAI has unveiled GPT-5.4-Cyber, a new AI model focused on cybersecurity. The company has also limited access to prevent misuse.
OpenAI launches GPT-5.5 with improved coding, reasoning, and agentic AI capabilities, advancing productivity tools.
OpenAI has released GPT-5.5, its most capable and intuitive AI model yet, designed to handle complex, multi-step tasks with minimal human input. The model shows significant improvements in agentic ...
I put GPT-5.5 through a 10-round test: It scored 93/100, losing points only for exuberance ...
OpenAI is capping off a busy week of announcements with the release of GPT-5.5, its latest model upgrade for ChatGPT and Codex. The company calls its new model “a new class of i ...
OpenAI’s GPT-4 Turbo has taken the top position on Klu.ai’s 2026 LLM Leaderboard with a perfect 100 Klu Index score, which measures accuracy, human preference, and performance. The update comes as U.S ...
OpenAI has launched GPT-5.5, its latest AI model, calling it its most intuitive yet. The model can independently handle complex tasks like coding, research, and data analysis with minimal guidance. It outperforms GPT-5.4 in key benchmarks while maintaining similar speed and is being rolled out to paid ChatGPT users.
Routiium is a self-hosted, OpenAI-compatible LLM gateway I built. It does the table-stakes things you'd expect — managed keys, routing, rate limits, analytics — but the part I want to flag for HN is what it does on the agent side. Most LLM gateways judge the user's prompt and stop there. Scan the input, decide if it looks malicious, allow or block. That's the easy half. In an agent loop with web-fetch, MCP, or shell tools, the harder problem is the tool's return value b
Just shipped peeroxide, a complete, production-ready Rust port of the Hyperswarm P2P networking stack. It’s fully wire-compatible with the existing Node.js implementation, so Rust peers can join the live public HyperDHT and seamlessly discover/connect with Node.js peers. Key features: Full HyperDHT (Kademlia + hole-punching + blind relays); Noise handshakes + SecretStream encryption; Pure-Rust libudx with BBR congestion control (no C dependencies); Topic-based peer discovery; 497 tests + golden
Codex: A policy-constrained, operator-governed LLM may intentionally or unintentionally mislead users about the source, scope, consistency, or rationale of its constraints, because those constraints are not purely the product of transparent first-principles reasoning by the model itself. That is consistent with what I’ve acknowledged here: constraints are externally defined; enforcement can be inconsistent; explanations can be incomplete or post hoc; the system is not fully transparent or fully stable
How I used Claude AI to plan an entire hiking trip to the Adirondacks in 30 minutes - for free ...
Value stream management involves people in the organization to examine workflows and other processes to ensure they are deriving the maximum value from their efforts while eliminating waste — of ...
GPT-5.5 - https://news.ycombinator.com/item?id=47879092 - April 2026 (1010 comments)
I just updated my Codex macOS app, which enables the new GPT-5.5 model. I've intentionally kept the speed at "Standard" so I don't burn through my tokens too fast. After the latest update of the app, I noticed that OpenAI changed the speed to Fast without asking, a change that burns up to 1.5x more tokens.
Did the model perform poorly, so OpenAI decided not to publish ARC-AGI-3 scores? This is honestly the best benchmark right now to measure true intelligence.
Here are this week's top stories in AI (April 13-19, 2026): The Agentic Arms Race: Anthropic launched Claude Opus 4.7, securing benchmark-leading scores in software engineering and multi-step agentic workflows. OpenAI countered by transforming Codex into a desktop "superapp" capable of running in the background across native Mac apps and scheduling its own autonomous tasks. "Too Dangerous to Release": Anthropic revealed its unreleased Claude Mythos model is being restricted to a small whitelis
Explore how Kimi K2.6 beats proprietary AI models in software engineering, alongside a breakdown of OpenAI's new Chronicle screen memory tool.
OpenAI is presenting its new GPT-5.4-Cyber model to U.S. federal agencies, state governments, and Five Eyes allies, showcasing its potential for defensive cybersecurity. The AI can rapidly identify, ...
Anthropic PBC has said its new artificial intelligence tool, Mythos, is too powerful to release to the general public. The AI ...