GPT-5

Federal appeals court rejects Anthropic bid to block Pentagon blacklist in AI dispute

A federal court rejected Anthropic's bid to block the Department of War from blacklisting the artificial intelligence company ...

Powell, Bessent discussed Anthropic's Mythos AI cyber threat with major US banks

Anthropic rolled out the new Mythos AI model to a select group of companies over concerns that hackers could exploit its capabilities.

Anthropic’s New Powerful Mythos Model Has Cybersecurity Experts Worried

Anthropic says it is testing a powerful new AI model that can spot serious weaknesses in software, and releasing it as part ...

The “SaaS-Pocalypse” Continues: Cloudflare, ServiceNow, CrowdStrike Under Fire as Anthropic Rewrites the Rules

The so-called “SaaS-pocalypse” is back, and it’s hitting harder today than it did yesterday. Anthropic’s release of “Claude ...

Palantir stock falls after 'Big Short' investor Michael Burry says Anthropic is 'eating Palantir's lunch'

Palantir's stock is down. Michael Burry said Wednesday that Anthropic's rapid rise is a threat to Palantir as enterprise AI ...

What Anthropic’s too-dangerous-to-release AI model means for its upcoming IPO

Anthropic says it has built its most capable AI model ever. It's also decided you can't have it.

Anthropic’s new AI model triggered an emergency banking meeting. It’s a reason to buy cybersecurity stocks.

Anthropic’s new Mythos agent has created a stir in the cybersecurity world.

Galaxy S26's Perplexity AI Agent is Swapping 'Hey Plex' for a Longer Wake Word

The post Galaxy S26's Perplexity AI Agent is Swapping 'Hey Plex' for a Longer Wake Word appeared first on Android Headlines.

Perplexity AI Revenue Soars 50% in 30 Days Following Major Product Launch

Perplexity AI's annual recurring revenue reached $450M after a 50% monthly jump driven by its Computer agent platform and ...

Show HN: OS Megakernel that match M5 Max Tok/w at 2x the Throughput on RTX 3090

Hey there, we fused all 24 layers of Qwen3.5-0.8B (a hybrid DeltaNet + Attention model) into a single CUDA kernel launch and made it open-source for everyone to try it.On an RTX 3090 power-limited to 220W: - 411 tok/s vs 229 tok/s on M5 Max (1.8x) - 1.87 tok/J, beating M5 Max efficiency - 1.55x faster decode than llama.cpp on the same GPU - 3.4x faster prefillThe RTX 3090 launched in 2020. Everyone calls it power-hungry. It isn't, the software is. The conventional wisdom NVID

GPT-5.4 launched as the most powerful model ever and I switched back to Claude in a week

It's all about use cases, but I'm not sure GPT-5.4 is even the stronger model ...

Claude AI Goes Down Again As Outages Pile Up

Claude AI outages drag on as Anthropic battles repeated errors over multiple days, worrying enterprise users about ...

Show HN: MCP Gateway – Zero-Trust Access to MCP Tool Servers

MCP servers work great locally over stdio... until you need one on another machine, or want to share tools across a team. Most approaches to that involve exposing endpoints and managing network access. I built a tool that keeps remote MCP as simple and secure as local stdio.mcp-bridge wraps any stdio MCP server in one command and shares it over a zrok/OpenZiti overlay. mcp-gateway aggregates multiple backends... local stdio servers and remote shares... into a single connection with a unifie

Show HN: AI agents are bad at API integrations – we fixed it

Hi, we're Sohaib and Hannan from APIMatic. We've been building tools to help Developers integrate with APIs for 5+ years at APIMatic. We're now trying to help AI agents do the same.This started from a conversation at PayPal DevDay 2025. The PayPal developer experience team were monitoring developers using AI agents to integrate PayPal APIs, and the agents kept reaching for outdated docs and deprecated SDK versions, often falling back to training data instead of reading the current

Show HN: Memoriki – LLM Wiki+MemPalace for persistent personal knowledge bases

Memoriki is a template for building personal knowledge bases where the LLM does all the maintenance work.It combines Karpathy's LLM Wiki pattern (structured markdown wiki maintained by an LLM) with MemPalace (an MCP server that adds semantic search and a temporal knowledge graph).Three layers: - Wiki pages with [[wiki-links]] and YAML frontmatter - the LLM creates and maintains these - Semantic search via embeddings (ChromaDB) - find things by meaning, not keywords - Knowledge graph with ty

Show HN: RepoWarden – Autopilot for your GitHub dependency updates

RepoWarden monitors your repos and opens PRs for dependency updates and security patches automatically. I got tired of manually managing Dependabot PRs across dozens of repos, so I built something that handles the whole lifecycle - from detecting outdated deps to opening well-described PRs.I’ve taken a lot of care to ensure security of this app. Each “run” exists in a fresh container in cloudflare with its own network. And there are a myriad of protections against dependency poisoning and other

Show HN: Ollama-client-rs, a Rust client for Ollama

I built a Rust client library for the Ollama API as a hands-on experiment with Rust and local LLM tooling, inspired by Il Codice di Marco YT videos and the Gemma 4 release.Current scope: - async + blocking clients - chat / generate / embed - model operations - streaming responsesThe implementation was handwritten, with AI used more as development support than code generation: - Claude Code (Opus 4.6) during development - GPT 5.4 for adversarial code review and robustness/security

Hybrid Attention

TLDR: Forked pytorch and triton internals . Changed attention so its linear first layer , middle quadratic layer, last linear layer Inference got much faster with a low perplexity hit in tests .Full attention O(n²): 17.96s / 5.6 tok/sHybridAttention O(n·W + n·D): 0.35s / 286.6 tok/sI have been building a small Rust focused language model from scratch in PyTorch. This is not a finetune. It is byte level, trained from random initialization on a Rust heavy corpus assembled here:

I tried to create my own security agent

Claude just previewed their Mythos model and they say it is "strikingly capable at computer security tasks" and they didn't release it to the general public. I tried creating a minimal version and the results are surprising. It found many vulnerabilities in critical open source projects and I was able to validate them. I used gpt-5.3-codex and it was able to find vulnerabilities in WebKit - browser engine used in Safari, some very popular android apps, open source web panel. The v

LLM plays an 8-bit Commander X16 game using structured "smart senses"

I connected the ChatGPT API (model gpt-4o) to an 8-bit shoot-'em-up game, PvP-AI, running on a Commander X16 emulator.Instead of pixels or audio, the model receives structured text summaries, what I’m calling "smart senses", based on the game's existing touch and EMF-style inputs.The LLM maintains notes between turns, develops strategies across games, and even discovers an exploit in the built-in AI's behavior.Write-up, technical details, and three sequential gameplay re