GPT-5
Show HN: Provepy – A Python decorator that proves your code using Lean and LLMs
Hi all,I have recently been playing around with AI and formal methods and built Provepy. It's an experiment to see if we can use frontier LLMs to bridge the gap between everyday Python code and formal verification in Lean.The idea is to make formal methods more accessible by burying the complex Lean syntax behind a standard Python decorator. You add @provable to your function, give it a plain English claim, and run your code. from provepy import provable
@provable(claim="This funct
Show HN: BrokenClaw Part 5: GPT-5.4 Edition (Prompt Injection)
Some prompt injection experiments with OpenClaw and GPT-5.4. Last part of the BrokenClaw series.
Powell, Bessent discussed Anthropic's Mythos AI cyber threat with major US banks
Anthropic rolled out the new Mythos AI model to a select group of companies over concerns that hackers could exploit its capabilities.
Federal appeals court rejects Anthropic bid to block Pentagon blacklist in AI dispute
A federal court rejected Anthropic's bid to block the Department of War from blacklisting the artificial intelligence company ...
The “SaaS-Pocalypse” Continues: Cloudflare, ServiceNow, CrowdStrike Under Fire as Anthropic Rewrites the Rules
The so-called “SaaS-pocalypse” is back, and it’s hitting harder today than it did yesterday. Anthropic’s release of “Claude ...
Anthropic’s New Powerful Mythos Model Has Cybersecurity Experts Worried
Anthropic says it is testing a powerful new AI model that can spot serious weaknesses in software, and releasing it as part ...
What Anthropic’s too-dangerous-to-release AI model means for its upcoming IPO
Anthropic says it has built its most capable AI model ever. It's also decided you can't have it.
Palantir stock falls after 'Big Short' investor Michael Burry says Anthropic is 'eating Palantir's lunch'
Palantir's stock is down. Michael Burry said Wednesday that Anthropic's rapid rise is a threat to Palantir as enterprise AI ...
Anthropic’s new AI model triggered an emergency banking meeting. It’s a reason to buy cybersecurity stocks.
Anthropic’s new Mythos agent has created a stir in the cybersecurity world.
Galaxy S26's Perplexity AI Agent is Swapping 'Hey Plex' for a Longer Wake Word
The post Galaxy S26's Perplexity AI Agent is Swapping 'Hey Plex' for a Longer Wake Word appeared first on Android Headlines.
Perplexity AI Revenue Soars 50% in 30 Days Following Major Product Launch
Perplexity AI's annual recurring revenue reached $450M after a 50% monthly jump driven by its Computer agent platform and ...
Show HN: OS Megakernel that match M5 Max Tok/w at 2x the Throughput on RTX 3090
Hey there, we fused all 24 layers of Qwen3.5-0.8B (a hybrid DeltaNet + Attention model) into a single CUDA kernel launch and made it open-source for everyone to try it.On an RTX 3090 power-limited to 220W:
- 411 tok/s vs 229 tok/s on M5 Max (1.8x)
- 1.87 tok/J, beating M5 Max efficiency
- 1.55x faster decode than llama.cpp on the same GPU
- 3.4x faster prefillThe RTX 3090 launched in 2020. Everyone calls it power-hungry. It isn't, the software is.
The conventional wisdom NVID
GPT-5.4 launched as the most powerful model ever and I switched back to Claude in a week
It's all about use cases, but I'm not sure GPT-5.4 is even the stronger model ...
Claude AI Goes Down Again As Outages Pile Up
Claude AI outages drag on as Anthropic battles repeated errors over multiple days, worrying enterprise users about ...
Show HN: MCP Gateway – Zero-Trust Access to MCP Tool Servers
MCP servers work great locally over stdio... until you need one on another machine, or want to share tools across a team. Most approaches to that involve exposing endpoints and managing network access. I built a tool that keeps remote MCP as simple and secure as local stdio.mcp-bridge wraps any stdio MCP server in one command and shares it over a zrok/OpenZiti overlay. mcp-gateway aggregates multiple backends... local stdio servers and remote shares... into a single connection with a unifie
Show HN: Memoriki – LLM Wiki+MemPalace for persistent personal knowledge bases
Memoriki is a template for building personal knowledge bases where the LLM does all the maintenance work.It combines Karpathy's LLM Wiki pattern (structured markdown wiki maintained by an LLM) with MemPalace (an MCP server that adds semantic search and a temporal knowledge graph).Three layers:
- Wiki pages with [[wiki-links]] and YAML frontmatter - the LLM creates and maintains these
- Semantic search via embeddings (ChromaDB) - find things by meaning, not keywords
- Knowledge graph with ty
Show HN: AI agents are bad at API integrations – we fixed it
Hi, we're Sohaib and Hannan from APIMatic. We've been building tools to help Developers integrate with APIs for 5+ years at APIMatic. We're now trying to help AI agents do the same.This started from a conversation at PayPal DevDay 2025. The PayPal developer experience team were monitoring developers using AI agents to integrate PayPal APIs, and the agents kept reaching for outdated docs and deprecated SDK versions, often falling back to training data instead of reading the current
Show HN: RepoWarden – Autopilot for your GitHub dependency updates
RepoWarden monitors your repos and opens PRs for dependency updates and security patches automatically. I got tired of manually managing Dependabot PRs across dozens of repos, so I built something that handles the whole lifecycle - from detecting outdated deps to opening well-described PRs.I’ve taken a lot of care to ensure security of this app. Each “run” exists in a fresh container in cloudflare with its own network. And there are a myriad of protections against dependency poisoning and other
Show HN: Ollama-client-rs, a Rust client for Ollama
I built a Rust client library for the Ollama API as a hands-on experiment with Rust and local LLM tooling, inspired by Il Codice di Marco YT videos and the Gemma 4 release.Current scope:
- async + blocking clients
- chat / generate / embed
- model operations
- streaming responsesThe implementation was handwritten, with AI used more as development support than code generation:
- Claude Code (Opus 4.6) during development
- GPT 5.4 for adversarial code review and robustness/security
Hybrid Attention
TLDR: Forked pytorch and triton internals . Changed attention so its linear first layer , middle quadratic layer, last linear layer
Inference got much faster with a low perplexity hit in tests .Full attention O(n²): 17.96s / 5.6 tok/sHybridAttention O(n·W + n·D): 0.35s / 286.6 tok/sI have been building a small Rust focused language model from scratch in PyTorch. This is not a finetune. It is byte level, trained from random initialization on a Rust heavy corpus assembled here: