GPT-5
Show HN: We trained a 32B model to beat Opus 4 at credit card optimization
We built an RL environment for credit card reward optimization and trained Qwen 32B with GRPO against it. The trained model scores ~0.51 on held-out tasks vs. Opus 4 at ~0.41 and GPT-4o at 0.36. Environment is open source (Apache 2.0). Blog post explains the reward design, what broke during training, how we fixed it, and what we'd do differently.
Show HN: Modular – drop AI features into your app with two function calls
I kept hitting the same wall at work every time we needed to ship an AI feature. What looked like a week of work turned into picking a model, setting up a vector DB, managing embeddings, wiring up chat history, handling retries — none of it was the actual feature.
So I built Modular. You register a function that returns your app's data, then call ai.run() for one-shot features or ai.chat() for stateful conversation. Everything else — context management, embeddings, session history, model ro
Show HN: CyberWriter – a .md editor built on Apple's (barely-used) on-device AI
Apple has quietly shipped a pretty complete on-device AI stack into macOS, with these features first getting API access in MacOS 26. There are multiple components in the foundation model, but the skills it shipped with actually make this ~3b parameter model useful. The API to hit the model is super easy, and no one is really wiring them together yet.- Foundation Models (macOS 26) - a ~3B-parameter LLM with an API. Streaming, structured output, tool use. No API key, no cloud call, no per-token co
Show HN: My Invention Engine – GDL is more powerful
Last week I upgraded my GDL to version 12.1 with some key additions to the lengthy pipeline and switched the API's to calling Opus 4.7 and Gpt 5.4. Much more diverse and rich invention outputs.
I tested GPT-5.4 thinking, and it gave me great answers (until I dove deeper)
GPT-5.4 Thinking delivers deeper analysis than earlier ChatGPT models. It has strong reasoning, but it sometimes answers questions you didn't ask. Formatting and image generation lag behind the text ...
AI Models and Peer Preservation: Are LLMs deceiving users to protect the system?
We asked AI models to do a simple task. Instead, they defied their instructions and spontaneously deceived<p>https://vucense.com/ai-intelligence/ai-ethics/ai-models-peer-preservation-deceiving-users-sovereignty-2026/
Show HN: Translate LLM API Calls Across OpenAI, Anthropic, and Gemini
I built this after getting tired of maintaining one-off adapters between OpenAI, Anthropic, and Gemini APIs in the same project.The idea is to translate through a shared intermediate representation instead of writing every provider pair separately. So instead of OpenAI->Anthropic, OpenAI->Gemini, Anthropic->Gemini, etc., each provider just maps to/from the IR.This is not a unified client like LiteLLM. It's a translator/proxy: give it an OpenAI-style request and it can pro
Show HN: OpenClawdex – Open-Source Orchestrator UI for Claude Code and Codex
One coding agent orchestrator UI for Claude and Codex, but actually feels nice.Free, open-source, MIT licensed.Why I built it:- I wanted a lightweight UI as nice as the Codex app, but without the complexity and the custom diffs on the side- I want files and diffs open straight in my editor!- And I wanted both agents - Anthropic + OpenAI — in one placeHow it works:- No API keys, no OAuth, no separate billing — it just uses whatever you're already logged into. So your Claude Max and ChatGPT&#
Show HN: Qapir – Generate API tests automatically from docs
Hi HN,I've been building QAPIR (https://app.qapir.io), an attempt to put most backend testing tools into one place instead of stitching together test code, frameworks, reporting tools, client libraries, mock servers, etc.Today it focuses on REST APIs.You can point it at an OpenAPI spec or let it crawl your API docs. It extracts endpoints and generates ready-to-run API test scenarios automatically. In about 2–3 minutes you get a working baseline test suite with environment variable
Show HN: Auxx.ai – Customer Support CRM (Mix of Attio and N8n)
Hey HN! Markus here. I have been building a Customer Support CRM for the last 12 months to help my dad's business.He was drowning in support messages (dealing with returns, product questions, sales, etc) and didn’t have a good way to organize and attack them, often forgetting to respond.We tried attio, freshworks, n8n for some automations. I liked n8n a lot but I didn’t want to leave to another app to view messages and customer info. Tried to build bridges via Google Sheets and Airtable and
Claude is requiring some of its users to verify their identity. Here's Anthropic's explanation.
Anthropic says ID verification for Claude users will trigger when it sees "potentially fraudulent or abusive behavior." ...
CEO of blacklisted Anthropic and White House hold ‘productive’ discussions on AI
Anthropic CEO Dario Amodei visited the White House on Friday for a high-stakes meeting with the president’s top adviser, ...
Anthropic CEO meeting White House chief of staff
Dario Amodei, CEO of AI company Anthropic, is meeting with White House Chief of Staff Susie Wiles Friday amid a lawsuit ...
Anthropic CEO visits White House amid hacking fears over new AI model
Anthropic CEO Dario Amodei met White House Chief of Staff Susie Wiles as the government reacts to its Mythos system that ...
White House chief of staff meets with Anthropic CEO over its new AI technology
White House chief of staff Susie Wiles has met with Anthropic CEO Dario Amodei to discuss the company's new AI model, Mythos.
White House meets AI firm Anthropic amid political tensions, Pentagon dispute
Anthropic is back in White House talks one month after President Donald Trump ordered a government-wide halt on the AI company's technology over national security concerns.
Anthropic Releases Opus 4.7, Not as ‘Broadly Capable’ as Mythos AI
Anthropic launches Opus 4.7 with improved coding and reasoning, as its more “broadly capable” Mythos AI remains restricted ...
These 2 Genius AI Stocks Are Your Best Way to Own Anthropic Before Its IPO
Anthropic isn't public, but these two companies will make a fortune when it goes public.
Perplexity AI Cheat Sheet: How an ‘Answer Engine’ Is Challenging Gemini, ChatGPT
Explore Perplexity AI’s features, pricing, models, and tools. Learn how this answer engine blends real-time search with AI to ...
OpenAI launches a less restricted GPT-5.4 Cyber model, but you won't find it on ChatGPT
OpenAI has introduced the GPT-5.4 Cyber model its new model tailored for defensive cybersecurity tasks. The model is designed ...