GPT-5

Everyone is focusing on AI, we're focusing on humans

Well, as the title says, it seems that everyone is looking to build something AI related. I guess it’s the AI bubble. While AI is great in many industries, in language learning and exchange we believe humans are the core. Especially when wanting to achieve fluency. Talking to natives cannot be replaced by anything unless there is a very sophisticated AI that looks entirely like a human, acts like one, has the culture, and everything comes into place, which I believe is still too far away.All I s

Show HN: AI coding agent for VS Code with pay-as-you-go pricing- no subscription

I built LLM OneStop Code—an AI coding agent for VS Code that works like Claude Code or Cursor, but with one key difference: pure pay-as-you-go pricing. No monthly subscription required.The problem with existing tools: - Cursor: $20/month for Pro (even if you barely use it) - GitHub Copilot: $10/month minimum - Claude Code: Rate-limited by API usage tier + monthly capsLLM OneStop Code charges only for what you use—billed in credits at cost + 5%. If you code 2 hours this month and 40 the

Claude AI Gets Weirdly Slow After 9 PM (I Noticed It While Reviewing Code)

I ran into something interesting recently while using Claude AI to review some of my code.During the day the responses were *fast*. I could paste a file, ask for suggestions, iterate quickly, and the workflow felt smooth.But when I tried doing the same thing later in the evening — around *9 PM and after* — the experience changed a lot.Responses suddenly took *much longer*. Sometimes it would sit there “thinking” for quite a while before returning the review.At first I assumed it was something

Show HN: PDR AI – Open-source startup accelerator engine for non-technical chaos

Show HN: PDR AI – Open-source startup accelerator engine for non-technical chaos (marketing, PRDs, onboarding)A couple weeks ago I shared PDR AI as an open-source tool for startup doc mess[](https://news.ycombinator.com/item?id=47258661). Since then I've doubled down on the core vision: it's not just another RAG chat—it's an AI-powered accelerator engine that helps technical founders skip the non-technical pitfalls and move faster.As a solo technical founder, I wast

Show HN: SafeAgent – exactly-once execution guard for AI agent side effects

LLM agents retry tool calls constantly.Retries can happen because of: - model loops - HTTP timeouts - queue retries - orchestration restartsIf the tool triggers something irreversible you can end up with duplicate side effects:retry → duplicate payment retry → duplicate email retry → duplicate ticket retry → duplicate tradeSafeAgent is a small Python guard that sits between the agent decision and the side effect.Pattern:agent decision → deterministic request_id generated → execution gu

Show HN: AgentArmor – open-source 8-layer security framework for AI agents

I've been talking to founders building AI agents across fintech, devtools, and productivity – and almost none of them have any real security layer. Their agents read emails, call APIs, execute code, and write to databases with essentially no guardrails beyond "we trust the LLM."So I built AgentArmor: an open-source framework that wraps any agentic architecture with 8 independent security layers, each targeting a distinct attack surface in the agent's data flow.The 8 laye

Show HN: GitAgent – An open standard that turns any Git repo into an AI agent

We built GitAgent because we kept seeing the same problem: every agent framework defines agents differently, and switching frameworks means rewriting everything.GitAgent is a spec that defines an AI agent as files in a git repo.Three core files — agent.yaml (config), SOUL.md (personality/instructions), and SKILL.md (capabilities) — and you get a portable agent definition that exports to Claude Code, OpenAI Agents SDK, CrewAI, Google ADK, LangChain, and others.What you get for free by being

Show HN: I built a Chrome extension to block Instagram's feed and keep only DMs

Hey HN, I built this because I kept opening Instagram to reply to a DM and resurfacing 30–40 minutes later deep in Reels. The problem wasn't willpower, it was that Instagram puts the addictive stuff between you and the useful stuff.Mindful Instagram replaces the Instagram home page with a quiet overlay: an inspirational quote, your DM notifications (fetched from Instagram's own API), a daily open counter, and a 1-minute breathing exercise. Messages, Create, and Profile are always acces

Show HN: Docgen – A C++ AI CLI to solve documentation hell with local LLMs

Hi HN,I’m a solo dev who got tired of the "documentation hell", either spending hours writing docs that immediately become outdated, or having no docs at all. I wanted a tool that treats documentation generation as a standard build step, so I built Docgen.Docgen is a lightweight AI CLI tool written in C++ that automates docs-as-code. It sits in your repo (via a .docgen folder and a Docfile) and generates Markdown files next to your source.A few technical details on how it works under t

Toolpack SDK, an Open Source TypeScript SDK for Building AI-Powered Applications

Just Released Toolpack SDK — a completely Open-Source unified TypeScript SDK for AI developmentIf you've worked with multiple LLM providers, you know the pain: each has different APIs, different tool formats, different quirks.Toolpack SDK gives you a single interface across OpenAI, Anthropic, Gemini, and Ollama.It comes with 77 built-in tools for file ops, git, databases, web scraping, code analysis, and shell commands. You can also create and integrate your own custom tools.The workflow en

Show HN: Iris – first MCP-native eval and observability tool for AI agents

I kept running into the same problem building AI agents: once they're running, I have no idea what they're actually doing. Traditional monitoring shows me HTTP 200. It can't tell me the output was wrong, that the agent leaked a user's email address, or that a single tool call in the chain is burning through tokens.So I built Iris. It's an open-source MCP server — not an SDK, not a proxy. Any MCP-compatible agent (Claude Desktop, Cursor, or anything built with the MCP SDK

Kraken – open-source autonomous dev agent for the terminal

Hey HN,I've been building Kraken for the past month — an open source autonomous dev agent that runs entirely in your terminal.The architecture is a three-process system: a Rust scheduler (cron + file watchers), a Go LLM gateway (supports OpenAI, Anthropic and OpenRouter), and a TypeScript/React TUI built with OpenTUI. All three communicate over ConnectRPC on localhost.A few things I wanted to get right from the start:- Model-agnostic: uses a custom XML-based tool-calling protocol i

Trump order cutting ties with Anthropic likely coming this week, sources say

President Trump will issue an executive order to remove Anthropic's AI technology from agencies across the executive branch, sources familiar with the matter tell CBS News.

Microsoft says court should temporarily block Pentagon's blacklist of Anthropic

Microsoft threw its support behind Anthropic and advocated for a temporary restraining order to the Pentagon's supply chain risk designation.

Anthropic Launches Institute to Examine AI’s Impact on Jobs, Security, and Society

Anthropic launches the Anthropic Institute to study AI’s impact on jobs, governance, and security as the company warns rapid advances could reshape society.

Anthropic's Democratic ties under fire as Trump admin severs Pentagon contracts

As the Trump administration and Anthropic battle it out over AI restrictions, the company's roster of former Democratic staffers raises questions.

OpenAI's GPT-4.5 dominates multiple categories on Chatbot Arena0 0

OpenAI's GPT-4.5 has debuted, claiming top performance in various categories on Chatbot Arena, particularly in multi-turn conversations Last week, OpenAI introduced GPT-4.5, its largest frontier model ...

GitHub Copilot unlocks OpenAI's GPT-5.4 in VSCode and more

GitHub Copilot has added OpenAI’s GPT-5.4 coding model, bringing improvements to reasoning and multi step development tasks.

Emil Michael explains why US flags Claude AI as security risk

The US Defence Department has formally designated Anthropic’s Claude AI models as a national security supply-chain risk, marking the first time an American AI company has received this...

Claude AI Goes Down for Thousands, Downdetector Reports

Thousands reported issues with Claude AI, leading to a potential outage. Learn more about the recent problems and solutions.