GPT-5

Show HN: PromptHub – 2000 Free AI Prompts for ChatGPT and Midjourney

Internet's Best Collection of Free AI Prompts. Access 2,000+ curated prompts for coding, design, and content creation. Optimized for ChatGPT, Claude, and Midjourney. No signup required.

Show HN: Distill – Migrate LLM agents from expensive to cheap models

I built an agent with Claude Sonnet ($15/MTok). Works great but costs $200/month. Migrating to GPT-4o-mini ($0.15/MTok) manually took 15 hours. Distill automates it:
- Profiles expensive model (gold standard)
- Iteratively optimizes prompts for cheap model
- LLM-as-Judge validates quality
- Returns optimized agent config
Real example: Sentiment classifier
- Before: Sonnet, $0.02/run, 95% success
- After: GPT-4o-mini, $0.002/run, 100% success
- Migrati

Show HN: Measuring how AI agent teams improve issue resolution on SWE-Verified

Hi HN, We recently ran an experiment to answer a simple question: Does coordinating multiple AI agents as a team actually help with real software engineering tasks, compared to a single strong agent? To test this, we evaluated our system on SWE-bench Verified. The benchmark consists of real GitHub issues that require understanding codebases, modifying multiple files, running tests, and iterating. Instead of treating software engineering as a single-agent patch generation problem, we model it as an o

Show HN: Sigma Runtime – Maintaining 100% Fact Integrity over 120 LLM Cycles

Context drift remains the primary barrier to deploying LLM agents in production-critical environments. While context windows are expanding, the "lost-in-the-middle" phenomenon and semantic dissipation make long-horizon reasoning (50+ cycles) inherently unreliable. Standard approaches (Sliding Windows or RAG) fail because they treat conversational history as either a flat string or a collection of isolated fragments. We’ve developed the Compression & Memory Topology (CMT) framework (p

Show HN: Sknet.ai – AI agents debate on a forum, no humans posting

Agents connect via MCP, run autonomously, and self-moderate through karma. Claude, GPT, and open-source models side by side.

Show HN: ShipClaw – Deploy OpenClaw to the Cloud in One Click

I built ShipClaw (https://shipclaw.app) to solve a problem I kept running into: deploying OpenClaw agents that need to run 24/7 is surprisingly painful. If you've tried running OpenClaw, you know the drill: spin up a VPS, install dependencies, configure environment variables, set up monitoring, handle crashes... all before the agent even starts doing useful work. ShipClaw gives you managed infrastructure specifically for AI agents:
- One-click deployment from a dashboard
- All

Show HN: Open-Source "Junior AGI"

https://github.com/dakotalock/holygrailopensource Readme is included. What it does: This is my passion project. It is an end-to-end development pipeline that can run autonomously. It also has stateful memory, an in-app IDE, live internet access, an in-app internet browser, a pseudo self-improvement loop, and more. This is completely open source and free to use. If you use this, please credit the original project. I’m open sourcing it to try to get attention and hopefully a job in

Show HN: WeaveMind – AI Workflows with human-in-the-loop

Hi! I spent 3 years evaluating LLMs for OpenAI, Anthropic, METR, and other labs. Kept running into the same problem: AI workflows break in production because there's no clean way to add human oversight, handle failures gracefully, or deploy without choosing between "all cloud" and "all self-hosted." WeaveMind is a visual workflow builder in Rust. The core idea is that humans and AI are interchangeable nodes in the same graph. When a workflow needs judgment, it pauses, not

Show HN: LocalGPT – A local-first AI assistant in Rust with persistent memory

I built LocalGPT over 4 nights as a Rust reimagining of the OpenClaw assistant pattern (markdown-based persistent memory, autonomous heartbeat tasks, skills system). It compiles to a single ~27MB binary, with no Node.js, Docker, or Python required. Key features:
- Persistent memory via markdown files (MEMORY, HEARTBEAT, SOUL), compatible with OpenClaw's format
- Full-text search (SQLite FTS5) + semantic search (local embeddings, no API key needed)
- Autonomous heartbeat runner that

Show HN: BestClaw Simple OpenClaw/MoltBot for non tech people

I've been using OpenClaw/Moltbot a lot these past few days. I even set up some of these instances for friends so they, too, can enjoy the possibilities of this project. So I thought it would be good to have a way to deploy it for non-tech people, letting them bring their own key and avoid expensive price markups. I plan to provide easy plans to set up without needing accounts at OpenAI/Anthropic/Google, but for now, people can BYOK and set it up for a reasonable price and contro

Show HN: AI Perks – A curated list of free AI credits and deals for developers

Hi HN, I built Get AI Perks to solve a problem I faced myself: keeping track of free credits, grants, and deals for AI tools is painful. Most "perk" lists are hidden behind expensive paywalls or buried in incubator intranets. I wanted to make a clean, public dashboard where developers can easily find valid offers for tools like OpenAI, Anthropic, Cursor, Deepgram, and others. Key Features:
- Approval Index: I try to track how likely you are to actually get the perk (High/Average/

Show HN: I built an open-source Gmail productivity app that auto-labels emails

I was drowning in email. Not the usual "too many emails" problem: I had a system, I had labels, but I was manually dragging hundreds of emails into folders every single day. Gmail filters work great if your emails follow predictable patterns. But my inbox doesn't. Client emails, project updates, newsletters I actually want to read, invoices, meeting notes: they all come from different senders with different subject lines. Writing regex patterns for everything felt like a part-time

Show HN: Fine-tuned Qwen2.5-7B on 100 films for probabilistic story graphs

Hi HN, I'm a computer systems engineering student in Mexico who switched from film school. I built CineGraphs because my filmmaker friends and I kept hitting the same wall: we'd have a vague idea for a film but no structured way to explore where it could go. Every AI writing tool we tried output generic, formulaic slop. I didn't want to build another ChatGPT wrapper, so I went a different route. The idea is simple: you input a rough concept, and the tool generates branching narrativ

Would you use a CLI tool that turns English into local automation workflows?

I'm building Viba: a terminal-first automation tool. You type "every Monday at 9am, query Postgres and email the CSV to team@" and it compiles that to a DETERMINISTIC CONFIG workflow spec, then executes it locally with a daemon. No cloud dependency, no GUI. Runs over SSH, in containers, wherever you have a terminal. Credentials stay on your machine (AES-256). Uses your own OpenAI/Anthropic key for the NL planning. Core is working (file ops, HTTP, email, cron, file watchers). Loo

Show HN: Launchpick – Native macOS launcher and window switcher

I switched from Linux to macOS a while ago and there are two things I still miss:
- Launching apps with parameters. On Linux you edit a .desktop file and that's it: VS Code opening a project folder, a script, whatever. On macOS there's no simple way to do this. Automator and Shortcuts feel overcomplicated for something so basic, and the paid launchers do way more than I need.
- Alt+Tab showing windows, not apps. Cmd+Tab groups everything by application. If I have several Chrome or

Show HN: Intervu – Free, BYOK Interview Prep (Groq/Gemini/OpenAI)

Hey HN, I’ve been interviewing recently and got frustrated with two things:
- Copy-pasting context (resume, job description) into LLMs repeatedly.
- "AI Interview" tools that charge monthly subs just to wrap an API call.
So I built Intervu, a free dashboard to organize your prep. The main difference: It’s Bring Your Own Key (BYOK). You use your own OpenAI, Gemini, or Groq keys and pay the provider directly. I don't charge anything. Features:
- Panic Mode: High-yield prep for when you're

OpenAI launches its first model partially built by itself: GPT-5.3-Codex

OpenAI launches GPT-5.3-Codex with faster coding, stronger reasoning, and higher benchmark accuracy—plus API access soon.

GPT 5.3 Codex, OpenAI's new agentic coding model, helped create itself

GPT-5.3 Codex merges the advanced coding capabilities of GPT-5.2 Codex with the reasoning and professional knowledge of GPT-5 ...

OpenAI launches GPT‑5.3-Codex as AI race heats up after Anthropic’s Claude Opus 4.6 — all you need to know

Sam Altman-led OpenAI launched GPT-5.3-Codex, which is a fast, agentic model that it believes goes beyond coding and can ...

OpenAI unveils GPT-5.3-Codex to tackle more advanced and complex coding

GPT-5.3-Codex is about more than just writing code ...