GPT-5
Show HN: X-AnyLabeling – An open-source multimodal annotation ecosystem for CV
Hi HN,Over the past year I’ve been building X-AnyLabeling, an open-source project that started as a labeling tool and gradually evolved into a multimodal annotation ecosystem.The original problem wasn’t drawing boxes or masks. It was that annotation, inference, and training are usually fragmented into separate tools, which makes iteration slow and painful — especially for small teams.X-AnyLabeling tries to unify these pieces:- A desktop-first annotation client (cross-platform, pure Python)
- Plu
Show HN: Browser-Use as a REST API with VNC, persistent sessions, and tools
I wrapped browser-use in a REST API with features I needed for production use.browser-use is great but it's a Python library. I wanted an API I could call from anywhere with proper session management, visibility into what's happening, and extensibility.Features:
- VNC streaming: watch the browser live at /vnc.html
- Session management: launch browsers, reuse across tasks, save profiles
- Custom tools: register HTTP endpoints the agent can call (APIs, webhooks, etc.)
- Task control
Show HN: LLMatcher – Find your perfect AI through blind voting
Hey HN! I built LLMatcher in 10 hours to solve a problem I kept having: which AI model should I actually use?Instead of trusting marketing claims, I created a blind testing platform where you compare two anonymous AI responses and vote for the better one.After 50 votes, you get personalized recommendations based on YOUR preferences — not some generic benchmark.Key features:
- Top AI models (GPT-5.2, Claude Opus 4.5, Gemini 3 Pro, Grok 4+)
- Blind side-by-side comparison
- Personal AI toolkit (un
Journaling and Prompting
I used Notion for several years for journaling, but I found the cognitive cost of switching into its DSL wasn’t worth it for me. Notion is built on blocks, things like databases built on top. Even when I exported my notes to Markdown, it still reflected Notion’s internal data structure instead of giving me something clean and portable.For example, the inline database ends up as a table with href links to other parts of the document - nice, but not very useful when I want plain text I can actuall
Show HN: Full Signal – Get the signal of Twitter without doomscrolling
I kept opening Twitter to catch up on people I follow. 20 minutes later I'd close the app having scrolled through rage-bait and missed what I actually wanted to see.<p>So I built Full Signal. You pick the accounts. Every morning, their tweets get turned into a 5-minute digest — highlights, conversations, gems. No scrolling.<p><a href="https://fullsignal.xyz" rel="nofollow">https://fullsignal.xyz</a><p>Would love feedback.
A keyboard ring capable of alphanumeric output
Hello,Testing the waters here, as I just scored technical feasibility on this after working on it for a 2-3 years. Have to be shallow about the details as it is being patented.I have designed a keyboard ring capable of silent typing using HID only. The raw data is processed by an AI system delivering the complete sentences to the host device clipboard or in the companion app. I got the hardware working, but have not yet decided on the software layer.The product while worn was originally meant to
Are GPT-5.2's new powers enough to surpass Gemini 3? Try it and see
Are GPT-5.2's new powers enough to surpass Gemini 3? Try it and see ...
How does GPT-4 work and how can you start using it in ChatGPT?
OpenAI, the company behind the viral chatbot ChatGPT, has announced the release of GPT-4. In a blog post, the San Francisco artificial intelligence lab co-founded by Elon Musk and Sam Altman in 2015 ...
GPT-5.2 vs Grok 4 — How does Musk’s AI compare on benchmarks, price, and features?
For GPT-5.2, you have to get the pro version of ChatGPT, which starts at $20 per month or $200 per month, depending on what ...
Show HN: DailyGame.online – a minimal daily puzzle arcade built with GPT-5.2
Hi HN — I owned the domain dailygame.online and wanted to build something small that could eventually make money, but I didn’t have a clear idea at first. Around the same time GPT-5.2 came out, so I treated this as an experiment: can I ship a playable “daily games” site quickly if I use GPT-5.2 as a pair programmer?I started by giving one strict constraint: no flashy UI. The homepage should be a tiny centered box (ASCII/terminal-like) listing “Today’s Daily Games”.Then I had GPT-5.2 impleme
Happy 3 years, ChatGPT: Here is a complete version-by-version journey from GPT-3.5 to GPT-5.1
Since its debut in November 2022, ChatGPT has evolved rapidly through multiple versions: GPT-3.5, GPT-4, GPT-4o, GPT-4.5, GPT ...
Ask HN: Go all-in on AI Boom vs. enjoy parenthood?
After many years of working hard, my wife and I finally decided to have a kid recently. I am looking forward to spending a lot more time with her, while working just a 9-5. However, the AI boom seems like a short-lived, once in a life-time opportunity. Now I am considering if I should dive in fully even if it means sacrificing time with family?Options:
1) Joining an AI startup
2) Founding an AI startup
3) Stay at my big-tech roleI make good money at my current role, but I am extremely passiona
Accenture, Anthropic strike multi-year partnership to boost AI adoption
Accenture and Anthropic on Tuesday announced an expansion of their partnership through a new business group where around ...
Anthropic’s “Soul Overview” for Claude Has Leaked
AI tinkerer Richard Weiss came across a fascinating document that describes the "soul" of AI company Anthropic's Claude 4.5 ...
Anthropic and Accenture sign multi-year AI strategic partnership
The two companies are launching the Accenture Anthropic Business Group to bring Anthropic's AI to Accenture's employees.
OpenAI, Anthropic, and Block join new Linux Foundation effort to standardize the AI agent era
Anthropic, Block, and OpenAI are backing the Linux Foundation’s new Agentic AI Foundation, donating MCP, Goose, and AGENTS.md ...
Show HN: QonQrete – Local-first multi-agent system for sandboxed code generation
I’ve been working on an open-source project called QonQrete and would like feedback from HN.What it isQonQrete is a local-first, agent-based orchestration system for code generation. It coordinates multiple LLM “agents” to plan, write, and review code, while keeping execution inside a sandbox on your own infrastructure. Think of it as a construction yard for AI-assisted development that you run yourself.Why I built itMost multi-agent demos I saw had two issues:– Security: generated code often ru
Show HN: RAG-TUI – Visual chunking debugger for RAG pipelines in the terminal
I built this because I was tired of guessing chunk_size=1000 and overlap=200 in my RAG pipelines and hoping for the best.RAG-TUI is an open-source terminal tool that lets you visualize exactly how your text is being split before you index it. It helps you tune parameters in real-time and spot issues like sentences getting cut in half.The Stack:
Built with Textual (Python) for the TUI.
Chonkie for token-based chunking.
Usearch for local vector search.
Integrates with Ollama for entirely local
Open Source "Notch" for AI, Agents and Automation
I built AI Thing (https://aithing.dev) so we can use AI in a transparent, secure way — and get not just the features other platforms charge for, but a whole lot more, without any payment plans.A lot of platforms charge to use agents, MCP servers, or automations. Some even put basic MCP functionality behind a “premium” tag. So I just made everything free.Watch the demo on YouTube: https://www.youtube.com/watch?v=R3azWzrv9IM&feature=youtu.be------------
Key Features
--
Building an AI cost-optimizer and AI Slop Prevention tool Looking for feedback."
Hey — Looking for feedback on my AI cost-optimization + “AI Slop Prevention” tool
I'm Zach, and I’ve been building AI features for a while now. Like many of you, I started noticing the same painful problems every time I shipped anything that used LLMs.
The problem (from a developer’s perspective)
AI bills get out of control fast. Even if you log usage, you still can't answer:
• “Which model is burning money?”
• “Why did this prompt suddenly cost 10× more?”
• “Is this output identical