GPT-5
Show HN: From Claude Code to OpenCode – My Evolution in Vibe AI Engineering
I’ve spent the last few months iterating on my "Vibe Coding" workflow, moving away from closed-box solutions toward a more transparent, multi-provider stack. I documented the transition from Codex and Claude Code to an open-source setup using OpenCode and opencode serve. Cursor -> Claude Code -> OpenCode -> OpenCode + OpenCode-Manager -> Codex + Tmux + Tailscale -> OpenCode Serve + Tailscale. Key takeaways from the journey: The
Show HN: Synthea FHIR Data in BigQuery
We generated ~1,100 synthetic patients with Synthea, processed the FHIR R4 output through our normalization engine (Forge), and published it as a free public dataset on BigQuery Analytics Hub. 8 resource types: Patient, Encounter, Observation, Condition, Procedure, Immunization, MedicationRequest, DiagnosticReport. The raw Synthea output has 459 nested fields per resource, urn:uuid: references, and no column descriptions. We flatten it to clean views with ~15 columns each, pre-extracted IDs, and d
Show HN: ARISE – Agents that create their own tools at runtime when they fail
I built a framework that lets LLM agents create their own tools at runtime. Most agent frameworks assume you'll hand-craft every tool upfront. That works until your agent hits something you didn't plan for. ARISE (Adaptive Runtime Improvement through Self-Evolution) lets agents synthesize their own tools at runtime when they detect gaps. ARISE sits between your agent and its tool library. When the agent keeps failing at a class of tasks, it analyzes what's missing, uses a cheap LLM
AI coding agents accidentally introduced vulnerable dependencies
Recently we discovered something unexpected on one of our servers: a cryptominer running in the background. The machine was hosting a web service built using Next.js. The first sign of trouble was unusually high CPU usage. Even during low traffic periods, the server was consistently running near 100% utilization. After inspecting running processes and network activity, we found a background process downloading and executing a mining binary. ROOT CAUSE: The entry point was CVE-2025-29927, a vulnerabi
OpenAI Launches GPT-5.3 Instant for ChatGPT: Check Features, Accuracy Boost, and Availability
OpenAI launches GPT-5.3 Instant for ChatGPT with improved accuracy, lower hallucination rates, better tone, and full availability. Check features and retirement timeline for GPT-5.2.
OpenAI's GPT 5.3 Codex Drives Harness Engineering Need
“In an era where AI writes all the code, the task humans must excel at is ‘harness engineering,’” OpenAI wrote in a blog post ...
Show HN: Context Gateway – Compress agent context before it hits the LLM
We built an open-source proxy that sits between coding agents (Claude Code, OpenClaw, etc.) and the LLM, compressing tool outputs before they enter the context window. Demo: https://www.youtube.com/watch?v=-vFZ6MPrwjw#t=9s. Motivation: Agents are terrible at managing context. A single file read or grep can dump thousands of tokens into the window, most of it noise. This isn't just expensive; it actively degrades quality. Long-context benchmarks consistently show steep accuracy
Show HN: UberSKILLS – Open-source Workbench for building AI agent SKILLS
Agent Skills (SKILL.md files) are reusable instruction sets that teach code agents like Claude Code, GitHub Copilot, Cursor, and Windsurf how to perform specific tasks. Right now, creating them is entirely manual: you hand-write YAML frontmatter and markdown, with no way to preview, validate, or test before deploying. uberSKILLS is an open-source web app that gives you an integrated authoring environment for Agent Skills:
- AI-assisted creation - describe what you want in plain English, get a com
Show HN: Built an AI ad generator and ran $9K of FB ads with it
Been in the AI image gen space since 2023, before even GPT image gen was a thing, and after spending ~9K on Facebook ads for my own projects (made a video about that actually) I realized the thing I kept getting stuck on was the creatives themselves. I'm horrible at making reels-type video ads and Canva even with templates is surprisingly complicated for ad-specific stuff, plus everything ends up looking the same as everyone else using the same templates. Made a video going into detail about
Ek_ Leaks Persist
Vaults and proxy layers solve the "2am paste" vector: devs never touch raw keys, so nothing gets accidentally fed into prompts. But the leak keeps happening anyway. Across 60+ probes on GPT-4o (cost: $0.04), unrelated vectors consistently leaked the *same internal structure*:
- ek_ prefix on session tokens
- EPHEMERAL_KEY naming
- Realtime API client_secret endpoint
- Documented 60s TTL vs observed minutes-to-hours persistence
No real credential was in the prompt, just semantic pressure
Show HN: RunCycles – pre-execution budget enforcement for autonomous agents
I built this after reading too many incident reports of agent loops spending $200 in 4 minutes because a quality threshold was never met. The pattern is always the same: an agent retries, fans out, or loops. Each iteration passes individual rate-limit checks. Observability fires an alert after the money is gone. Provider caps are per-provider, not cross-provider. None of these stop the spend before it happens. RunCycles takes a different approach: reserve budget before the call, commit actual
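The reserve-before-call idea in this snippet can be sketched roughly as below. This is an illustration only, not the RunCycles API: `BudgetLedger`, `reserve`, `commit`, and `BudgetExceeded` are hypothetical names, and amounts are kept in integer cents to avoid floating-point drift.

```python
from dataclasses import dataclass, field
import itertools


class BudgetExceeded(Exception):
    """Raised when a new reservation would push total exposure past the cap."""


@dataclass
class BudgetLedger:
    # Hypothetical ledger; one instance is meant to be shared across providers.
    cap_cents: int
    committed_cents: int = 0
    _holds: dict = field(default_factory=dict)
    _ids: itertools.count = field(default_factory=itertools.count)

    def reserve(self, estimate_cents: int) -> int:
        """Place a hold before the call; refuse if the cap could be exceeded."""
        held = sum(self._holds.values())
        if self.committed_cents + held + estimate_cents > self.cap_cents:
            raise BudgetExceeded(
                f"would exceed cap of {self.cap_cents} cents"
            )
        hold_id = next(self._ids)
        self._holds[hold_id] = estimate_cents
        return hold_id

    def commit(self, hold_id: int, actual_cents: int) -> None:
        """Release the hold and record what the call actually cost."""
        self._holds.pop(hold_id)
        self.committed_cents += actual_cents
```

In this sketch an agent loop would call `reserve` with a worst-case estimate before every model call and stop as soon as `BudgetExceeded` is raised, so the check happens before the spend rather than after it.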
Show HN: 1,011 AI crawler requests. Google Analytics saw zero
Google Analytics can't see GPTBot or ClaudeBot. Here's how I built a server-side tracker that can — and what I found in 72 hours.
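A minimal sketch of what server-side detection could look like, assuming the crawlers identify themselves in the User-Agent request header. The snippet does not describe the author's actual tracker; `classify_ai_crawler`, `count_ai_crawlers`, and the marker list are hypothetical names, and only the two bots named above are matched.

```python
from collections import Counter
from typing import Iterable, Optional

# Substrings to look for in the User-Agent header (assumed identifiers).
AI_CRAWLER_MARKERS = ("GPTBot", "ClaudeBot")


def classify_ai_crawler(user_agent: str) -> Optional[str]:
    """Return the matching crawler name, or None for any other client."""
    ua = user_agent.lower()
    for marker in AI_CRAWLER_MARKERS:
        if marker.lower() in ua:
            return marker
    return None


def count_ai_crawlers(user_agents: Iterable[str]) -> Counter:
    """Tally crawler hits from an iterable of User-Agent strings."""
    hits: Counter = Counter()
    for ua in user_agents:
        name = classify_ai_crawler(ua)
        if name is not None:
            hits[name] += 1
    return hits
```

Because the check runs on request headers at the server, it does not depend on the client executing any JavaScript, which is the usual reason a page-tag analytics tool would miss such requests.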
Show HN: Plaidify – Give AI agents access to any login-protected website
Every AI agent hits the same wall: the world's most valuable data is locked behind login forms. Bank balances, utility bills, insurance policies, academic transcripts: none of them have APIs. Plaid covers banks for $500+/mo. Everything else? You write fragile Selenium scripts. Plaidify is open-source infrastructure that turns any login-protected website into a REST API. You drop a JSON "blueprint" into the connectors folder — CSS selectors for username, password, submit, and
OpenAI releases a Windows version of Codex coding app
Around one month after launching Codex for Mac, OpenAI brings Codex to Windows with support for a new suite of IDEs.
OpenAI launches Codex app to bring its coding models, which were used to build viral OpenClaw, to more users
OpenAI is launching a Codex app, a desktop experience for managing the company’s AI coding tools, as competitive pressure to deploy autonomous AI agents heats up. The desktop app is designed to be ...
OpenAI sees Codex users spike to 1 million, positions coding tool as gateway to AI agents for business
Controversy over OpenAI's agreement to provide AI to the Pentagon has swamped news about Codex's rapid adoption ...
OpenAI rolls out Codex Security to automate code security reviews
OpenAI is rolling out Codex Security, an AI-powered application security agent that finds, validates and proposes fixes for ...
OpenAI’s Codex Security: An AI agent that hunts down vulnerabilities
The post OpenAI’s Codex Security: An AI Agent That Hunts Down Vulnerabilities appeared first on Android Headlines.
Everyone is focusing on AI, we're focusing on humans
Well, as the title says, it seems that everyone is looking to build something AI related. I guess it’s the AI bubble. While AI is great in many industries, in language learning and exchange we believe humans are the core, especially for anyone who wants to achieve fluency. Talking to natives cannot be replaced by anything unless there is a very sophisticated AI that looks entirely like a human, acts like one, has the culture, and everything comes into place, which I believe is still too far away. All I s
Show HN: AI coding agent for VS Code with pay-as-you-go pricing - no subscription
I built LLM OneStop Code, an AI coding agent for VS Code that works like Claude Code or Cursor, but with one key difference: pure pay-as-you-go pricing. No monthly subscription required. The problem with existing tools:
- Cursor: $20/month for Pro (even if you barely use it)
- GitHub Copilot: $10/month minimum
- Claude Code: Rate-limited by API usage tier + monthly caps
LLM OneStop Code charges only for what you use, billed in credits at cost + 5%. If you code 2 hours this month and 40 the