Claude Opus 4.8 vs GPT-5.5: What's Anthropic AI's new Ultracode mode, pricing, honesty claims and jailbreak debate
Anthropic has launched Claude Opus 4.8, a new AI model. It offers better coding and reasoning abilities. Users can now ...
Anthropic has launched Claude Opus 4.8, a new AI model. It offers better coding and reasoning abilities. Users can now ...
OpenAI Lost It's Trademark Rights to GPT at USPTO and TTAB. Then, Through General Counsel Che Chang and Pirkey Barber Attorney Christopher M. Weimer, Challenged FreedomGPT's Granted Trademark Application Anyway. AUSTIN, TX / ACCESS Newswire / June ...
I have been interested in long-horizon coding tasks for a while, especially with benchmarks like FrontierSWE, where even the best coding agents like Codex and Claude Code struggle to complete tasks.These agents come with a collection of tools like bash, file edits, grep, glob, etc.Lazarus takes a different approach. The idea is to give the model exactly one tool: a persistent Python runtime.Model writes Python code, executes it, and receives stdout/stderr. Through Python it inspects repos,
I have been working on multiple projects lately involving AI endpoints (including some I run locally) and I found I needed a way to easily load balance across multiple. Sometimes my on-prem would not be able to handle to load and Id have to crank up the z.ai usage or Anthropic depending on where my credits were at the time.One thing led to another and I ended up writing Busbar: An LLM gateway, written in Rust (I have a thing for Rust lately). You point your existing OpenAI/Anthropic/Ge
Making sense of what you get from Claude AI at the free tier, especially as limits are vague and keep shifting.
Anthropic says its Claude AI is no longer just helping humans write code. It is increasingly helping build the next ...
EvidenceLoop demo for the SANS FIND EVIL! hackathon. Built with OpenAI Codex GPT-5.5. Demonstrates local-first PCAP triage, live LLM-assisted analysis, validation, follow-up gaps, and traceable artifacts.
We built this library because agent harnesses were too fragmented and we needed a simple abstraction to call multiple coding-agent SDKs.lite-harness has one function - query()import { query } from "@lite-harness/sdk";for await (const message of query({ prompt: "Fix the failing test", options: { // swap harness between: "claude-agent", "openai-agents", "pi-ai" harness: "openai-agents", model: "gpt-5.5&
Hey HN, we’re Shalin & Kanyes, best friends who've been hacking together for 10+yrs, and now founders of Hyper (https://heyhyper.ai/). Hyper is a shared “company brain” that plugs into information flowing inside a company to make AI agents and automations better and ultimately save people time.Models have gotten good enough that they can (mostly) take on long-horizon, complex tasks. We believe the bottleneck now is that these smart-enough models often lack information abo
I've been coding since I was little, and practicing yoga since I was 25. Both are fun to do and to share.Like many of you reading this, I've been experimenting with AI agents. Unlike many of you, I'm a yoga teacher. I got to thinking... could my agents behave better if they did yoga? It works for me, and LLMs are trained on human generated content...So I developed a program for agents to get the benefits us humans get from the physical practice of yoga. Improved focus, memory
I like the work Anthropic doing on claude code and the ecosystem. I am developer but not holiding any PhD or wrote any paper in AI/ML. How to get into Anthropic and work on the stuff they are building. What skills I need to build?
I'm trying to reach someone at OpenAI regarding what appears to be a bug in their phone verification flow.The issue is not related to my normal account login or 2FA. I can successfully log in to ChatGPT, and my authenticator-based 2FA works as expected.The problem occurs during OpenAI's phone verification flow (which is seemingly now required to use codex) at https://auth.openai.com/phone-verificationFor phone verification, I'm required to receive a verification cod
Hi HN,Circus Chief is a tool for managing coding agent sessions from a browser. It's specifically optimized for small screens. It supports Claude Code, OpenAI Codex, and Google Gemini CLI agents.FeaturesAgents can operate Circus Chief itself. Agents can spawn sessions, schedule sessions, interact with the Kanban board — anything you can do in the UI, an agent can also do.Schedule work ahead of time.Automatically reschedule when you hit usage limits.Configurable, chainable prompt templates.U
Got this email overnight from OpenAI:We’re updating our Privacy Policy to include information about ads in ChatGPT, including how they work and how you can control your experience.Ads may appear on Free and Go plans. Plus, Pro, Enterprise, Business and Education plans do not have ads. Ads do not influence the answers ChatGPT gives you. When you see an ad, they are always clearly labeled as sponsored and visually separated from the organic answer. We’ll ask you to make a choice about personalised
Hi HN,I built Aura-IDE, a native desktop LLM coding harness for AI assisted software engineering.The idea is not just “chat with your codebase.” Aura wraps models in a structured engineering loop:repo awareness → Planner spec → Worker execution → surgical edits → validation → recovery → final receiptThe Planner reads the project and writes a spec. The Worker executes that spec with filesystem tools, diff approval, terminal validation, and recovery behavior. The goal is to make ordinary models pr
Not sure if this is happening to others, but sometimes when I paste an image, I get an answer completely different from what I attached. It also sometimes responds in a different language. I'm sure it's not my image, since I ask it to explain what it sees and it describes something completely different. What's going on?
The incident highlights how attackers can hide malicious code in software packages that differ from the source code available for review.
OpenAI turns Codex into an enterprise platform with hosted web apps, 62 business app plugins, and 110 skills. Non-developers are 20% of 5M weekly users, growing 3x faster.
OpenAI has selected ZoomInfo to be natively available inside OpenAI Codex for Work as a set of go-to-market skills, so sellers, SDRs, and RevOps can r ...
OpenAI ( OPENAI) has introduced role-specific plugins, including for finance and marketing, for Codex. The company introduced ...