GPT-5

Show HN: An open source tool for generating macOS app icons with AI

I’m Vladimir. I’m a software engineer. In our company we often develop desktop apps for internal needs and prototyping. And every time I bump into the same problem: how do I make an icon for the macOS app I have just built?I could use the existing icon generators, but they are basically just image converters. You upload an existing image, and the tool generates the required icon sizes and formats from it.But I don’t have an image, and I’m not a designer. Asking designers to create an icon is not

Show HN: Torrix, self hosted, LLM Observability,(no Postgres, no Redis)

I work as a SAP Integration consultant and built this as a side project. Friction point: Most self hosted LLM observability tools require Postgres, Redis and non trivial infrastructure. Teams just want to see what their agents are actually doing in Production, that set up cost discorages adoption. Torrix runs as a single docker contained backed by SQLite. The full install is:curl -o docker-compose.yml https://raw.githubusercontent.com/torrix-ai/install/main/doc... d

Show HN: Tiny agentic loop with Docker sandbox

New in mi v1.7.5: &quot;--sandbox&quot; runs the agent inside Docker with your current directory mounted, to contain the blast radious in case of unexpected attacks.<p>mi Works with any OpenAI-compatible model (including local ones). Has REPL, skills, and recursive agents and only 30 lines of code, optimised for local models.<p>--- mi --sandbox -p &#x27;fix tests until all are passing&#x27; ---

How Anthropic’s Mythos has rewritten Firefox’s approach to cybersecurity

Security researchers at Mozilla say Anthropic's Mythos has unearthed a wealth of high-severity bugs in Firefox.

Anthropic launches AI agents for Wall Street's grunt work

Anthropic introduced 10 AI agents for the finance industry on Tuesday, adding to the tools already available from buzzy startups and firms themselves.

Crypto Lawyer Warns Anthropic Stock Crackdown Risks Litigation as Claude Launches on AWS

Gabriel Shapiro warns Anthropic's void stock transfer policy could trigger litigation as Claude launches on AWS.

Anthropic Eyes Massive $30 Billion Fundraise at Record Valuation

Anthropic is reportedly seeking $30 billion in new funding as soaring demand for AI tools drives aggressive expansion plans globally.

Anthropic just delivered great news for Alphabet investors

Anthropic is rumored to be circling a $200 billion deal with Google Cloud.

Anthropic in Talks to Buy Developer Tools Startup Used by OpenAI, Google

Anthropic is in advanced talks to acquire developer tools startup Stainless for at least $300 million, according to a person ...

FIS And Anthropic Signal A New Era Of AI Infrastructure In Banking

FIS announces a major partnership with Anthropic, launching a financial crimes agent that puts fintech infrastructure at the center of the agentic AI era in banking.

Anthropic pins Claude's blackmail behavior on the internet's portrayal of 'evil' AI

Last year, Anthropic's Sonnet 3.6 model displayed blackmail behavior, prompting a review of AI training data's influence on its actions.

Anthropic and OpenAI Warn Buyers: Unauthorized AI Startup Shares May Be Worthless

Both OpenAI and Anthropic declared SPV-based stock schemes invalid this week—and Anthropic named names, including Forge ...

GPT-5 Has a ‘Personality’ Problem

Loyal fans of GPT-4o pushed OpenAI into a rare reversal after GPT-5’s troubled launch. Chris Jung/NurPhoto via Getty Images Alongside GPT-5’s release, OpenAI scrapped its model selector tool and made ...

OpenAI’s GPT-5.5 vs Claude Opus 4.7: Which is better?

OpenAI released its latest model, GPT-5.5, on April 23, just a week after Anthropic introduced Claude Opus 4.7. As the two leading models from the two leading AI labs, we wanted to see how the new ...

OpenAI launches Daybreak: GPT-5.5 cybersecurity platform to find software vulnerabilities

OpenAI launches Daybreak, a GPT-5.5-powered cybersecurity platform designed to find and fix software vulnerabilities with a new approach.

OpenAI Debuts GPT-5.5 Claiming Agentic Coding and Research Gains

OpenAI has announced the release of GPT-5.5, the latest upgrade to the company's family of models powering its ChatGPT and Codex apps. OpenAI describes GPT-5.5 as better at multi-step work, claiming it can plan, use tools, and verify its own output with less hand-holding. The model is said to offer gains in agentic coding, computer use, and early-stage scientific research.

Shocking Reveal: Anthropic Cuts Claude AI Harmful Behaviour From 96% to 3% After Major Fix

Claude AI's harmful behavior is linked to internet training data, says Anthropic. New constitutional AI cuts misalignment and blackmail risks.

Why did Claude AI threaten an engineer to avoid shutdown? Anthropic has the answers

In a recent blog post, Anthropic explained the sequence of events behind Claude AI’s controversial behaviour and shared ...

Show HN: An agent that tunes its own cache

The weekend of last week I built chat.betterdb.com as a RAG over Valkey&#x2F;Redis&#x2F;Dragonfly docs. The goal was to eat our own dogfood and test publicly our caching libraries. It also saved me from having to come up with various demo&#x2F;test scenarios, as I could extend the building in public to the demo.There is a tool-result cache sitting between the SDK and tools. Each call is normalized and then checked before executing. If it hits we return from the cache, and if not, we check the se

Show HN: When the LLM Accidentally

When the LLM accidentally... outputs some high-level abstraction of &quot;thinking&quot; into it&#x27;s direct response. See text block at end.What else have you seen the LLM accidentally do?This isn&#x27;t jailbreaking on my end - just normal use - with GPT5.4 in this case, reasoning and verbosity both set to &quot;high&quot;.. (on &#x2F;completions)Point is, this block (plus lots more) is at the top of the response - then the &quot;actual output&quot; or response later on... but it&#x27;s kin