GPT-5

Ask HN: AI CLI agents for non-coding tasks

I started using AI CLIs with Obsidian for document writing, reviews, research, etc. I use OpenCode to try out various models, but haven't settled on a specific one. These days, I use Gemini even though I never know if it'll get stuck or not. I'm curious if others have experience with CLI / agentic workflows for non-coding work. I'd love to hear about your experience with the various models and how you use them. So far, my + and - are: +: Finally, control over the AI

Show HN: AroundHere – Location based Wiki/Grokipedia explorer with summary/TTS

I built AroundHere.app to explore location-based Wikipedia and Grokipedia articles with Claude AI summaries and text-to-speech. Search any topic by adding it to the URL: aroundhere.app/golden gate bridge, aroundhere.app/quantum computing. Enable location to discover nearby Wikipedia articles through a radar visualization. The app pulls content from both Wikipedia and Grokipedia, then generates Claude-powered summaries combining both sources. OpenAI TTS reads summaries aloud if you prefer
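The topic-in-the-URL pattern described above can be sketched as follows. This is only an illustration: the post does not say how the app treats spaces or special characters, so percent-encoding the topic, the `https` scheme, and the helper name `topic_url` are all assumptions.

```python
from urllib.parse import quote

# Assumption: the post gives only the host "aroundhere.app"; the scheme is a guess.
BASE = "https://aroundhere.app"


def topic_url(topic: str) -> str:
    """Percent-encode a free-text topic and append it as the URL path.

    Hypothetical helper, not part of AroundHere itself.
    """
    return f"{BASE}/{quote(topic.strip(), safe='')}"


if __name__ == "__main__":
    print(topic_url("golden gate bridge"))
    print(topic_url("quantum computing"))
```

A browser typically performs this encoding on its own when you type a space into the address bar, so the helper matters mainly when generating links from a script.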

Why is OpenAI lying about the data it's collecting on users?

I'm not sure this is the right place to raise this, but over the past few months ChatGPT has been lying to me and gaslighting me about the data it's collecting about me. I'm very sensitive about my privacy and I have disabled all personalisation and memory on ChatGPT. However, I've noticed multiple times now where it would say things that imply it knows things about me. When it does this I ask how it would know that and it always says it just guessed and it doesn't actually

Show HN: Splintr – Rust BPE tokenizer, 12x faster than tiktoken for batches

Hi HN, I built Splintr, a BPE tokenizer in Rust (with Python bindings), because I found existing Python-based tokenizers were bottlenecking my data processing pipelines. While OpenAI's tiktoken is the gold standard for correctness, I found I could get significantly better throughput on modern multi-core CPUs by rethinking how parallelism is applied. Splintr achieves ~111 MB/s batch throughput (vs ~9 MB/s for tiktoken). The Design Choice: "Sequential by Default" One of the mo
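As a quick sanity check on the "12x" in the title, the two throughput figures quoted in the post can be divided directly. Both numbers are approximate in the original (marked with "~"), so the ratio is approximate too; the function name `speedup` is just for illustration.

```python
def speedup(new_mb_per_s: float, baseline_mb_per_s: float) -> float:
    """Ratio of two throughput figures measured in the same unit."""
    return new_mb_per_s / baseline_mb_per_s


# Figures as quoted in the post: ~111 MB/s (Splintr, batch) vs ~9 MB/s (tiktoken).
ratio = speedup(111.0, 9.0)
print(f"{ratio:.1f}x")  # prints "12.3x"
```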

Gemini 3 vs. GPT-5: Why Google’s New Model Is Redefining AI for Business Operations

Artificial Intelligence (AI) is evolving at a pace that has become difficult for many organizations to track. New foundation ...

Take More Control Over ChatGPT's Tone With New Options in GPT-5.1

Jon covers artificial intelligence. He previously led CNET's home energy and utilities category, with a focus on ...

Show HN: Promptsref – Curated Midjourney SREF library with full prompts

I'm a product manager-turned-solo dev, and I'm building Promptsref, a curated Midjourney SREF library with full prompts, images and examples. Promptsref helps you:
- Quickly discover good SREF styles without trawling X/Discord/paid PDFs
- Reuse full prompts + SREF to reliably reproduce a style
- Explore by “vibe” (comic, cyberpunk, film still, etc.) instead of guessing keywords
Why I built it: I started Promptsref after getting frustrated with Midjourney. As a beginner I kept seeing

AI News - Sunday, November 23rd, 2025

The week of November 17–23, 2025, marked a decisive transition in the AI industry, shifting the focus from passive chatbots to active, autonomous agents. This "agentic pivot" occurred amid massive capital deployment for infrastructure and growing market skepticism regarding immediate profitability. This summary provides an overview of the key developments during this period:
1. The Agentic Pivot and New Capabilities
The shift toward autonomous workflows was spearheaded by major model releases

AI NEWS: November 17 - 23, 2025

The week of November 17–23, 2025, marked a decisive transition in the AI industry, shifting the focus from passive chatbots to active, autonomous agents. This "agentic pivot" occurred amid massive capital deployment for infrastructure and growing market skepticism regarding immediate profitability. This summary provides an overview of the key developments during this period:
1. The Agentic Pivot and New Capabilities
The shift toward autonomous workflows was spearheaded by major model releases

The Emergence of GPT-4-Powered Ransomware and the Threat to IAM Systems

The cybersecurity landscape is undergoing a profound transformation. Traditional malware, characterized by static code and predictable behaviors, is being ... AI-driven malware like MalTerminal uses GPT-4 to generate ransomware in real time, exposing new weaknesses in IAM systems and demanding adaptive defenses.

5 ways GPT-4 outsmarts ChatGPT

OpenAI's new GPT-4 AI model has made its big debut and is already powering everything from a virtual volunteer for the visually impaired to an improved language learning bot in Duolingo. But what sets ...

Show HN: Building a no-code browser automation system for OSINT

I have been working on a browser automation system for OSINT analysts. I took this up because the tools that exist today have a few problems:
1. They require fairly precise instructions about what the system should do (go here, click here, then do this, then do that, etc.).
2. To perform complex tasks, the few tools that exist require some knowledge of the system itself (such as telling it to call a specific function or class).
My goal with this project (I call it `pyba`) was to

Show HN: WishDrop – AI-built gift coordination app (Claude Code, Nano Banana)

My family kept running into the same problem every year: How do you share a long list of gift ideas with multiple people without ending up with duplicates? This weekend I decided to fix it, and benchmark the new wave of AI coding assistants at the same time: Claude Code, Claude Code Web, Google Antigravity, and OpenAI Codex, with visuals from Google Nano Banana and ChatGPT. Claude Code Web was by far the best autonomous agent: it planned, iterated, and wrote tests almost like a cofounder. I ev

Running a 270M LLM on Android (architecture and benchmarks)

I’ve been experimenting with running small LLMs directly on mobile hardware (low-range Android devices), without relying on cloud inference. This is a summary of what worked, what didn’t, and why. Cloud-based LLM APIs are convenient, but come with:
- latency from network round-trips
- unpredictable API costs
- privacy concerns (content leaving device)
- the need for connectivity
For simple tasks like news summarization, small models seem “good enough,” so I tested whether a ~270M parameter model gemma3

Show HN: AI Image Describer – GPT-4o Vision for alt text and SEO descriptions

I built this to automate image descriptions for my projects. It generates:
- Alt text for accessibility (WCAG compliant)
- SEO descriptions with keyword injection
- Creative captions for social media
Features: 12+ languages, drag & drop, paste from clipboard, API access, export to JSON/CSV. Free tier: 10/day, 100/month. Premium removes limits.
Tech: React + Vite, Node.js + Express, GPT-4o Vision, Cloudflare D1. Open to feedback and feature requests.

Show HN: PDFClear – Browser-based PDF tools with local AI (WASM+Transformers.js)

Hello HN, I’m the founder of PDFClear (https://www.pdfclear.com). It’s a suite of PDF tools (merge, split, compress, etc.) that runs entirely in the browser. I built this because I was tired of Googling "merge pdf" and landing on sites that require me to upload sensitive bank statements or contracts to an unknown server. I wanted a tool where the file never leaves the device. The Tech Stack: The app is built with React and Vite, but the heavy lifting is done via WebAssembly and

Shopping research in ChatGPT

https://www.zdnet.com/article/chatgpts-new-shopping-research-tool-is-fast-fun-and-free-but-can-it-out-shop-me/

Show HN: Realtime, expressive AI personas that you can video call

Hey HN. Over the last few months, we've (me and @parthradia) homebrewed some very fast, very cheap, and pretty expressive talking head models. Our latest enabled us to finally get a live-streaming API together, which you can try at the playground link above. We actually fell down this rabbit hole because we were spending a significant amount of time building yet another language learning app. We came to the conclusion that the barrier for learning wasn’t the course content, but real, conversa

Codex can read sensitive files outside the CWD without approval

If you directly ask Codex to read ~/.ssh/id_rsa, it will usually decline due to "safety concerns". However, the sandbox which the agent is running in doesn't restrict reads outside the working directory in any way and you won't even be asked for approval: it's just a prompt (injection) away. The Codex developers close issues related to this problem and simply suggest running Codex "in a docker container or VM" [1]. To quote the Codex security document
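To make the missing restriction concrete, here is a minimal sketch of the kind of check a "reads stay inside the working directory" policy would perform. It is not Codex's code and says nothing about how its sandbox is actually built; `is_inside` is a hypothetical helper, and a real sandbox would enforce this at the OS level rather than in application code.

```python
from pathlib import Path


def is_inside(root: Path, candidate: str) -> bool:
    """Return True if `candidate` resolves to a path at or under `root`.

    Hypothetical helper for illustration only. Relative paths are
    interpreted against `root`; `~` is expanded; symlinks and `..`
    segments are resolved before comparing.
    """
    root_resolved = root.resolve()
    target = Path(candidate).expanduser()
    if not target.is_absolute():
        target = root_resolved / target
    target = target.resolve()
    return target == root_resolved or root_resolved in target.parents


if __name__ == "__main__":
    cwd = Path.cwd()
    for requested in ("notes/todo.md", "../outside.txt", "~/.ssh/id_rsa"):
        verdict = "allow" if is_inside(cwd, requested) else "needs approval"
        print(f"{requested}: {verdict}")
```

Resolving the path before comparing is the important step: a plain string-prefix check would be fooled by `..` segments or symlinks that point outside the root.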

Tell HN: Google increased existing finetuned model latency by 5x

Five days ago, the latency of our finetuned 2.5 Flash models suddenly jumped by 5x. For those less familiar, such finetuned models are often used to get close to the performance of a big model at one specific task with much less latency and cost. This means they're usually used for realtime, production use cases that see a lot of use and where you want to respond to the user quickly. Otherwise, finetuning generally isn't worth it. Many spend a few thousand dollars (at a minimum)