Prompt InsightsOpen Prompt Builder

DISPATCH // AI NEWS

Latest AI News

Signal over noise. Concise, curated news on AI models, tools, and prompt engineering for people who ship.

ModelsJun 20, 20262 min read

GLM-5.2 Tops Open-Weights Agentic Benchmarks as Anthropic Pulls Agent SDK Billing

Z.ai's GLM-5.2, a 753B parameter MoE model released under MIT, is topping new agentic benchmarks, while Anthropic quietly pauses token-based billing for its Claude Agent SDK. Two signals that together reveal where the open vs. closed frontier is moving.

Prompt EngineeringJun 20, 20263 min read

AI Killed the Economics of Code Production. Now Engineering Discipline Is the Scarce Resource.

When code became free and instant, the bottleneck shifted from writing to judgment. Here is what that means for teams shipping LLM features today.

AgentsJun 20, 20263 min read

Anthropic Pauses Agent SDK Token Billing as Cost Pressure Reshapes the Agent Stack

Anthropic reversed a billing change that would have hit heavy Claude Agent SDK users hard, just as it was set to take effect. The reversal is a signal that token economics are becoming the central battleground for agent infrastructure.

ModelsJun 17, 20262 min read

GLM-5.2 Becomes the New Open-Weights Leader, Beating GPT-5.5 on Agentic Knowledge Work

Z.ai's GLM-5.2, a 753B-parameter MoE under the MIT license, is now the top open-weights model on the Artificial Analysis Intelligence Index and scored above GPT-5.5 on the new AA-Briefcase agentic eval. The frontier-grade option you can self-host just shifted.

AgentsJun 15, 20262 min read

Anthropic Pauses Token-Based Billing for the Claude Agent SDK

On the day a token-based billing change for the Claude Agent SDK was due to take effect, Anthropic paused it. Here is what shifts for teams building agent runtimes, and how to plan around the uncertainty.

ModelsJun 12, 20262 min read

Gemma 4 12B Drops the Vision Encoder for Simpler Multimodal Deployment

Google DeepMind's Gemma 4 12B fuses vision and language into one encoder-free architecture, cutting deployment complexity for self-hosted and edge inference.

Prompt EngineeringJun 8, 20264 min read

Brevity Is a Token Budget: Why Concise Prompts Win in Agentic Systems

In agentic loops, the same context gets re-sent on every turn, so wordy prompts compound into real cost. Practitioners are reporting 60% token cuts by treating brevity as an engineering discipline, not a style preference.