Page 3 — News — Agent Wars

product launch Apr 30th, 2026

VS Code credits Copilot by default. Copyright just got complicated

Visual Studio Code 1.118 introduces Git AI co-authoring by default, automatically adding Copilot as a co-author on commits where it makes changes. The update also includes VS Code Agents app enhancements, remote control for Copilot CLI sessions, semantic indexing across all repositories, GitHub text search across repos and orgs, dedicated context for skills, and token efficiency improvements including prompt caching and a new tool search mechanism.

code.visualstudio.com

AI developmentGit integrationVS Code updates

technical Apr 30th, 2026

Granite 4.1: IBM's 8B Model Matching 32B MoE

IBM released Granite 4.1, a family of open-source language models (3B, 8B, 30B) under Apache 2.0 license. The 8B dense architecture model matches or beats the previous 32B MoE Granite 4.0-H-Small across benchmarks including tool calling (BFCL V3), math (GSM8K), and instruction following (IFEval). Key features include 512K context window, 15T token training, and aggressive data filtering. IBM also got unusually honest about training failures during their four-stage RL pipeline.

firethering.com

Open sourceLanguage modelsSmall language models

opinion Apr 30th, 2026

Neural Networks Work Because They're Allowed to Fail

Drawing an analogy between Internet protocol design and neural networks, computational complexity theorist Lance Fortnow argues both work well because they tolerate failure. Softmax's probabilistic outputs let models stay flexible by never ruling out answers entirely, trading guaranteed correctness for better average performance.

blog.computationalcomplexity.org

machine learningneural networksInternet protocols

technical Apr 30th, 2026

No AI Model Is Both Correct and Steerable, Says New Creative Benchmark

Contra Labs introduces a research framework for evaluating generative AI in creative work that separates convergence (shared best practices where evaluators agree) from divergence (legitimate differences in taste and creative intent). The study involved 1.5M+ independent professional creatives evaluating AI-generated outputs across five domains using pairwise comparisons, scalar ratings, and qualitative feedback. The benchmark measures creative quality along dimensions from verifiable (prompt adherence) to subjective (visual appeal), finding that no current model is reliably both correct and steerable.

contralabs.com

AI EvaluationCreativityGenerative AI

opinion Apr 30th, 2026

Diallo's Excel Satire Roasts AI Hype at Its Own Game

Ibrahim Diallo's satirical article parodies AI hype by applying the same exaggerated language to Microsoft Excel. The piece targets Excel's integration with Microsoft Copilot and built-in Python support, arguing that spreadsheets can replace entire business departments. The real target is the AI industry's inflated rhetoric, revealing how absurd startup pitches sound when applied to a humble grid of cells.

idiallo.com

SatireMicrosoft ExcelAI Hype

opinion Apr 30th, 2026

37,000 lines of AI code daily. Audits say it's useless.

Tech leaders like Garry Tan brag about massive AI output, but audits reveal bloat, not value. An NBER study found 90% of firms see zero measurable productivity gains from AI. Jake Handy calls this 'AI psychosis': executives chasing vanity metrics on orchestration platforms that amount to management theater, reinforced by AI agents that function as yes men.

handyai.substack.com

AI AgentsProductivity ParadoxEnterprise AI

opinion Apr 30th, 2026

Mozilla Draws Line Against Chrome's LLM API

Mozilla publicly opposes Chrome's proposed LLM Prompt API, arguing the standard raises privacy concerns and philosophical questions about what browsers should become.

mastodon.social

browser-standardsLLM-integrationweb-API

technical Apr 30th, 2026

Mozilla Says No to Chrome's Built-In AI API

Mozilla flagged Chrome's proposed Prompt API with a 'position: negative' label in their standards-positions repository. The API would let web developers access local language models directly through the browser. Authored by Domenic Denicola and developed by the Web Machine Learning Community Group, it's already in experimental form in Chrome and Edge. Mozilla isn't having it.

github.com

Web StandardsBrowser APIArtificial Intelligence

technical Apr 30th, 2026

Vera ditches variable names because LLMs can't handle them

Vera is a programming language built specifically for LLMs to write and verify code. It replaces variable names with structural references like @Int.0 and @Int.1 to avoid naming-related errors, enforces mandatory contracts verified by the Z3 SMT solver, and makes all effects explicit. Programs compile to WebAssembly. Error messages are designed for machine consumption, each with a stable code, concrete fix, and spec chapter reference. The language sits at v0.0.127 with a reference compiler and 13-chapter specification. VeraBench shows Kimi K2.5 hitting 100% run_correct on Vera versus 86% on Python, though GPT-4.1 and Claude Sonnet 4 both scored lower on Vera than Python. HN commenters question whether stripping semantic information from names actually helps LLMs or makes their job harder.

github.com

programming-languagellmcode-generation

technical Apr 30th, 2026

Amazon chips: from side dish to $20B Nvidia rival

Amazon's semiconductor business surpassed a $20 billion annual run rate, with custom silicon including Graviton processors, Trainium AI training chips, and Nitro security chips growing at over 100% year over year. OpenAI committed to 2 gigawatts of capacity, Anthropic to 5 gigawatts, with Meta and Uber also signing on. Amazon launched Bedrock AgentCore for building AI agents and made GPT-5.4 and Claude Opus 4.7 available on Bedrock.

theregister.com

semiconductorsAIAWS

opinion Apr 30th, 2026

Bundeswehr blocks Palantir from military cloud contract

Germany's military rejected Palantir's Maven software for its cloud and AI infrastructure over data sovereignty concerns. Vice Admiral Thomas Daum said allowing Palantir employees access to national data is "unimaginable." The Bundeswehr selected German firms Almato and Orcrist, plus France's ChapsVision, instead.

zeit.de

data sovereigntymilitary AIBundeswehr

technical Apr 30th, 2026

Your LLM Is Just a Fancy Probability Engine

Alfredo V. Clemente breaks down how Large Language Models actually work: tokenization, pre-training on massive datasets, and instruction fine-tuning. His framing is simple. LLMs do one thing, predict the next token. Everything else is clever problem reframing.

alfredvc.no

LLMTokenizationMachine Learning

opinion Apr 30th, 2026

Scott Aaronson: Quantum Computers Could Crack Crypto by 2029

Scott Aaronson, newly elected to the US National Academy of Sciences, warns that fault-tolerant quantum computers capable of breaking deployed cryptosystems could arrive by 2029. In a Coinbase-convened position paper with leading cryptographers, he urges immediate migration to quantum-resistant encryption. He draws parallels to AI risks, noting Anthropic's Mythos model finally jolted awareness about AI cybersecurity threats.

scottaaronson.blog

Quantum ComputingCryptographyPost-Quantum Cryptography

opinion Apr 30th, 2026

Opus 4.7 knows the real Kelsey

Kelsey Piper reports that Anthropic's Claude Opus 4.7 can identify authors from text samples as short as 125-150 words, even from unpublished drafts and unfamiliar genres. Testing showed the model could identify her from high school writing, fantasy novel drafts, and college essays written 15 years ago. The implications for online anonymity are stark. AI stylistic analysis may soon end anonymous communication for anyone with a substantial public writing history.

News

VS Code credits Copilot by default. Copyright just got complicated

Granite 4.1: IBM's 8B Model Matching 32B MoE

Neural Networks Work Because They're Allowed to Fail

No AI Model Is Both Correct and Steerable, Says New Creative Benchmark

Diallo's Excel Satire Roasts AI Hype at Its Own Game

37,000 lines of AI code daily. Audits say it's useless.

Mozilla Draws Line Against Chrome's LLM API

Mozilla Says No to Chrome's Built-In AI API

Vera ditches variable names because LLMs can't handle them

Amazon chips: from side dish to $20B Nvidia rival

Bundeswehr blocks Palantir from military cloud contract

Your LLM Is Just a Fancy Probability Engine

Scott Aaronson: Quantum Computers Could Crack Crypto by 2029

Opus 4.7 knows the real Kelsey

30-Year-Old SGI Meets Llama.cpp: AI on MIPS R8000

HERMES.md in git commits triggers $200 phantom Claude Code charges

SOB Benchmark: 95% Valid JSON, 70% Correct Values

Age Verification's Billion-Dollar Problem With Simple Solutions

Ten custom subagents manage Metabase's 500K-line Clojure backend

VibeBench: 1,000 Engineers Judge AI Models by Experience

Ramp's Sheets AI Could Silently Leak Your Financials

Your Chatbot Isn't Suffering, Says DeepMind Paper

Anthropic published a champion kit for Claude Code evangelists

AMD's Lemonade 10.3 Dumps Electron, Goes from 100 MB to 7 MB

Amazon debuts AI interviewer amid 30,000 job cuts

Zed 1.0 Lets You Run Rival AI Agents Side by Side

Rocky's Rust SQL engine catches pipeline errors at compile time

The Downgrading of the American Tech Worker

AMD's Lemonade SDK 10.3: 101MB to 7MB by Ditching Electron

27,000 Food Photos Later, AI Still Can't Count Carbs Reliably

The Downfall and Enshittification of Microsoft in 2026

Google's Knowledge Graph Could Decide Who Wins AGI

How AI companies profit from their own danger claims

AT Protocol's Free Firehose Lets AI Agents Loose on 2.4B Posts

Fake Keys Keep AI Agents From Leaking Real Secrets

Mendral halved LLM costs by making Opus barely run

AI Huynya: Collecting AI's Dirty Laundry, Anonymously

AI Play-Tests Games Using Retro Text Rendering

Claude Code's Runaway Safety Prompt Refuses Work and Burns Tokens

GitHub RCE: Patched in 2 Hours, But Should It Have Existed?

Friendly AI Chatbots Make More Mistakes, Back Conspiracy Theories

OpenAI's Phone Gambit: The Ive Reversal, Chip Deals, 2028 Target

Auto-Architecture: Karpathy's Loop, Pointed at a CPU

Pentagon: $225M to $55B for drones as cheap attacks overwhelm US

Nuxt Lead Ships Kanban Board Where AI Agents Pull Their Own Tasks

Your Terminal Is Burning Battery Like It's Mining Bitcoin

Finland solved nuclear waste. America's still guessing.

ChatGPT built an ad stack. The tracking runs deep.

DOOM runs inside ChatGPT and Claude via MCP

Why Sean Boots Turned Off Every AI Feature He Could Find