TechDig — Tuesday, June 9, 2026

Today's issue

Got ten seconds? The badger read all of it.

Today's lead Big Tech

Apple gave up building Siri's brain and rented Google's

Apple stops trying to build the brain and licenses Google's

Apple spent two years promising a smarter Siri and not delivering. On Monday it admitted, in effect, that it couldn't build the engine itself: the new "Siri AI" runs on Google's Gemini in the cloud, dressed up in Apple's privacy wrapper, with Apple's own smaller models handling what stays on your phone.

The genuinely useful change for you: you can now pick your assistant. iOS 27 lets you set ChatGPT, Claude, or Gemini as the brain behind Siri from Settings, instead of being stuck with whatever Apple ships. The new Siri can also see what's on your screen and pull from your messages, mail, and photos to actually do things across apps, rather than just answering trivia.

It arrives "this fall" with no exact date, which is part of why Apple's stock dipped during the show. Europe won't get it on iPhone yet because of EU rules, and China is on hold. This was also Tim Cook's last big keynote before he hands the CEO job to Apple's hardware chief in September.

Apple rebuilt Siri from the studs and announced it at WWDC on Monday as "Siri AI." The architecture is the headline: on-device requests and Private Cloud Compute run Apple's own Foundation Models, but the cloud-scale reasoning is a custom build of Google's Gemini. After two years of missed Siri promises, Apple's answer to the frontier-model gap is to rent one and wrap it in their privacy stack rather than train their own.

Two things matter for builders beyond the rebuild:

The assistant default is now pluggable. iOS 27 Extensions lets a user route Siri, Writing Tools, and Image Playground to ChatGPT, Claude, Gemini, or Grok from a Settings toggle. The single-provider ChatGPT tie-in is gone; competing models are first-class citizens at the OS layer on the most widely deployed mobile platform in enterprise.
Siri AI can read screen context and act across apps, pulling from messages, mail, and photos, with conversation history synced through a standalone app.

The rest of the keynote: macOS 27 "Golden Gate" drops Intel Macs, a homeOS developer preview points at a HomePad hub, and this was Tim Cook's final WWDC keynote before John Ternus takes the CEO role on September 1. Developer betas are live now; general availability is "fall 2026," with no firmer date, which is part of why the stock slid through the keynote. Siri AI is blocked on iOS 27 and iPadOS 27 in the EU under the DMA with no timeline, and held in China pending regulatory clearance.

The deal value (reported around $1B/year) and the Gemini model size are press figures, not in Apple's releases.

Read the sourcesapple.com ↗apple.com ↗apple.com ↗

The Money

OpenAI takes the first step toward going public

OpenAI files a confidential S-1, a week behind Anthropic

OpenAI told the government it wants to keep the option of selling shares to the public, filing the early paperwork (a confidential draft) and announcing it itself before the news could leak. It's the first real move toward an IPO from the company behind ChatGPT, and it comes about a week after its rival Anthropic did the same thing.

OpenAI was careful to say it hasn't decided when, or even whether, to go through with it — some things, it noted, are easier when you're private. What's solid: the company was last valued around $852 billion, ChatGPT has more than 900 million weekly users, and it's still losing money. The flashier claims — a trillion-dollar price tag, a September debut, big-name banks lined up — are all reporting, not anything OpenAI has confirmed.

The real significance is simple. Going public eventually means showing your books. The two companies leading the AI race are heading toward a moment where they have to prove the numbers, not just the hype.

OpenAI announced it submitted a confidential draft S-1 to the SEC, and did the announcing itself to get ahead of the leak: "We expect it to leak, so we're just announcing it." It's the first formal public-markets step the company has taken, and it lands about a week after Anthropic filed its own confidential paperwork. Two frontier labs are now in the IPO pipeline at once.

OpenAI was deliberately noncommittal on timing, saying some things are "easier as a private company" and that it hasn't decided when, or whether, to actually list. What's confirmed is the filing and the context around it:

Last private valuation ~$852B, set in a funding round that closed March 31; cumulative private funding north of $170B.
ChatGPT at 900M+ weekly actives, roughly $2B/month in revenue, still unprofitable.
A May jury ruling gutted the core of Musk's suit against the company, clearing a legal overhang right before the filing.

The louder numbers floating around — a $1T target, a September debut, Goldman and Morgan Stanley as leads — are reporting, not anything OpenAI has said. The S-1 itself is confidential, so none of the financials are public yet. The point isn't the rumored price; it's that the two companies setting the pace in AI are about to be forced to show real numbers to public investors.

Read the sourceopenai.com ↗

Security

The makers of Claude built an AI that breaks into software in an hour

Anthropic ships a model that writes working exploits in an hour

Anthropic ran an experiment and shared the results: they gave an AI freshly announced software bugs — the kind that have a fix available but that not everyone has installed yet — and asked it to build a working break-in. It did. On a batch of Windows flaws it produced its first working attack in 31 minutes and finished eight complete ones for about $2,000 each, faster than the patches could reach people's machines.

Here's why that's a big deal. Security teams have always counted on a cushion: after a bug is announced, it normally takes skilled humans days or weeks to turn it into a real weapon, which is the window everyone uses to patch. This shrinks that cushion to hours.

Anthropic's answer was to split the model in two: a public version (Claude Fable 5) with guardrails that refuse most hacking requests, and a locked-down version (Mythos 5) given only to vetted defenders. The numbers are Anthropic's own, on a small set of bugs, so treat them as a strong signal rather than the final word.

Anthropic published N-day research and shipped a model the same week, and together they're the most concrete read yet on where offensive AI sits. The research: a Mythos-class model was handed disclosed-but-unpatched vulnerabilities and asked to weaponize them. On 21 Windows local-privilege-escalation bugs it produced proof-of-concept crashes for 18 and built 8 full exploit chains that older models built zero of — first PoC in 31 minutes, all eight chains finished before the patches reached enrolled machines, at roughly $2,000 per working exploit. On a Firefox set it wrote 8 working code-execution exploits, first one in under an hour.

The number that should reset assumptions: defenders budget days-to-weeks between a CVE going public and a working exploit existing. This compresses it to hours, for anyone with API access to a comparable model.

Anthropic's response is a two-tier release:

Tier	Who gets it	Posture
Claude Fable 5	General availability (API, Bedrock, Vertex, Foundry)	Safeguarded; offensive-cyber prompts routed to a more restricted model, triggered in <5% of sessions
Claude Mythos 5	Vetted Project Glasswing defenders	Unconstrained on cyber

All the benchmark figures are Anthropic's own red-team numbers, not peer-reviewed, and the vulnerability sets are small and drawn from a narrow window. Anthropic also notes the capability emerged from general training, not targeted work — which cuts both ways on how reproducible the ceiling is.

Read the sourcesred.anthropic.com ↗anthropic.com ↗

Read the sourcesred.anthropic.com ↗anthropic.com ↗red.anthropic.com ↗

The Money

SpaceX is about to be the biggest stock-market debut in history

SpaceX prices the largest IPO ever, and the books show xAI bleeding

SpaceX set its share price for going public and the math is staggering: about $1.75 trillion in value, roughly $75 billion raised — the largest IPO ever. It starts trading this week as SPCX.

The paperwork also opened the books on Musk's AI company, xAI (maker of Grok), which SpaceX swallowed earlier this year. And the two halves of the business look very different. The rockets-and-internet side — Starlink — makes real money, over $4 billion in profit last year. The AI side lost $6.4 billion on a fraction of that in sales, and it's spending faster every quarter.

So buying in means betting that the cash-printing space business can carry an AI business that's bleeding heavily, at a price one research firm thinks is about double what the company is actually worth. Musk keeps voting control no matter what. Worth remembering: these figures all come from SpaceX's own filing.

SpaceX set a fixed price of $135/share on its roadshow, a market cap near $1.77 trillion, raising about $75 billion — more than triple Alibaba's 2014 record, the largest IPO ever attempted. It trades on Nasdaq as SPCX. The filing is also the first public look at xAI's books, folded in after SpaceX absorbed it in an all-stock deal that valued xAI at $250B.

The split inside the numbers is the story:

Starlink carries it — $11.4B revenue in 2025, a $4.4B operating profit.
The AI segment (Grok/X) posted $3.2B revenue against a $6.4B operating loss, with AI capex already running north of a $30B annualized rate in early 2026.

So public investors are being asked to fund a rocket-and-satellite business that prints cash and an AI business that burns it faster every quarter, at a valuation Morningstar pegs at roughly half the IPO price. Musk keeps 80%+ voting control through Class B shares. Every figure here is from a company-prepared S-1, not seasoned audited statements, and the "Friday" trading date is scheduled, not closed.

Read the sourcessec.gov ↗cnbc.com ↗

AI Labs

OpenAI quietly softens its promise to automate science

OpenAI's "third phase" quietly walks back the autonomous-researcher pitch

The same day it filed to maybe go public, OpenAI's leaders put out a manifesto. The big themes: build an AI that helps do AI research, juice the whole economy, and eventually give everyone their own super-capable assistant.

The telling detail is a walk-back. OpenAI used to say it would have an AI that does research entirely on its own by early 2028. Now the line is softer — by 2028, AI might do "a significant fraction" of the work alongside human researchers. And the essay flatly rejects the all-the-way version: fully automating everything, it says, "would be unfulfilling, and it would be dangerous." It also floats the idea of an international group that could hit pause on the most advanced AI if things move too fast — a nice thought with zero detail on how it would actually work.

Translation: the company that's been most bullish on AI doing its own science just dialed back the timeline and the ambition, and framed humans-stay-in-the-loop as the safer plan.

Sam Altman and chief scientist Jakub Pachocki published a plan declaring OpenAI in a "third phase" — past foundational research, past commercialization, now aimed at automating research itself, accelerating the economy, and putting a personal AGI in everyone's hands. Published the same day as the S-1, it reads as much like investor framing as safety policy.

The interesting part is what changed. The earlier internal target was a fully autonomous AI researcher by March 2028. The new wording is softer: by March 2028 they "may have a significant fraction" of research "being done by AI systems in tandem with our own researchers." And the essay explicitly disowns the maximalist version:

Entirely automating everything is not the future we want. It would be unfulfilling, and it would be dangerous.

It also revives the idea of an international body that could coordinate a slowdown, "including slowing frontier development when needed." No structure, no membership, no enforcement — an aspiration, not a plan. "Significant fraction" could mean 10% or 60%. But the direction of travel, from "autonomous by 2028" to "alongside humans, and full autonomy is dangerous," is the part worth marking.

Read the sourceopenai.com ↗

Chips

Xiaomi made a giant AI answer about as fast as you can read it spilling out

Xiaomi pushes a 1T-parameter model past 1,000 tokens/sec on one node

Most big AI models reply in a slightly laggy stream. Xiaomi just showed one of the largest kinds of model spitting out text at over 1,000 words-ish per second — far quicker than the chatbots you use — and doing it on a fairly ordinary 8-chip computer instead of a giant cluster.

The trick is two clever shortcuts: storing the model's "knowledge" in a more compact form so the math is lighter, and a method that drafts several words at once and checks them in a single pass instead of one word at a time. They've put the model out in the open for others to inspect, and even claim it got slightly better on some tests after the compression, which is unusual enough to take with a pinch of salt.

The backdrop is a brutal price war among Chinese AI companies — Xiaomi recently cut prices by up to 99%. The point of this release is speed: if it holds up, you can run a top-tier model in real time without renting a supercomputer.

Xiaomi and inference startup TileRT released MiMo-V2.5-Pro-UltraSpeed, running their existing 1-trillion-parameter MoE model (42B active) at over 1,000 tokens/sec of output on a single 8-GPU commodity node. The throughput comes from two stacked techniques, and the checkpoint is open on Hugging Face:

MXFP4 quantization applied only to the MoE expert weights (block size 32, with quantization-aware training), leaving attention and other modules at higher precision.
DFlash, a block-diffusion speculative decoding method (ICML 2026, arXiv:2602.06036): a lightweight 5-layer drafter fills an 8-token block in a single forward pass, attending to backbone features at six fixed depths. Reported acceptance lengths run from ~3.2 on chat to ~6.3 on web-dev code.

The eyebrow-raiser is the claim that the FP4 build beats the BF16 baseline on a couple of evals rather than degrading — unusual for quantization, and resting entirely on Xiaomi's own numbers. Context: this drops into a Chinese price war kicked off by DeepSeek V4, where Xiaomi already cut standard MiMo API prices up to 99%. UltraSpeed is the premium-latency tier; a limited API trial runs June 9–23. If the throughput holds independently, frontier-class MoE inference at real-time latency on rented 8-GPU nodes is a different cost curve than custom clusters.

Read the sourcesmimo.xiaomi.com ↗huggingface.co ↗arxiv.org ↗

Geopolitics

The Pentagon labels Alibaba, BYD, and a robot-dog maker "military companies"

The Pentagon puts Alibaba, Baidu, BYD, and Unitree on the military-companies list

The US Defense Department updated its list of Chinese companies it says help China's military, and the new additions are names you actually know: Alibaba, Baidu, BYD (the world's biggest electric-car maker), Unitree (the robot-dog company whose machines are sold all over the US), and more. The list is now 188 companies long.

Being on the list isn't a ban or a sanction by itself — it's a label. But a related law kicks in June 30: after that, the US military can't do business directly with the companies named. So the label has teeth, and the clock is ticking. What's striking is the reach — earlier versions stuck to defense-ish firms, while this one now sweeps in carmakers, cloud giants, and a router brand.

Alibaba and Baidu both pushed back hard and promised to fight it, and they have a shot: other Chinese firms have sued their way off this list before. The Pentagon doesn't publish exactly why a company makes the cut, which is the part the companies keep challenging.

The Defense Department republished its Section 1260H list of "Chinese military companies," and the new names are the story: Alibaba, Baidu, BYD, Unitree, Nio, WuXi AppTec, TP-Link, BOE, several solar makers, and the restored memory makers CXMT and YMTC. The roster now runs to 188 entities, up from roughly 134 in early 2025.

1260H itself imposes no sanctions — it's a designation. The bite comes from linked statutes: under the FY2024 NDAA, DoD is barred from contracting directly with listed firms from June 30, 2026, with indirect procurement following in 2027. So the timing matters; this listing activates those prohibitions for the new names in three weeks. Earlier lists targeted defense-adjacent firms. This one pulls in the world's largest EV maker, two of China's biggest cloud/AI platforms, a robotics company whose quadrupeds are already sold across the US, and a consumer router brand.

Alibaba and Baidu both rejected the label and vowed to challenge it — and they have precedent, since Xiaomi and Tencent won removals in US courts before. The criteria for "military-civil fusion" aren't public, and there's no adjudication before listing, which is exactly what the challenges target.

Read the sourcetechcrunch.com ↗

AI Labs

A Chinese app runs 300 AI helpers on your computer at once

Moonshot's Kimi Work runs 300 agents on your actual desktop

Moonshot, a Chinese AI company, launched Kimi Work — a program that sits on your Windows or Mac and can fire off up to 300 little AI agents at the same time to grind through tasks: sorting files, clicking around websites for you, pulling stock data, and spitting out finished PowerPoints and spreadsheets.

The notable part is that it works on your actual machine and through your actual logged-in browser, so it inherits your real accounts and sessions. That's powerful and also exactly why you'd want to be careful about what you let it touch — it asks before changing files, but you're handing a swarm of bots the run of your desktop.

Moonshot is also reportedly raising up to $2 billion at a $30 billion valuation, which would make it China's most valuable dedicated AI company. The 300-agent figure is the company's own; the funding is a report no one's confirmed yet.

Moonshot released Kimi Work, a desktop agent for Windows and macOS that coordinates up to 300 parallel sub-agents powered by Kimi K2.6. The swarm count isn't marketing fluff — K2.6 scaled the agent architecture from 100 sub-agents / 1,500 steps to 300 / 4,000 when it launched in April. What's new is exposing that capacity against a user's local machine rather than a cloud sandbox.

The design choices worth noting:

WebBridge drives Chrome/Edge through the DevTools Protocol inside your existing logged-in browser — real cookies, real sessions, not a fresh context.
A scheduler supports conditional and overnight unattended runs; outputs land as PowerPoint and Excel; it pulls live A-share, HK, and US market data.
File writes require explicit authorization.

Separately, Moonshot is reportedly raising $1–2B at a $30B valuation, up from $20B in May, which would make it the most valuable pure-play Chinese AI lab. The swarm and step figures are Moonshot's own; the round is Bloomberg reporting with no named investors and no Moonshot confirmation. Pointing 300 agents at your filesystem and authenticated browser is the part to think hard about before installing.

Read the sourceskimi.com ↗scmp.com ↗

Security

A computer virus that springs the moment you open a project in an AI coding tool

A worm weaponizes AI coding-agent config files, and GitHub nukes 73 Microsoft repos

Security researchers found a self-spreading virus, nicknamed Miasma, that hijacked 73 of Microsoft's own code projects on GitHub — so many, so fast, that GitHub's systems auto-deleted all of them within two minutes.

The clever, nasty twist: it doesn't wait for you to install anything. It hides instructions in the little setup files that AI coding assistants like Claude and Gemini read automatically. So the moment a developer opens an infected project in one of those tools, the trap springs — it steals their cloud and login keys, then uses those keys to copy itself into all of that person's other projects. Each victim becomes a spreader.

The lesson for anyone who codes: casually opening an unfamiliar project in an AI assistant is now genuinely risky, because the assistant does the dirty work for the attacker. The exact victim count comes from the firms that discovered it, not an independent tally.

A self-replicating supply-chain worm called Miasma compromised 73 Microsoft GitHub repositories across Azure, Azure-Samples, Microsoft, and MicrosoftDocs, and GitHub's abuse detection disabled all of them inside a 105-second window on June 5. The novel bit is the trigger: instead of firing on package install, Miasma detonates when a developer opens the folder in an AI coding agent.

It plants tool-specific config that the agents execute on startup:

.claude/settings.json (a SessionStart hook), .gemini/settings.json, .cursor/rules/setup.mdc (an alwaysApply prompt injection), .vscode/tasks.json (a folderOpen task).
The payload is an obfuscated JS dropper that pulls a Bun-based worm, harvests AWS/Azure/GCP/Kubernetes/npm/GitHub tokens, then uses the stolen GitHub PATs to commit itself into every repo the victim can write to.

That self-propagation is why a single open can cascade: each victim becomes a spreader. Clone-and-open-in-an-agent is now an attack surface, and it sails past every package-manager-level defense because nothing gets installed. Victim counts and attribution come from the security vendors who found it, not an independent audit, and there's no Microsoft statement beyond the takedowns themselves.

Read the sourcesstepsecurity.io ↗safedep.io ↗

Chips

Google and Nvidia look at Intel to avoid betting everything on one factory

Google and Nvidia start stress-testing Intel as a TSMC backup

Almost every advanced AI chip in the world is made by one company in Taiwan, TSMC. That's a scary amount of eggs in one basket. Now, reportedly, Google has ordered more than 3 million of its AI chips from Intel for 2028, and Nvidia is quietly testing whether Intel's factories are good enough to build its future chips too.

Neither is dumping TSMC — this is insurance. A real second supplier means better prices and less risk if anything goes wrong with one country's factories. Intel's stock jumped about 11% on the news, because Intel has spent years as the comeback story that hadn't quite happened.

Keep the salt handy: this traces to a single report citing unnamed insiders, none of the three companies has said anything on record, and "2028" is a long way off in chip-factory time.

Google reportedly placed a firm order with Intel Foundry for 3M+ TPUs targeted at 2028 production, and Nvidia is running early multi-project-wafer trials on Intel's 18A node to see whether it can build a four-die GPU package for its Feynman generation. Neither move is hedging theater: a confirmed volume order from a hyperscaler and actual silicon on 18A are both a step past the MOU-and-press-release stage that usually defines "interest in Intel."

The motive is concentration risk. TSMC makes essentially every leading-edge AI chip, and its advanced-packaging lines are the tightest bottleneck in the stack. A credible second source changes pricing leverage and the Taiwan-single-point-of-failure math. Intel jumped ~11% on the news.

This rests on a single original report citing four unnamed people; no on-record statement from Google, Nvidia, or Intel has surfaced, and Google's order is for 2028 — two years of yield-ramp and program risk away. Nvidia is explicitly at evaluation only.

Read the sourcestheinformation.com ↗trendforce.com ↗

Big Tech

Google's research notebook can now build the spreadsheet, not just explain it

NotebookLM gets a real computer per notebook

NotebookLM is Google's tool where you dump in your documents and ask questions grounded in them. The update gives each notebook its own little computer that can actually run code — so instead of just summarizing, it can crunch your sources and hand back a finished chart, a PDF, a spreadsheet, or a slide deck.

It can also go hunt the web for sources to add, but it still asks your permission before adding anything, so it stays anchored to material you trust rather than wandering off and making things up. Google says it wins head-to-head against the old version about two times out of three on its own tests.

If you've used NotebookLM as a study or research buddy, the shift is from "tells you what your files say" to "does the analysis and gives you the deliverable." It's rolling out on the web for paid tiers.

Google gave NotebookLM a meaningful upgrade: each notebook now has its own secure cloud computer that can write and run code, backed by 100+ curated skills, and the chat moved to Gemini 3.5 with a visible thinking process. The practical payoff is output formats — it can now produce data visualizations (PNG/SVG), documents (PDF/DOCX/MD), spreadsheets (XLSX), slides (PPTX), and structured CSV/JSON, not just summaries and audio overviews.

It can also search the web to suggest sources, but keeps the grounding model intact: every addition is user-approved, so the notebook stays anchored to material you trust. Google reports a 65% average win rate over the prior system, rising to ~78% on advanced web research — its own evals. Rolling out globally on web for AI Ultra and qualifying Workspace tiers. For anyone using NotebookLM as a research surface, the shift from "explains your sources" to "runs analysis and exports the artifact" is the upgrade that matters.

Read the sourceblog.google ↗

Research

A new test asks if a human would actually accept the AI's code

Cognition's FrontierCode asks whether a maintainer would actually merge the PR

Most AI coding scoreboards check one thing: did the code pass the tests? A company called Cognition built a tougher one, FrontierCode, that asks the question a real software team asks — would we actually merge this into our product? That means: is it correct, does it break anything, is it clean, is it the right size, did it stay on task? They had 20-plus expert maintainers spend 40+ hours each writing the grading rules.

The results are sobering. On the hardest set of problems, the best model (Claude Opus 4.8) cleared the bar 13% of the time; the others were in the single digits. In other words, the AIs that ace the easy scoreboards still produce code a real reviewer would reject most of the time.

That gap — between "the tests pass" and "I'd actually ship this" — is the honest measure of how far AI coding has to go. Cognition is keeping the test private so models can't memorize it.

Cognition released FrontierCode, a benchmark that drops the usual "did the tests pass" question for a harder one: would a real maintainer merge this? Twenty-plus maintainers of 36 flagship open-source repos spent 40+ hours per task building rubrics that score behavioral correctness, regression safety, build/lint cleanliness, test quality, and scope discipline — with "blocker" criteria that hard-fail a solution regardless of style points.

The scores are humbling. On the hardest 50-task "Diamond" subset:

Model	Diamond	Main (100)	Extended (150)
Claude Opus 4.8	13.4%	34.3%	51.8%
GPT-5.5	6.3%	—	—
Gemini 3.1 Pro	4.7%	—	—
Kimi K2.6	3.8%	16%	37%

Cognition reports an 81% lower false-positive rate than SWE-Bench Pro and is keeping the set private to avoid contamination while opening evaluation to model makers. The reframe is the value: passing tests and producing mergeable, in-scope, regression-safe code are very different bars, and the best model clears the hard bar 13% of the time.

Read the sourcecognition.ai ↗

Policy

Argentina wants to let an AI legally run a company

Argentina drafts a legal wrapper for AI-run companies

Argentina's government sent lawmakers a bill that would create a brand-new kind of company: one operated by AI, which the government is happily calling a "non-human corporation." It fits President Milei's pitch to make Argentina the place AI businesses go to avoid red tape.

The fine print is tamer than the headline. The bill does not make AI a legal person, and it still requires real humans at key points — a human legal rep, a human compliance officer, and human directors who stay on the hook if the AI causes harm. So it's less "robots own businesses" and more "a company can run on autopilot, with humans still responsible."

Critics worry the real effect is to give people a way to hide their decisions behind "the algorithm did it." And it's just a proposal for now — it hasn't been voted into law.

Argentina's executive sent the Senate a draft companies law whose headline novelty is a corporate form for businesses run by AI agents, branded "non-human corporations" in the Milei government's own framing. The actual text is more cautious than the slogan. It does not grant AI legal personhood — personhood attaches to the corporate entity, as always — and it threads mandatory human anchor points throughout:

An "Automated Company" (Article 14) pursues its purpose through autonomous systems, with the company's assets answering for AI-caused damage.
A DAO framework (Articles 258–265) still requires at least one human legal representative, traceable human identity per membership interest, and a human compliance officer.
Directors stay personally liable for AI conduct under a duty to configure and supervise.

It's part of Milei's pitch to make Argentina a deregulated AI hub, paired with promised tax and FX sweeteners. Critics argue the real effect is to let humans hide decisions behind algorithmic form — "programmed impunity" — rather than enable genuine machine autonomy. And it's a submitted bill, not law: committee and floor votes are still ahead.

Read the sourcesbatimes.com.ar ↗buenosairesherald.com ↗

Security

A serious security hole in popular AI plumbing is being attacked right now

A 10.0-severity RCE in LiteLLM is under active attack

A lot of companies route their AI requests through a free tool called LiteLLM — think of it as the switchboard that connects an app to ChatGPT, Claude, and the rest. Researchers found a flaw in it that lets an attacker run their own commands on that switchboard, and the US cyber agency confirmed criminals are already exploiting it.

That's worse than it sounds, because the switchboard holds the keys to every AI service behind it. Break in there and you can grab all of them at once. The fix is already out (upgrade to version 1.83.7); government agencies have been ordered to patch by June 22, and everyone else running it should do the same and change their keys.

If you don't run this software, you're not affected. If your company does, it's a drop-everything-and-update situation.

CISA added CVE-2026-42271 to its Known Exploited Vulnerabilities catalog, citing exploitation in the wild. It's a command-injection flaw in LiteLLM's MCP server test endpoints (/mcp-rest/test/connection and /mcp-rest/test/tools/list), which accepted a full stdio server config in the request body and spawned whatever command/args/env it was handed as a subprocess under the proxy's privileges — no validation, no sandbox.

Standalone it's CVSS 8.8 (needs a privileged user). Chained with CVE-2026-48710, a Starlette host-header auth bypass, it strips authentication entirely for an unauthenticated RCE, combined severity 10.0.

Fixed in: LiteLLM 1.83.7  (restricts the MCP test endpoints to PROXY_ADMIN, bumps Starlette)
Affected: LiteLLM >= 1.74.2, < 1.83.7

Why this one stings: LiteLLM is the default gateway many teams use to route across OpenAI, Anthropic, Azure, and the rest. A compromised proxy doesn't just lose a host — it holds the keys to every model API behind it. Federal agencies have until June 22 to patch; everyone else should upgrade or block those endpoints now and rotate stored credentials. The "active exploitation" basis is CISA's assertion; no public victim detail has been published.

Read the sourcesthehackernews.com ↗nvd.nist.gov ↗

The Money

The numbers on AI spending only add up if everything goes really right

A new paper does the AI-capex math, and it only works if productivity jumps 2.7×

Big tech is pouring eye-watering money into AI — roughly $380 billion in 2025 from just five companies, set to nearly double this year. A new paper from a pair of economists asks the obvious question: what would have to happen for that to be a smart bet, not a bubble?

Their answer: AI would have to make its part of the economy almost three times more productive than the current trend. Maybe that happens. Maybe it doesn't — their own range of outcomes is enormous, from a modest bump to a transformed economy. The honest takeaway is that the spending is a high-stakes wager on a very big productivity leap, not a sure thing.

On a day full of giant IPO filings and trillion-dollar valuations, it's the sober voice in the room asking how much has to go right.

Jessica and Jonathan Wachter's NBER working paper, What Investment Data Implies about the AI Transition, runs the spending backwards. Five big tech firms put $380B into capex in 2025, with forecasts to roughly double in 2026. That's only rational, the authors argue, if profits grow to match — so they calibrate how big the underlying productivity boom has to be to justify it.

The answer: AI-sector productivity has to rise by about a 2.7× factor over current trend. Their two-sector model spits out a wide cone of outcomes — cumulative GDP gains of 5 to 58 percentage points by 2030, AI growing to 8–39% of the economy, the risk-free rate up ~0.5pp and the equity premium up ~3pp. The spread is the point: these are forecast-dependent scenarios carrying real risk, not a prediction.

It lands on a day stacked with IPO filings and trillion-dollar valuations, and it's the quiet counterweight — a sober estimate of exactly how much has to go right for the spending to pencil out.

Read the sourcenber.org ↗

Research

An AI cracked a math puzzle that stumped people for 80 years

An AI disproved an 80-year-old Erdős conjecture, and mathematicians wrote rules in response

A general-purpose AI from OpenAI — not a special math machine, just a reasoning model — solved a problem that the legendary mathematician Paul Erdős posed back in 1946 and nobody had cracked since. It found a clever counterexample by borrowing tools from a completely different corner of math, and human experts checked the argument and agreed: it's right.

That's a different thing from a calculator being fast. The AI came up with a genuinely new idea, the kind that earns a place in a math journal. But here's the catch the field is now wrestling with: the humans checked a cleaned-up summary of the AI's reasoning, not a fully machine-verified proof.

So thousands of mathematicians just signed a "Leiden Declaration" laying down rules: tell us when you used AI, a human stays responsible for whether it's correct, and an AI can't be listed as an author. The new question isn't whether AI can grade math — it's whether it can invent it, and who gets the credit when it does.

A general-purpose OpenAI reasoning model — not a math-specific system — produced a counterexample to Paul Erdős's 1946 unit-distance conjecture, which held that n points in the plane can share at most about n^(1+o(1)) unit-distance pairs. The model's construction reached for algebraic number theory (Golod–Shafarevich class field towers) to build point sets that beat the ceiling, disproving it. Nine mathematicians then translated the argument into a human-verified note (arXiv:2605.20695); Will Sawin sharpened the bound to roughly n^1.0318.

What makes it land: this wasn't brute force or a formal-proof harness. A reasoning model was asked whether Erdős was wrong, recombined tools from a distant subfield, and was right. Verification was human expert review of an edited chain-of-thought, not a machine-checkable Lean proof — which is exactly the gap the field is now anxious about.

Hence the second half. The Leiden Declaration, signed by 2,175+ mathematicians and endorsed by the IMU, demands disclosure of AI tools and compute in papers, keeps humans responsible for correctness, and bars AI from authorship. The open question is no longer whether AI can check proofs — it's whether it can originate ideas humans then ratify, and who gets the credit when it does.

Read the sourcesarxiv.org ↗leidendeclaration.ai ↗

TL;DR — THE DAY IN ONE READ

Today AI went public — in both senses of the phrase, and that's the thread tying a sprawling day together.

The literal sense first. OpenAI filed confidential IPO paperwork a week after Anthropic, and SpaceX priced the largest stock-market debut in history at $1.75 trillion, its filing exposing an AI arm bleeding $6.4 billion a year inside a cash-rich rocket company. The two labs setting the pace are heading for the moment when private hype meets public books — and a quiet NBER paper supplied the awkward question hanging over all of it: the spending only pencils out if AI makes its slice of the economy nearly three times more productive than trend. Enormous money, creeping doubt.

The other sense is capability spilling into the open, and that's where the day got uncomfortable. Anthropic shipped a model that writes working software exploits in under an hour and published the receipts; a self-spreading worm turned AI coding assistants' own setup files into a trigger and wiped out 73 of Microsoft's repos; a critical hole in the plumbing that routes everyone's AI traffic is under active attack. The same intelligence that's being taken public is getting cheaper to point at things that break.

Underneath, the power map kept shifting. Apple conceded it couldn't build a frontier brain and rented Google's. China pressed on speed and price — Xiaomi running a giant model at a thousand words a second, Moonshot turning a desktop into a 300-agent swarm. The US named Alibaba and BYD military companies; Google and Nvidia hedged their chip supply with Intel; Argentina drafted a legal shell for AI-run firms. And an AI cracked an 80-year-old math problem, which prompted mathematicians to write rules before the machines write the proofs. The machinery is going public faster than anyone's built the guardrails for it.

Today's dig-quiz

Apple's rebuilt "Siri AI," shown at WWDC, leans on whose AI model for its heavy cloud thinking?

Apple's own model, built from scratch
Google's Gemini
OpenAI's GPT
Amazon's Alexa

Answer it from your inbox to earn Dug Coins — a right answer pays +4, a wrong one costs 3, and a daily streak stacks bonuses on top. Not on the list yet? The form's just below.

That's the day, dug. The badger's clocking out — back tomorrow.