Tag: Google

  • Claude Uncovers a 27-Year-Old Bug, Meta Bets $130B, and the Agentic Paradox Takes Hold

    Something quietly significant is happening in AI right now. The technology is no longer just generating text or images — it’s hunting software bugs that human engineers missed for decades, reshaping how much money the world’s biggest companies are willing to spend, and bumping into some thorny contradictions of its own making. Here’s what caught our attention this week.

    Anthropic’s Claude Mythos Found a Bug That’s Been Hiding Since 1997

    Anthropic launched Project Glasswing, giving select partners — including AWS, Apple, Cisco, Google, JPMorgan Chase, and Microsoft — early access to its most powerful model yet, Claude Mythos Preview, specifically to hunt down critical software vulnerabilities. The results are striking: in just weeks of internal testing, Mythos identified thousands of zero-day vulnerabilities across every major operating system and web browser. Among them was a 27-year-old bug lurking in OpenBSD — a flaw that had survived countless human audits since 1997.

    This isn’t just a headline-grabbing demo. It signals a genuine shift in how AI might be used defensively. The same capabilities that worry security researchers (AI-powered hacking) may also become our best tool for finding and patching weaknesses before attackers do.

    Google’s Gemini 3.1 Ultra: Two Million Tokens and True Multimodality

    Google launched Gemini 3.1 Ultra with a 2-million token context window — enough to reason across entire codebases, lengthy research documents, or hours of video in a single pass. What makes it notable isn’t just the size: Gemini 3.1 Ultra was designed from the ground up to reason across text, images, audio, and video simultaneously, without routing through separate transcription or processing steps. Google also added a sandboxed Code Execution tool, letting the model run and test its own code inline. With Google I/O around the corner, the company is clearly in sprint mode.

    Meta Is Spending Like There’s No Tomorrow

    Meta announced AI capital expenditures of $115–135 billion for 2026 — nearly double last year’s spending. That’s an extraordinary number, and it reflects just how seriously the company is taking the gap between itself and OpenAI and Google on frontier model development. Infrastructure at this scale means data centers, chips, energy, and talent, all competing for the same limited pool of resources. Whether this investment pays off in model quality is something we’ll be watching closely throughout the year.

    The Agentic Paradox: AI Agents Are Getting Expensive

    Here’s the contradiction nobody quite expected: as businesses rush to deploy autonomous AI agents, the cost of the frontier models powering them is rising sharply. Cloudflare recently credited AI with eliminating 1,100 roles — even as it posted record revenue — joining a growing list of tech companies linking headcount reductions to automation. But the irony is real: the efficiency gains AI promises can be partially eaten up by the compute costs of running increasingly capable models. The companies that figure out how to deploy agents cost-effectively will have a significant edge.

    Colorado Revamps Its Landmark AI Law

    On the regulatory front, Colorado overhauled its groundbreaking AI law — one of the first in the U.S. to specifically target AI systems making consequential decisions about jobs, healthcare, education, housing, and credit. The revisions reflect real-world pushback from industry and a desire to make the law more workable without gutting its core protections. It’s a useful case study in what AI regulation looks like when it moves from theory to practice.

    What This Moment Feels Like

    This week’s stories share a common thread: AI is becoming something that acts in the world, not just assists with it. Models are autonomously finding vulnerabilities, companies are committing generational levels of capital, and the unintended consequences — cost paradoxes, regulatory friction, workforce disruption — are arriving right alongside the breakthroughs. None of this is reason for panic or uncritical excitement. But it’s worth paying attention, because the decisions being made right now — by companies, regulators, and researchers — will shape how this all unfolds. As always, we’re watching with curiosity.

  • China’s Coding Surge, Google’s Pentagon Gamble, and the New Shape of AI Power

    The AI landscape doesn’t sit still for long, but even by its own standards, the past 48 hours have felt like a gear shift. From Beijing to Brussels, from Mountain View to the Pentagon’s classified networks, this week’s headlines are less about chatbots and more about who controls the most powerful technology on earth—and what they’re willing to do with it.

    China’s Open-Weight Blitz

    Four Chinese AI labs released open-weight coding models in a single twelve-day window, and the benchmarks are turning heads. Moonshot’s Kimi K2.6 briefly claimed the top spot on SWE-Bench Pro—a rigorous test of autonomous software engineering—beating closed models from OpenAI and Anthropic. GLM-5.1 from Z.ai, a 744-billion parameter mixture-of-experts model trained entirely on Huawei’s Ascend 910B chips (no Nvidia hardware), also topped the leaderboard. MiniMax M2.7 and DeepSeek V4 rounded out the group.

    What makes this wave remarkable isn’t just capability—it’s cost. Chinese frontier models are pricing at 15–30× cheaper than comparable Western offerings, with DeepSeek offering cache-hit pricing as low as $0.07 per million tokens. Because these are open-weights releases, developers worldwide can run or fine-tune them without paying anyone. If performance and cost continue on this trajectory, the competitive advantages of proprietary Western models will face real pressure.

    Google “Proudly” Arms the Pentagon

    Google has officially extended its AI partnership with the U.S. Department of Defense into classified networks, amending a $200 million contract to allow Gemini to be used for sensitive operations including mission planning and weapons targeting. The move drew swift internal backlash: more than 580 Google and DeepMind employees signed an open letter to CEO Sundar Pichai, calling the deal “inhumane” and urging him to pull back.

    Google’s response was unambiguous. A company memo told staff it “proudly” supports U.S. military work. The contrast with 2018’s Project Maven saga is striking—back then, 4,000 signatures and a dozen resignations were enough to kill a drone surveillance contract worth a few million dollars. Today, 580 voices face a classified AI market worth tens of billions, and a company that has already removed its earlier AI ethics red lines. Anthropic took the opposite stance: the Pentagon reportedly designated it a “supply chain risk” after CEO Dario Amodei refused unrestricted military use of Claude.

    The C-Suite Is Being Rebuilt Around AI

    A new IBM Institute for Business Value study of global CEOs finds that AI isn’t just changing workflows—it’s changing who sits at the executive table. Between 2026 and 2028, CEOs expect 53% of employees to need upskilling for their current roles, and 29% to require reskilling for entirely different jobs. New leadership roles are emerging to bridge AI strategy, ethics, and operations in ways that don’t fit neatly into existing org charts. The message from the corner office: the companies that figure out AI governance structures first will have a durable edge over those still retrofitting old hierarchies.

    Cohere and Aleph Alpha Merge into a Transatlantic AI Force

    In a deal blessed by both the Canadian and German governments, Cohere—last valued at $6.8 billion—has merged with Germany’s Aleph Alpha, a European AI champion known for its focus on data sovereignty and privacy. The combined entity positions itself as a credible alternative to American mega-labs for enterprises wary of routing sensitive data through U.S.-based providers. It’s a calculated bet that geopolitical anxiety about AI supply chains is a durable market, not a passing mood.

    Allegiance Is the New Capability

    Taken together, this week’s news sketches a world where AI power is dispersing and aligning along new fault lines. Chinese open-weight models are eroding Western moats. European labs are consolidating to stay relevant. American tech giants are planting flags in classified government infrastructure. The technology is maturing fast enough that the most consequential decisions now aren’t about benchmarks or parameter counts. They’re about who you’re building for—and who you’re willing to say no to.