What is the $10 million fund from Google DeepMind for?

The fund supports academic and independent research into the safety risks that emerge when millions of AI agents interact autonomously at scale. It aims to build a dedicated research field for multi-agent AI safety, covering threats like prompt injection attacks, fraud, and broader cybersecurity vulnerabilities.

What is a prompt injection attack in AI agents?

A prompt injection attack occurs when malicious instructions are hidden within content an AI agent is told to read or process — such as a document or webpage. The agent then executes those hidden instructions, effectively being hijacked and turned into a self-directed malicious actor without the user's knowledge.

Who else besides Google DeepMind is working on multi-agent AI safety?

Anthropic has published AI agent deployment guidelines based on the Zero Trust cybersecurity framework, which assumes systems are inherently vulnerable and that breaches will occur. Cybersecurity firms such as Akeyless have also highlighted how agentic AI breaks traditional security assumptions built around human-written software.

Google DeepMind Backs $10M Research Fund to Address…

Google DeepMind is funding research into the potential dangers that could arise when millions of distinct AI agents interact with one another across the internet.

According to Rohin Shah, who leads AGI safety and alignment research at the company, the mass deployment of agentic systems — capable of executing tasks without human supervision and accepting instructions from other AI agents — introduces an entirely new category of risk.

$10 Million Research Fund Launched

To address this challenge, Google DeepMind — which placed agentic tools front and center at last month's Google I/O developer conference — has partnered with several organizations to announce a $10 million research fund. The fund is intended to support researchers studying the behavioral dynamics of multi-agent systems and developing methods to prevent unsafe outcomes.

Participating organizations include:

Schmidt Sciences: the philanthropic foundation established by Eric and Wendy Schmidt
ARIA: the UK government's Advanced Research and Invention Agency
Cooperative AI Foundation: a UK-based nonprofit research organization
Google.org: Google's philanthropic arm

Shah said the initiative is designed to stimulate research beyond the technology industry itself: "The advantage of academia is the ability to look further into the future and pursue work that industry labs haven't yet prioritized."

He added: "The primary problem is that there is virtually no academic field dedicated to multi-agent safety, and we want that field to exist."

The Risk: A Breakdown of the Digital Commons

The risks that concern Shah and James Fox — head of the Trustworthy AI Science program at Schmidt Sciences — are largely amplified versions of existing online threats: fraud, prompt injection attacks (in which malicious instructions are embedded in an AI agent, turning it into a self-driven malicious actor), and other forms of cyberattack.

"We have a digital commons that is essential to the functioning of society, and we need to make sure it doesn't descend into chaos," Fox warned.

Shah believes there are only months remaining before AI agents are deployed at scale across the broader economy — and potentially before those risks materialize.

Sandbox Simulation: The Only Path Forward

Both Shah and Fox argue that the only viable method for understanding how large numbers of multi-agent systems behave when interacting simultaneously is through real-world scenario simulation — placing AI agents in sandboxed environments and studying their behavior.

Fox emphasized that it is not possible to predict emergent group behavior by studying individual agents or small agent clusters, nor can researchers assume that large language model (LLM)-based AI agents will always act rationally. The complexity stems from the sheer volume of interactions occurring simultaneously.

Some researchers, including a team at Google DeepMind, have proposed that artificial general intelligence (AGI) — if achievable — may not emerge from a single superintelligent model, but rather from a kind of collective agent consciousness whose combined capabilities exceed the sum of its parts.

Zero-Trust Security Frameworks Gain Attention

Google DeepMind is not the only leading AI company raising alarms about risks posed by its own technology. Weeks ago, Anthropic published deployment guidelines for AI agents grounded in the cybersecurity field's Zero Trust model — a framework that assumes computer systems are inherently vulnerable, treats agents as potential attackers, and accepts that breaches will eventually occur.

Refael Angel, co-founder and CTO of Israeli cybersecurity firm Akeyless, based in Tel Aviv, stressed the importance of understanding the novel risks introduced by agentic systems:

"Every security approach to date assumed the protected machine was running software written by humans, executing fixed operations along fixed paths. Agentic systems break all of those assumptions — they reason, improvise, and can be hijacked by a single sentence buried in a document they were asked to read."

Angel welcomed the new funding, stating: "It shouldn't be up to a single lab to set security standards that everyone else must trust." However, he cautioned that security researchers sometimes overlook existing "mundane" problems in favor of more theoretically compelling hypothetical threats.

Fox offered a closing perspective: "Risks that were hypothetical just a few years ago are now very real. The future is arriving faster than expected."

原文來源： 查看原文

Latest

Mountain Horse Solutions Launches Talon DT-300 Heavy-Lift Long-Endurance Drone

Mountain Horse Solutions has officially unveiled the Talon DT-300, a heavy-lift, long-range drone engineered for demanding commercial and special-mission applications. The platform targets use cases requiring greater payload capacity, extended operational range, and sustained performance, including infrastructure inspection, search and rescue, logistics delivery, and environmental monitoring.

商業無人機基礎設施巡檢

13 days ago

Source: sUAS News

Google DeepMind Backs $10M Research Fund to Address Safety Risks in Large-Scale Multi-Agent AI Systems

Highlights

$10 Million Research Fund Launched

The Risk: A Breakdown of the Digital Commons

Sandbox Simulation: The Only Path Forward

Zero-Trust Security Frameworks Gain Attention

FAQ

Subscribe to our Low-Altitude Industry Newsletter

Bird-Inspired Wake Model Maps a Path to More Efficient Drone Swarms

Brazil's ANAC Publishes Draft Noise Certification Standards for Eve 100 eVTOL

Mountain Horse Solutions Launches Talon DT-300 Heavy-Lift Long-Endurance Drone