返回列表
🧠 阿头学 · 💰投资

AlphaGo 十年:从围棋到 AGI 的搜索与规划之路

DeepMind 用十年验证了一个假设:搜索+规划+工具的组合,正在从游戏攻克走向科学突破,而 AGI 的关键不在模型大小,在于系统架构。
打开原文 ↗

2026-03-11 原文链接 ↗
阅读简报
双语对照
完整翻译
原文
讨论归档

核心观点

  • 搜索与规划是 AGI 的真正引擎,不是参数规模

AlphaGo 的胜利源于"深度网络(直觉)+ 蒙特卡洛树搜索(审慎)+ 强化学习(自我迭代)"的组合。这套方法已被复用到 AlphaFold(蛋白质折叠)、AlphaProof(数学证明)、AlphaEvolve(算法发现),证明搜索空间的智能导航,而非单纯的模式匹配,才是突破的关键。Gemini 的 Deep Think 模式正是将这种"搜索与规划"嵌入通用模型。

  • 创造力可被系统化产生与验证

Move 37 不是灵感,而是在 10^170 可能性空间中通过搜索找到的反常识但可验证的解。AlphaEvolve 在代码空间发现矩阵乘法新方法,属于同一类"可计算的创造力"。这意味着创新不再是黑箱,而是可被复制的系统流程。

  • AGI 的配方:世界模型 + 搜索规划 + 工具调用

文章明确指出,实现 AGI 需要三个要素的结合:Gemini 的多模态世界模型(理解物理世界)、AlphaGo 的搜索与规划技术(在可能性空间中导航)、专用工具的调用能力(如 AlphaFold 用于蛋白质结构)。这不是单一模型的突破,而是系统架构的突破。

  • 从"模仿人类"到"超越人类经验"的范式转移

AlphaGo Zero 从零开始自学,最终超越所有人类棋谱。这证明 AI 不再受限于人类存量数据的质量,而是能在自博弈中产生人类从未见过的知识。这对科学发现的意义在于:AI 可以在"假设空间"中进行大规模搜索,而不仅仅是在已知文献中检索。

  • 科学工具化的成功已被验证

AlphaFold 的开源数据库被全球 300 万研究者使用,从疟疾疫苗到"吃塑料"酶等领域都有应用。这不是概念验证,而是硬指标——证明当科学问题可被转化为"结构预测"或"空间搜索"问题时,这套范式能产生诺贝尔奖级的突破。

跟我们的关联

对 Neta 海外增长的启示:从"拍脑袋投放"到"搜索式增长"

  • 意味着什么:增长本质上也是巨大状态空间(渠道×创意×人群×时段×定价×落地页)的导航问题。不是做更多实验,而是更聪明地搜索。
  • 下一步:把增长系统设计成"候选集生成(LLM)→小范围搜索实验(多臂老虎机/贝叶斯优化)→胜者扩展"的闭环,目标是找到"反直觉但有效"的破局打法(如 Move 37 式的增长曲线)。

Agent 的产品壁垒来自"工具编排",不是"更会说"

  • 意味着什么:Neta 的 agent 竞争力应该来自调用专用工具的能力与可验证结果,而非对话体验的微小差异。
  • 下一步:设计"工具路由层"(何时调用哪种工具、如何验证输出),把"调用成功率/验证通过率/任务完成时长"做成核心指标,逼近文章描述的"通用+专用"AGI 路线的产品化版本。

团队决策的"多智能体辩论机制"

  • 意味着什么:文章提到 AI 联合科学家通过多个智能体对假设进行"辩论"来识别模式。20 人团队可以制度化这一机制。
  • 下一步:每个关键决策(海外市场选择、品牌定位、增长杠杆)都强制引入"反对者角色/反对者智能体",用证据与实验设计裁决,减少共识幻觉,加快证伪。

讨论引子

  • 如果 Neta 的海外增长是一盘棋,你现在是在模仿"人类专家的套路",还是在给 AI 足够的搜索空间去找那步"第 37 手"?
  • 你的 Agent 现在调用了多少个"专用工具"?如果少于 5 个,它还不够"通用"——它只是一个更聪明的搜索框。
  • 团队的关键决策中,有多少次真正引入了"反对者论证"而非"共识确认"?

你的浏览器不支持 video 标签。

十年前,我们的 AI 系统 AlphaGo 成为首个在复杂的围棋比赛中击败世界冠军的程序——这一里程碑比许多专家当时认为可能实现的时间整整提前了十年。

这一成就宣告了如今被认为是现代人工智能(AI)时代的开端。凭借一次富有创造力的落子——著名的“第 37 手(Move 37)”,AlphaGo 展示了 AI 的潜力,并发出信号:我们已经拥有了开始攻克真实世界科学问题所需的技术。

今天,这一突破仍在持续塑造我们构建通往人工通用智能(AGI)之路上各类系统的工作。我们相信,AGI 将是人类迄今发明过的最深远的技术,并可能成为推动科学、医学与生产力跃迁的终极工具。

创意火花

2016 年,超过 2 亿人观看了 AlphaGo 在首尔对阵世界冠军围棋选手李世石。比赛的转折点来自第二局中的“第 37 手(Move 37)”,这一步法极其反常,以至于职业解说起初认为它是失误。但事实证明,它是决定性的。大约一百手之后,那枚棋子恰到好处地落在了能让 AlphaGo 赢下对局的位置。这一幕展示了惊人的前瞻性,也体现了该 AI 系统能够超越对人类专家的模仿,找到全新的策略。

你的浏览器不支持 video 标签。

在 YouTube 上观看 AlphaGo 纪录片

围棋长期以来都是 AI 研究的试金石,因为它的复杂度极高。棋盘上可能的局面多达 10170 种——远远超过可观测宇宙中的原子数量。

为了让这项任务变得可处理,AlphaGo 将深度神经网络与高级搜索和强化学习结合起来——这是一种由 DeepMind 开创 的 AI 方法。

AlphaGo 先从人类专家对局中学习,建立起对“合理落子”的模型;随后又与自己进行几十万盘对弈,在不断强化最强的取胜策略中持续提升。系统随后只考虑最可能产生收益的路径,并在这更小的候选落子集合中,找到最可能引导它取胜的一步。

在 AlphaGo 之后,我们构建了 AlphaGo Zero,它从完全随机的对弈开始学习,最终成为史上或许最强的围棋选手。接着,我们将系统进一步泛化为 AlphaZero,它从零开始自学,掌握任何“二人、完美信息”类游戏,包括围棋、国际象棋和将棋。除了游戏规则之外不具备任何先验知识的 AlphaZero,能够在数小时内学会精通国际象棋,并击败不仅是顶尖人类棋手,还有当时最优秀的专用国际象棋程序,例如 Stockfish。即便国际象棋早已在这些程序的辅助下被深入分析,AlphaZero 仍然像在围棋中一样,提出了耐人寻味的新策略

这进一步证明了我在首尔获胜那一刻就确信的一点——这项技术已经准备好被用于我们更真实的目标:加速科学突破。

我认为,AlphaGo 带来的最大启示,是对 AI 时代的一次确定性预演——它证明这并非遥远而模糊的未来,而是正来到我们门口的现实。它像一份“来自未来的路线图”,向人类发出了关于世界即将如何改变的清晰信号。

围棋大师 李世石

UNIST 兼职教授

催化科学突破

通过证明自己能够在围棋棋盘那巨大的搜索空间中导航,AlphaGo 展示了 AI 有潜力帮助我们更好地理解物理世界的浩瀚复杂性。我们首先尝试解决蛋白质折叠问题——这是一项持续了 50 年的重大挑战:预测蛋白质的三维结构。这些信息对于理解疾病以及开发新药至关重要。

2020 年,我们终于用 AlphaFold 2 系统攻克了这一长期悬而未决的科学难题。随后,我们为科学界已知的全部 2 亿种蛋白质预测并“折叠”出了结构,并将其以开源数据库的形式免费提供给科研人员。如今,全球超过 300 万研究者使用 AlphaFold 数据库 加速重要工作,涵盖从疟疾疫苗到“吃塑料”酶等诸多领域。2024 年,John Jumper 和我代表整个 AlphaFold 团队,因领导这一项目而获颁诺贝尔化学奖,这对我们而言是一生的荣耀。

自 AlphaGo 获胜以来,我们已将其开创性方法应用到科学与数学的许多其他领域,包括:

数学推理:作为 AlphaGo 架构最直接的后继,AlphaProof 结合语言模型与 AlphaZero 的强化学习和搜索算法,学习证明形式化数学命题。它与 AlphaGeometry 2 一起,成为首个在国际数学奥林匹克(IMO)上达到奖牌标准(银牌)的系统,证明 AlphaGo 的方法能够解锁高级数学推理,并为我们最强大的通用模型奠定基础。

我们规模最大、能力最强的模型 Gemini,最近更进一步。其 Deep Think 模式的高级版本达成了 2025 年 IMO 的金牌级表现,所采用的方法也受到了 AlphaGo 的启发。从那以后,Deep Think 又被应用于科学与工程中更复杂、更开放式的挑战。

算法发现:正如 AlphaGo 在对局中搜索最佳落子一样,我们的代码智能体 AlphaEvolve 在计算机代码空间中探索,以发现更高效的算法。它也拥有自己的“第 37 手(Move 37)”时刻:找到了矩阵乘法的一种新方式——这是一种为几乎所有现代神经网络提供动力的基础数学运算。如今,AlphaEvolve 正在从数据中心优化到量子计算等一系列问题上接受测试。

科学协作:我们正将 AlphaGo 开创的搜索与推理原则整合进一位 AI 联合科学家 中。通过让多个智能体对科学观点与假设进行“辩论”,该系统成为一位协作者,能够执行严谨思考,以在数据中识别模式并解决复杂问题。在与 伦敦帝国理工学院 的验证研究中,它分析了数十年的文献,并独立得出了与研究人员花费多年提出、并通过实验验证的同一项关于抗微生物耐药性的假设。

我们也用 AI 来更好地理解基因组推进聚变能源研究改进天气预测 等等。

尽管我们的科学模型令人印象深刻,但它们高度专门化。要实现诸如创造无限清洁能源、或解决我们今天尚未理解的疾病这类根本性突破,我们需要能够发现不同学科之间底层结构与联系的通用 AI 系统,并像最优秀的科学家那样帮助我们提出新假设。

智能的未来

要让 AI 真正通用,它需要理解物理世界。我们从一开始就将 Gemini 构建为多模态,使其不仅能理解语言,也能理解音频、视频、图像与代码,从而建立对世界的模型。

为了跨越这些模态进行思考与推理,最新的 Gemini 模型使用了我们在 AlphaGo 与 AlphaZero 中开创的一些技术。

下一代 AI 系统还需要能够调用专用工具。例如,如果模型需要知道某种蛋白质的结构,它就可以使用 AlphaFold 来获得答案。

我们认为,将 Gemini 的世界模型、AlphaGo 的搜索与规划技术,以及专用 AI 工具的使用相结合,将被证明是实现 AGI 的关键。

真正的创造力,是这样的 AGI 系统必须展现的一项关键能力。“第 37 手(Move 37)”让我们瞥见了 AI 跳出框架思考的潜力,但真正的原创发明还需要更多。它不仅要像 AlphaGo 那样提出一种新颖的围棋策略,更要能够真正发明出一款同样深邃、优雅、也同样值得研究的游戏——正如围棋本身。

在 AlphaGo 的传奇胜利十年之后,我们的终极目标已近在眼前。最初在“第 37 手(Move 37)”中闪现的创意火花,催化出一系列如今正在汇聚的突破,为通往 AGI 的道路铺平了前路——并开启科学发现的新黄金时代。

了解 AlphaGo 在 YouTube 上观看 AlphaGo 纪录片

相关文章

AlphaGo

了解更多

与 AlphaGo 及中国顶尖棋手一起探索围棋之谜

2017 年 4 月 研究

了解更多

AlphaGo 的创新

2017 年 4 月 研究

了解更多

AlphaGo 的下一步

2017 年 5 月 研究

了解更多

关注我们

订阅,获取我们最新创新的动态

我接受 Google 的条款与条件,并确认我的信息将依据 Google 的隐私政策 使用。

AlphaGo at 10: How AI Innovation Is Paving the Path to AGI — Google DeepMind

Skip to main content

Our mission is to build AI responsibly to benefit humanity

About Google DeepMind

Responsibility

Ensuring AI safety through proactive security, even against evolving threats

Learn more

News

Discover our latest AI breakthroughs, projects, and updates

Learn more

Careers

We’re looking for people who want to make a real, positive impact on the world

Learn more

Learn more

Education National Partnerships for AI The Podcast

Models

Research

Science

About

我们的使命:以负责任的方式构建 AI,造福人类

关于 Google DeepMind

责任

通过前瞻性安全防护确保 AI 安全,即使面对不断演化的威胁

了解更多

新闻

了解我们最新的 AI 突破、项目与动态

了解更多

招聘

我们正在寻找希望对世界产生真实、积极影响的人

了解更多

了解更多

Education National Partnerships for AI The Podcast

模型

研究

科学

关于

Our mission is to build AI responsibly to benefit humanity

About Google DeepMind

Responsibility

Ensuring AI safety through proactive security, even against evolving threats

Learn more

News

Discover our latest AI breakthroughs, projects, and updates

Learn more

Careers

We’re looking for people who want to make a real, positive impact on the world

Learn more

Learn more

Education National Partnerships for AI The Podcast

Models

我们的使命:以负责任的方式构建 AI,造福人类

关于 Google DeepMind

责任

通过前瞻性安全防护确保 AI 安全,即使面对不断演化的威胁

了解更多

新闻

了解我们最新的 AI 突破、项目与动态

了解更多

招聘

我们正在寻找希望对世界产生真实、积极影响的人

了解更多

了解更多

Education National Partnerships for AI The Podcast

模型

Our mission is to build AI responsibly to benefit humanity

About Google DeepMind

Responsibility

Ensuring AI safety through proactive security, even against evolving threats

Learn more

News

Discover our latest AI breakthroughs, projects, and updates

Learn more

Careers

We’re looking for people who want to make a real, positive impact on the world

Learn more

Learn more

Education National Partnerships for AI The Podcast

Build with Gemini Try Gemini

[Google DeepMind

      DeepMind](https://deepmind.google/)

Google AI Learn about all our AI Google DeepMind Explore the frontier of AI Google Labs Try our AI experiments Google Research Explore our research

Products and apps

Gemini app Chat with Gemini Google AI Studio Build with our next-gen AI models Google Antigravity Our agentic development platform

Models Research Science About

Build with Gemini Try Gemini

March 10, 2026 Research

我们的使命:以负责任的方式构建 AI,造福人类

关于 Google DeepMind

责任

通过前瞻性安全防护确保 AI 安全,即使面对不断演化的威胁

了解更多

新闻

了解我们最新的 AI 突破、项目与动态

了解更多

招聘

我们正在寻找希望对世界产生真实、积极影响的人

了解更多

了解更多

Education National Partnerships for AI The Podcast

使用 Gemini 构建 试用 Gemini

Google DeepMind DeepMind

Google AI:了解我们的全部 AI Google DeepMind:探索 AI 的前沿 Google Labs:试用我们的 AI 实验 Google Research:探索我们的研究

产品与应用

Gemini app:与 Gemini 聊天 Google AI Studio:用我们的下一代 AI 模型构建 Google Antigravity:我们的智能体开发平台

模型 研究 科学 关于

使用 Gemini 构建 试用 Gemini

2026 年 3 月 10 日 研究

From games to biology and beyond: 10 years of AlphaGo’s impact

Demis Hassabis

Share

      Copied

Your browser does not support the video tag.

Ten years ago, our AI system AlphaGo became the first program to defeat a world champion at the complex game of Go – reaching a milestone in the field a decade before many experts thought possible.

The achievement heralded the beginning of what is now recognized as the modern era in artificial intelligence (AI). With a single creative play, the famous ‘Move 37,’ AlphaGo demonstrated the potential of AI and signaled that we now had the techniques to begin tackling real-world scientific problems.

Today, this breakthrough continues to inform our work building systems on the path to artificial general intelligence (AGI). We believe AGI will be the most profound technology ever invented and potentially the ultimate tool to advance science, medicine, and productivity.

从游戏到生物学及更远:AlphaGo 影响的十年

Demis Hassabis

分享

已复制

你的浏览器不支持 video 标签。

十年前,我们的 AI 系统 AlphaGo 成为首个在复杂的围棋比赛中击败世界冠军的程序——这一里程碑比许多专家当时认为可能实现的时间整整提前了十年。

这一成就宣告了如今被认为是现代人工智能(AI)时代的开端。凭借一次富有创造力的落子——著名的“第 37 手(Move 37)”,AlphaGo 展示了 AI 的潜力,并发出信号:我们已经拥有了开始攻克真实世界科学问题所需的技术。

今天,这一突破仍在持续塑造我们构建通往人工通用智能(AGI)之路上各类系统的工作。我们相信,AGI 将是人类迄今发明过的最深远的技术,并可能成为推动科学、医学与生产力跃迁的终极工具。

A creative spark

In 2016, over 200 million people watched AlphaGo face world-champion Go player Lee Sae Dol in Seoul. The match was defined by ‘Move 37’ in Game 2, a play so unconventional that professional commentators initially thought it was a mistake. But it proved to be decisive. One hundred or so moves later, the stone was in exactly the right position for AlphaGo to win the game. It was a display of incredible foresight and the AI system’s ability to go beyond mimicking human experts and find entirely new strategies.

Your browser does not support the video tag.

Watch the AlphaGo documentary on YouTube

Go has long been a proving ground for AI research because of the game’s sheer complexity. There are 10170 possible positions on the board—far more than the number of atoms in the observable universe.

To make the game tractable, AlphaGo used deep neural networks combined with advanced search and reinforcement learning – an AI approach DeepMind pioneered.

AlphaGo learned a model of plausible Go moves by first learning from games played by human experts, and then playing hundreds of thousands of games against itself, improving as the strongest winning strategies were reinforced. The system then considered only the most potentially fruitful paths and from that smaller subset of moves, found the one most likely to lead it to win.

After AlphaGo, we built AlphaGo Zero, which learned the game from completely random play and became arguably the strongest player in history. Then we generalized the system further with AlphaZero, which taught itself from scratch to master any 2-player perfect information game, including Go, chess, and shogi. Beginning with no prior knowledge other than the rules of the game, AlphaZero was able to learn to master chess in a matter of hours, and beat not only the top human players but the best specialised chess programs at the time, like Stockfish. And even though chess had been so heavily analysed with the aid of these programs, just as with Go, AlphaZero was still able to come up with interesting new strategies.

It was further proof of what I knew the moment we won the match in Seoul - the technology was ready to be applied to our real goal of accelerating scientific breakthroughs.

I believe the greatest lesson AlphaGo offered was a definitive preview of the AI era—proving it wasn’t some distant, vague future, but a reality arriving on our doorstep. It served as a "roadmap from the future," sending a clear signal to humanity about how the world was about to change.

Go Master Lee Sae Dol

Adjunct Professor at UNIST

创意火花

2016 年,超过 2 亿人观看了 AlphaGo 在首尔对阵世界冠军围棋选手李世石。比赛的转折点来自第二局中的“第 37 手(Move 37)”,这一步法极其反常,以至于职业解说起初认为它是失误。但事实证明,它是决定性的。大约一百手之后,那枚棋子恰到好处地落在了能让 AlphaGo 赢下对局的位置。这一幕展示了惊人的前瞻性,也体现了该 AI 系统能够超越对人类专家的模仿,找到全新的策略。

你的浏览器不支持 video 标签。

在 YouTube 上观看 AlphaGo 纪录片

围棋长期以来都是 AI 研究的试金石,因为它的复杂度极高。棋盘上可能的局面多达 10170 种——远远超过可观测宇宙中的原子数量。

为了让这项任务变得可处理,AlphaGo 将深度神经网络与高级搜索和强化学习结合起来——这是一种由 DeepMind 开创 的 AI 方法。

AlphaGo 先从人类专家对局中学习,建立起对“合理落子”的模型;随后又与自己进行几十万盘对弈,在不断强化最强的取胜策略中持续提升。系统随后只考虑最可能产生收益的路径,并在这更小的候选落子集合中,找到最可能引导它取胜的一步。

在 AlphaGo 之后,我们构建了 AlphaGo Zero,它从完全随机的对弈开始学习,最终成为史上或许最强的围棋选手。接着,我们将系统进一步泛化为 AlphaZero,它从零开始自学,掌握任何“二人、完美信息”类游戏,包括围棋、国际象棋和将棋。除了游戏规则之外不具备任何先验知识的 AlphaZero,能够在数小时内学会精通国际象棋,并击败不仅是顶尖人类棋手,还有当时最优秀的专用国际象棋程序,例如 Stockfish。即便国际象棋早已在这些程序的辅助下被深入分析,AlphaZero 仍然像在围棋中一样,提出了耐人寻味的新策略

这进一步证明了我在首尔获胜那一刻就确信的一点——这项技术已经准备好被用于我们更真实的目标:加速科学突破。

我认为,AlphaGo 带来的最大启示,是对 AI 时代的一次确定性预演——它证明这并非遥远而模糊的未来,而是正来到我们门口的现实。它像一份“来自未来的路线图”,向人类发出了关于世界即将如何改变的清晰信号。

围棋大师 李世石

UNIST 兼职教授

Catalyzing breakthroughs in science

By proving it could navigate the massive search space of a Go board, AlphaGo demonstrated the potential for AI to help us better understand the vast complexities of the physical world. We started by attempting to solve the protein folding problem, a 50-year grand challenge of predicting the 3D structure of proteins - information that is crucial for understanding diseases and developing new drugs.

In 2020, we finally cracked this longstanding scientific problem with our AlphaFold 2 system. From there, we folded the structures for all 200 million proteins known to science and made them freely available to scientists in an open-source database. Today, over 3 million researchers around the world use the AlphaFold database to accelerate their important work on everything from malaria vaccines to plastic-eating enzymes. And in 2024, it was the honor of a lifetime for John Jumper and I to be awarded the Nobel Prize in Chemistry for leading this project, on behalf of the entire AlphaFold team.

Since AlphaGo’s win, we’ve applied its groundbreaking approach to many other areas of science and mathematics, including:

Mathematical reasoning: The most direct descendant of AlphaGo’s architecture, AlphaProof learned to prove formal mathematical statements using a combination of language models and AlphaZero’s reinforcement learning and search algorithms. Alongside AlphaGeometry 2, it became the first system to achieve a medal-standard (silver) at the International Mathematical Olympiad (IMO), proving AlphaGo's methods could unlock advanced mathematical reasoning and laying the foundation for our most capable general models.

Gemini, our largest and most capable model, recently went even further. An advanced version of its Deep Think mode achieved gold-medal level performance at the 2025 IMO using an approach inspired by AlphaGo. Since then, Deep Think has been applied to even more complex, open-ended challenges across science and engineering.

Algorithm discovery: Just as AlphaGo searched for the best move in a game, our coding agent AlphaEvolve explores the space of computer code to discover more efficient algorithms. It had its own Move 37 moment when it found a novel way to multiply matrices, a fundamental mathematical operation powering nearly all modern neural networks. AlphaEvolve is now being tested on problems ranging from data center optimization to quantum computing.

Scientific collaboration: We are integrating the search and reasoning principles pioneered with AlphaGo into an AI co-scientist. By having agents 'debate' scientific ideas and hypotheses, this system acts as a collaborator capable of performing the rigorous thinking necessary to identify patterns in data and solve sophisticated problems. In validation studies at Imperial College London, it analyzed decades of literature and independently arrived at the same hypothesis about antimicrobial resistance that researchers had spent years developing and validating experimentally.

We’ve also used AI to better understand the genome, advance fusion energy research, improve weather prediction and more.

As impressive as our scientific models are, they are highly specialized. To achieve fundamental breakthroughs like creating limitless clean energy or solving diseases that we don’t understand today, we need general AI systems that can find underlying structure and connections between different subject areas, and help us to come up with new hypotheses like the best scientists do.

催化科学突破

通过证明自己能够在围棋棋盘那巨大的搜索空间中导航,AlphaGo 展示了 AI 有潜力帮助我们更好地理解物理世界的浩瀚复杂性。我们首先尝试解决蛋白质折叠问题——这是一项持续了 50 年的重大挑战:预测蛋白质的三维结构。这些信息对于理解疾病以及开发新药至关重要。

2020 年,我们终于用 AlphaFold 2 系统攻克了这一长期悬而未决的科学难题。随后,我们为科学界已知的全部 2 亿种蛋白质预测并“折叠”出了结构,并将其以开源数据库的形式免费提供给科研人员。如今,全球超过 300 万研究者使用 AlphaFold 数据库 加速重要工作,涵盖从疟疾疫苗到“吃塑料”酶等诸多领域。2024 年,John Jumper 和我代表整个 AlphaFold 团队,因领导这一项目而获颁诺贝尔化学奖,这对我们而言是一生的荣耀。

自 AlphaGo 获胜以来,我们已将其开创性方法应用到科学与数学的许多其他领域,包括:

数学推理:作为 AlphaGo 架构最直接的后继,AlphaProof 结合语言模型与 AlphaZero 的强化学习和搜索算法,学习证明形式化数学命题。它与 AlphaGeometry 2 一起,成为首个在国际数学奥林匹克(IMO)上达到奖牌标准(银牌)的系统,证明 AlphaGo 的方法能够解锁高级数学推理,并为我们最强大的通用模型奠定基础。

我们规模最大、能力最强的模型 Gemini,最近更进一步。其 Deep Think 模式的高级版本达成了 2025 年 IMO 的金牌级表现,所采用的方法也受到了 AlphaGo 的启发。从那以后,Deep Think 又被应用于科学与工程中更复杂、更开放式的挑战。

算法发现:正如 AlphaGo 在对局中搜索最佳落子一样,我们的代码智能体 AlphaEvolve 在计算机代码空间中探索,以发现更高效的算法。它也拥有自己的“第 37 手(Move 37)”时刻:找到了矩阵乘法的一种新方式——这是一种为几乎所有现代神经网络提供动力的基础数学运算。如今,AlphaEvolve 正在从数据中心优化到量子计算等一系列问题上接受测试。

科学协作:我们正将 AlphaGo 开创的搜索与推理原则整合进一位 AI 联合科学家 中。通过让多个智能体对科学观点与假设进行“辩论”,该系统成为一位协作者,能够执行严谨思考,以在数据中识别模式并解决复杂问题。在与 伦敦帝国理工学院 的验证研究中,它分析了数十年的文献,并独立得出了与研究人员花费多年提出、并通过实验验证的同一项关于抗微生物耐药性的假设。

我们也用 AI 来更好地理解基因组推进聚变能源研究改进天气预测 等等。

尽管我们的科学模型令人印象深刻,但它们高度专门化。要实现诸如创造无限清洁能源、或解决我们今天尚未理解的疾病这类根本性突破,我们需要能够发现不同学科之间底层结构与联系的通用 AI 系统,并像最优秀的科学家那样帮助我们提出新假设。

Future of intelligence

For an AI to be truly general, it needs to understand the physical world. We built Gemini to be multimodal from the beginning so it could understand not just language, but also audio, video, images and code to build a model of the world.

To think and reason across these modalities, the latest Gemini models use some of the techniques we pioneered with AlphaGo and AlphaZero.

The next generation of AI systems will also need to be able to call upon specialized tools. For example, if a model needed to know the structure of a protein it could use AlphaFold for that.

We think the combination of Gemini’s world models, AlphaGo’s search and planning techniques, and specialized AI tool use will prove to be critical for AGI.

True creativity is a key capability that such an AGI system would need to exhibit. Move 37 was a glimpse of AI’s potential to think outside the box, but true original invention will require something more. It would need to not only come up with a novel Go strategy, as AlphaGo impressively did, but actually invent a game as deep and elegant, and as worthy of study as Go.

Ten years after AlphaGo’s legendary victory, our ultimate goal is on the horizon. The creative spark first seen in Move 37 catalyzed breakthroughs that are now converging to pave the path towards AGI - and usher in a new golden age of scientific discovery.

Learn about AlphaGo Watch the AlphaGo documentary on YouTube

Related Posts

AlphaGo

Learn more

Exploring the mysteries of Go with AlphaGo and Chinas top players

April 2017 Research

Learn more

Innovations of AlphaGo

April 2017 Research

Learn more

AlphaGos next move

May 2017 Research

Learn more

Follow us

Sign up for updates on our latest innovations

I accept Google's Terms and Conditions and acknowledge that my information will be used in accordance with Google's Privacy Policy.

Sign up

Build AI responsibly to benefit humanity

Models

Gemini Nano Banana Gemini Audio Gemma Genie Lyria Veo

Research

Evals Breakthroughs Publications Responsibility

Science

AlphaFold AlphaGenome WeatherNext AlphaEarth

Products

Gemini app Google AI Studio Google Antigravity

Learn more

About News Careers National Partnerships for AI The Podcast

About Google

Google products

Privacy

Terms

Cookies management controls

智能的未来

要让 AI 真正通用,它需要理解物理世界。我们从一开始就将 Gemini 构建为多模态,使其不仅能理解语言,也能理解音频、视频、图像与代码,从而建立对世界的模型。

为了跨越这些模态进行思考与推理,最新的 Gemini 模型使用了我们在 AlphaGo 与 AlphaZero 中开创的一些技术。

下一代 AI 系统还需要能够调用专用工具。例如,如果模型需要知道某种蛋白质的结构,它就可以使用 AlphaFold 来获得答案。

我们认为,将 Gemini 的世界模型、AlphaGo 的搜索与规划技术,以及专用 AI 工具的使用相结合,将被证明是实现 AGI 的关键。

真正的创造力,是这样的 AGI 系统必须展现的一项关键能力。“第 37 手(Move 37)”让我们瞥见了 AI 跳出框架思考的潜力,但真正的原创发明还需要更多。它不仅要像 AlphaGo 那样提出一种新颖的围棋策略,更要能够真正发明出一款同样深邃、优雅、也同样值得研究的游戏——正如围棋本身。

在 AlphaGo 的传奇胜利十年之后,我们的终极目标已近在眼前。最初在“第 37 手(Move 37)”中闪现的创意火花,催化出一系列如今正在汇聚的突破,为通往 AGI 的道路铺平了前路——并开启科学发现的新黄金时代。

了解 AlphaGo 在 YouTube 上观看 AlphaGo 纪录片

相关文章

AlphaGo

了解更多

与 AlphaGo 及中国顶尖棋手一起探索围棋之谜

2017 年 4 月 研究

了解更多

AlphaGo 的创新

2017 年 4 月 研究

了解更多

AlphaGo 的下一步

2017 年 5 月 研究

了解更多

关注我们

订阅,获取我们最新创新的动态

我接受 Google 的条款与条件,并确认我的信息将依据 Google 的隐私政策 使用。

订阅

以负责任的方式构建 AI,造福人类

模型

Gemini Nano Banana Gemini Audio Gemma Genie Lyria Veo

研究

Evals Breakthroughs Publications Responsibility

科学

AlphaFold AlphaGenome WeatherNext AlphaEarth

产品

Gemini app Google AI Studio Google Antigravity

了解更多

关于 新闻 招聘 National Partnerships for AI The Podcast

关于 Google

Google 产品

隐私

条款

Cookie 管理控件

相关笔记

Learn, build, and plan anything](https://deepmind.google/models/gemini/) Nano Banana Create and edit detailed images Gemini Audio Talk, create and control audio

Specialized models

Veo Generate cinematic video with audio Imagen Generate high-quality images from text Lyria Generate high fidelity music and audio SynthID Watermark and identify AI content

World models embodied AI

Genie 3 Generate and explore interactive worlds Gemini Robotics Perceive, reason, use tools and interact

Open models

Gemma Build responsible AI applications at scale

Our latest AI breakthroughs and updates from the lab

Explore research

Breakthroughs

SIMA 2 An agent that plays, reasons, and learns with you Genie 3 Generate and explore interactive worlds AlphaGo Mastering the game of Go Gemini Robotics Perceive, reason, use tools and interact

Learn more

Evals Publications Responsibility

Unlocking a new era of discovery with AI

Explore science

Life sciences

AlphaFold Predict protein structures with high accuracy AlphaGenome Decode genetics to pinpoint diseases AlphaMissense Find root causes of rare genetic diseases

Climate and sustainability

AlphaEarth Foundations Map our planet in unprecedented detail WeatherNext Fast and accurate AI weather forecasting Weather Lab Test our experimental weather models

Our mission is to build AI responsibly to benefit humanity

About Google DeepMind

Responsibility

Ensuring AI safety through proactive security, even against evolving threats

Learn more

News

Discover our latest AI breakthroughs, projects, and updates

Learn more

Careers

We’re looking for people who want to make a real, positive impact on the world

Learn more

Learn more

Education National Partnerships for AI The Podcast

Models

Research

Science

About

Explore our next generation AI systems

Explore models

Gemini

Gemini Learn, build, and plan anything Nano Banana Create and edit detailed images Gemini Audio Talk, create and control audio

Specialized models

Veo Generate cinematic video with audio Imagen Generate high-quality images from text Lyria Generate high fidelity music and audio SynthID Watermark and identify AI content

World models embodied AI

Genie 3 Generate and explore interactive worlds Gemini Robotics Perceive, reason, use tools and interact

Open models

Gemma Build responsible AI applications at scale

Our latest AI breakthroughs and updates from the lab

Explore research

Breakthroughs

SIMA 2 An agent that plays, reasons, and learns with you Genie 3 Generate and explore interactive worlds AlphaGo Mastering the game of Go Gemini Robotics Perceive, reason, use tools and interact

Learn more

Evals Publications Responsibility

Unlocking a new era of discovery with AI

Explore science

Life sciences

AlphaFold Predict protein structures with high accuracy AlphaGenome Decode genetics to pinpoint diseases AlphaMissense Find root causes of rare genetic diseases

Climate and sustainability

AlphaEarth Foundations Map our planet in unprecedented detail WeatherNext Fast and accurate AI weather forecasting Weather Lab Test our experimental weather models

Our mission is to build AI responsibly to benefit humanity

About Google DeepMind

Responsibility

Ensuring AI safety through proactive security, even against evolving threats

Learn more

News

Discover our latest AI breakthroughs, projects, and updates

Learn more

Careers

We’re looking for people who want to make a real, positive impact on the world

Learn more

Learn more

Education National Partnerships for AI The Podcast

Models

Explore our next generation AI systems

Explore models

Gemini

Gemini Learn, build, and plan anything Nano Banana Create and edit detailed images Gemini Audio Talk, create and control audio

Specialized models

Veo Generate cinematic video with audio Imagen Generate high-quality images from text Lyria Generate high fidelity music and audio SynthID Watermark and identify AI content

World models embodied AI

Genie 3 Generate and explore interactive worlds Gemini Robotics Perceive, reason, use tools and interact

Open models

Gemma Build responsible AI applications at scale

Research

Our latest AI breakthroughs and updates from the lab

Explore research

Breakthroughs

SIMA 2 An agent that plays, reasons, and learns with you Genie 3 Generate and explore interactive worlds AlphaGo Mastering the game of Go Gemini Robotics Perceive, reason, use tools and interact

Learn more

Evals Publications Responsibility

Science

Unlocking a new era of discovery with AI

Explore science

Life sciences

AlphaFold Predict protein structures with high accuracy AlphaGenome Decode genetics to pinpoint diseases AlphaMissense Find root causes of rare genetic diseases

Climate and sustainability

AlphaEarth Foundations Map our planet in unprecedented detail WeatherNext Fast and accurate AI weather forecasting Weather Lab Test our experimental weather models

About

Our mission is to build AI responsibly to benefit humanity

About Google DeepMind

Responsibility

Ensuring AI safety through proactive security, even against evolving threats

Learn more

News

Discover our latest AI breakthroughs, projects, and updates

Learn more

Careers

We’re looking for people who want to make a real, positive impact on the world

Learn more

Learn more

Education National Partnerships for AI The Podcast

Build with Gemini Try Gemini

[Google DeepMind

      DeepMind](https://deepmind.google/)

Google AI Learn about all our AI Google DeepMind Explore the frontier of AI Google Labs Try our AI experiments Google Research Explore our research

Products and apps

Gemini app Chat with Gemini Google AI Studio Build with our next-gen AI models Google Antigravity Our agentic development platform

Models Research Science About

Build with Gemini Try Gemini

March 10, 2026 Research

From games to biology and beyond: 10 years of AlphaGo’s impact

Demis Hassabis

Share

      Copied

Your browser does not support the video tag.

Ten years ago, our AI system AlphaGo became the first program to defeat a world champion at the complex game of Go – reaching a milestone in the field a decade before many experts thought possible.

The achievement heralded the beginning of what is now recognized as the modern era in artificial intelligence (AI). With a single creative play, the famous ‘Move 37,’ AlphaGo demonstrated the potential of AI and signaled that we now had the techniques to begin tackling real-world scientific problems.

Today, this breakthrough continues to inform our work building systems on the path to artificial general intelligence (AGI). We believe AGI will be the most profound technology ever invented and potentially the ultimate tool to advance science, medicine, and productivity.

A creative spark

In 2016, over 200 million people watched AlphaGo face world-champion Go player Lee Sae Dol in Seoul. The match was defined by ‘Move 37’ in Game 2, a play so unconventional that professional commentators initially thought it was a mistake. But it proved to be decisive. One hundred or so moves later, the stone was in exactly the right position for AlphaGo to win the game. It was a display of incredible foresight and the AI system’s ability to go beyond mimicking human experts and find entirely new strategies.

Your browser does not support the video tag.

Watch the AlphaGo documentary on YouTube

Go has long been a proving ground for AI research because of the game’s sheer complexity. There are 10170 possible positions on the board—far more than the number of atoms in the observable universe.

To make the game tractable, AlphaGo used deep neural networks combined with advanced search and reinforcement learning – an AI approach DeepMind pioneered.

AlphaGo learned a model of plausible Go moves by first learning from games played by human experts, and then playing hundreds of thousands of games against itself, improving as the strongest winning strategies were reinforced. The system then considered only the most potentially fruitful paths and from that smaller subset of moves, found the one most likely to lead it to win.

After AlphaGo, we built AlphaGo Zero, which learned the game from completely random play and became arguably the strongest player in history. Then we generalized the system further with AlphaZero, which taught itself from scratch to master any 2-player perfect information game, including Go, chess, and shogi. Beginning with no prior knowledge other than the rules of the game, AlphaZero was able to learn to master chess in a matter of hours, and beat not only the top human players but the best specialised chess programs at the time, like Stockfish. And even though chess had been so heavily analysed with the aid of these programs, just as with Go, AlphaZero was still able to come up with interesting new strategies.

It was further proof of what I knew the moment we won the match in Seoul - the technology was ready to be applied to our real goal of accelerating scientific breakthroughs.

I believe the greatest lesson AlphaGo offered was a definitive preview of the AI era—proving it wasn’t some distant, vague future, but a reality arriving on our doorstep. It served as a "roadmap from the future," sending a clear signal to humanity about how the world was about to change.

Go Master Lee Sae Dol

Adjunct Professor at UNIST

Catalyzing breakthroughs in science

By proving it could navigate the massive search space of a Go board, AlphaGo demonstrated the potential for AI to help us better understand the vast complexities of the physical world. We started by attempting to solve the protein folding problem, a 50-year grand challenge of predicting the 3D structure of proteins - information that is crucial for understanding diseases and developing new drugs.

In 2020, we finally cracked this longstanding scientific problem with our AlphaFold 2 system. From there, we folded the structures for all 200 million proteins known to science and made them freely available to scientists in an open-source database. Today, over 3 million researchers around the world use the AlphaFold database to accelerate their important work on everything from malaria vaccines to plastic-eating enzymes. And in 2024, it was the honor of a lifetime for John Jumper and I to be awarded the Nobel Prize in Chemistry for leading this project, on behalf of the entire AlphaFold team.

Since AlphaGo’s win, we’ve applied its groundbreaking approach to many other areas of science and mathematics, including:

Mathematical reasoning: The most direct descendant of AlphaGo’s architecture, AlphaProof learned to prove formal mathematical statements using a combination of language models and AlphaZero’s reinforcement learning and search algorithms. Alongside AlphaGeometry 2, it became the first system to achieve a medal-standard (silver) at the International Mathematical Olympiad (IMO), proving AlphaGo's methods could unlock advanced mathematical reasoning and laying the foundation for our most capable general models.

Gemini, our largest and most capable model, recently went even further. An advanced version of its Deep Think mode achieved gold-medal level performance at the 2025 IMO using an approach inspired by AlphaGo. Since then, Deep Think has been applied to even more complex, open-ended challenges across science and engineering.

Algorithm discovery: Just as AlphaGo searched for the best move in a game, our coding agent AlphaEvolve explores the space of computer code to discover more efficient algorithms. It had its own Move 37 moment when it found a novel way to multiply matrices, a fundamental mathematical operation powering nearly all modern neural networks. AlphaEvolve is now being tested on problems ranging from data center optimization to quantum computing.

Scientific collaboration: We are integrating the search and reasoning principles pioneered with AlphaGo into an AI co-scientist. By having agents 'debate' scientific ideas and hypotheses, this system acts as a collaborator capable of performing the rigorous thinking necessary to identify patterns in data and solve sophisticated problems. In validation studies at Imperial College London, it analyzed decades of literature and independently arrived at the same hypothesis about antimicrobial resistance that researchers had spent years developing and validating experimentally.

We’ve also used AI to better understand the genome, advance fusion energy research, improve weather prediction and more.

As impressive as our scientific models are, they are highly specialized. To achieve fundamental breakthroughs like creating limitless clean energy or solving diseases that we don’t understand today, we need general AI systems that can find underlying structure and connections between different subject areas, and help us to come up with new hypotheses like the best scientists do.

Future of intelligence

For an AI to be truly general, it needs to understand the physical world. We built Gemini to be multimodal from the beginning so it could understand not just language, but also audio, video, images and code to build a model of the world.

To think and reason across these modalities, the latest Gemini models use some of the techniques we pioneered with AlphaGo and AlphaZero.

The next generation of AI systems will also need to be able to call upon specialized tools. For example, if a model needed to know the structure of a protein it could use AlphaFold for that.

We think the combination of Gemini’s world models, AlphaGo’s search and planning techniques, and specialized AI tool use will prove to be critical for AGI.

True creativity is a key capability that such an AGI system would need to exhibit. Move 37 was a glimpse of AI’s potential to think outside the box, but true original invention will require something more. It would need to not only come up with a novel Go strategy, as AlphaGo impressively did, but actually invent a game as deep and elegant, and as worthy of study as Go.

Ten years after AlphaGo’s legendary victory, our ultimate goal is on the horizon. The creative spark first seen in Move 37 catalyzed breakthroughs that are now converging to pave the path towards AGI - and usher in a new golden age of scientific discovery.

Learn about AlphaGo Watch the AlphaGo documentary on YouTube

Related Posts

AlphaGo

Learn more

Exploring the mysteries of Go with AlphaGo and Chinas top players

April 2017 Research

Learn more

Innovations of AlphaGo

April 2017 Research

Learn more

AlphaGos next move

May 2017 Research

Learn more

Follow us

Sign up for updates on our latest innovations

I accept Google's Terms and Conditions and acknowledge that my information will be used in accordance with Google's Privacy Policy.

📋 讨论归档

讨论进行中…