返回列表
🧠 阿头学 · 💬 讨论题 · 💰投资

自带 Agent:AI 应用的新架构之争

这篇文章判断 BYOA(自带 Agent)是未来主流架构之一,但在商业动力、安全合规和用户习惯上明显乐观偏头重脚,短期更像高端玩家与新产品的机会,而非全行业必然终局。
打开原文 ↗

2026-02-03 原文链接 ↗
阅读简报
双语对照
完整翻译
原文
讨论归档

核心观点

  • 智能应“跟着人走”,而不是“住在 App 里” 作者认为当前“每个 App 各造一个小 AI 助手”的模式导致上下文碎片化、体验低效,真正高价值的是一个长期了解你的个人 Agent,跨应用携带你的记忆与偏好,这一判断在体验层是成立的。
  • BYOA = App 提供环境和接口,智能由用户携带 Polylogue 作为案例,通过 MCP/Skills 等标准,让任意兼容 Agent 进入工作区读写文档、协作创作;App 专注场景和数据结构,Agent 负责智能,这种“环境 vs 智能”的职责划分结构清晰、技术路径可行。
  • 对 App 和生态的好处被强调,但商业阻力被低估 作者认为 App 不必自建 AI、只需当“好东道主”,还能吸引高端用户、获得增长;但这忽略了主流 SaaS 通过内建 AI 强化留存、做高毛利付费层的强烈动力,BYOA 与“自有 AI 护城河”在战略上存在根本冲突。
  • BYOA 目前只适合高级用户,但作者预言会在 2026 年底前普及 文中承认现在连接个人 Agent 仍需配置和理解技术细节,却断言 BYOA 很快会变成普遍模式;在缺乏 adoption 数据、用户行为证据情况下,这个时间表属典型乐观外推。
  • 放弃控制 AI 层“反而更值钱”的说法是反直觉但有条件成立 在模型高度同质化、用户已有强个人 Agent 的细分生态里,这个逻辑是成立的;但在大众 SaaS 市场和企业采购场景中,绝大多数玩家仍会优先控制智能入口和数据闭环,这点文章明显选择性忽略。

跟我们的关联

1. 对 ATou:产品与架构判断标尺 这篇文章为判断“要不要在工具里硬塞 Copilot”提供了一个清晰对立面:你可以刻意把产品设计成“Agent 的最佳宿主”,而不是“自带一个平庸助手的封闭系统”。下一步可以:

  • 盘点现有/规划中的产品:哪些更适合做 BYOA 宿主(强结构数据、工作流),哪些不适合(只是通用 AI 功能壳);
  • 优先为宿主型产品设计对 Agent 友好的 API、权限模型和“技能说明书”(类似 SKILL.md)。

2. 对 Neta:个人 Agent 与工作流布局信号 文中把“统一个人 Agent + BYOA 能力的 App”定义为下一阶段高端用户的主战场,这对你意味着:

  • 个人 Agent 路线不是“锦上添花”,而是未来一部分用户的“操作系统”;
  • 下一步可以在技术路线图里更明确地规划:支持 MCP/Skills、为主流 SaaS 设计可复用的接入适配层,以便在 BYOA 生态中占据标准或中枢位置。

3. 对 Uota:工作与协作方式的前置思考 BYOA 模式在团队里,等于“每个人带着自己的 Agent 来开工”,而不是整个团队只用一个平台的内建 AI。对你的启示是:

  • 未来项目协作可能是“人 + 人的 Agent + 团队通用 Agent”混合运作;
  • 下一步可以在内部试验:用一个统一 Agent 接业务系统(文档、任务、日历),验证体验是否真的优于在每个工具用独立 AI 插件。

4. 对 ATou/Neta:投资与战略扫描视角 文章实际上在给出一个筛选框架:

  • 看一家公司是不是在做“AI 功能”还是在做“Agent 宿主 / Agent 基础设施”;
  • 下一步可以把“是否对 Agent 友好(API、权限、MCP/Skills、第三方 Agent 接入案例)”作为评估新项目/标的的一项硬指标,同时警惕只会喊 BYOA 口号、但既无开放接口也无真实集成的伪概念公司。

讨论引子

1. 对多数用户和企业来说,“统一个人 Agent + BYOA”真的比“各 App 内建一个专用 AI”更现实、更安全吗?在哪些场景中答案会完全相反? 2. 站在一个 SaaS 创业者或大厂产品负责人的角度,如果允许用户自带 Agent,你到底是获得了更多用户价值,还是把自己的 AI 变现空间让给了模型厂商? 3. 如果我们相信“智能应该跟着用户走”,那在组织里应该优先统一的是“工具栈”(大家用同一个系统),还是“Agent 栈”(大家各有 Agent,但互相可协作)?

每个 App 都在造自己的 AI。这里一个助手,那里一个 Copilot,每个 SaaS 产品的角落里都塞了个聊天机器人。

一年前还觉得挺酷,现在觉得烦了。跟我们现在知道的 OpenClaw 能做到的相比,它们都差得远。

你打开 Notion,Notion 的 AI 帮你写。打开邮件客户端,它的 AI 帮你起草回复。每个都有自己的模型、自己的上下文窗口、自己对你的理解——准确地说,几乎完全不理解你。

与此同时,你可能已经有了一个真正了解你的 AI。也许是积累了几个月对话历史的 Claude。也许是像 Clawdbot 这样有权访问你的文件、日历、记忆的个人 Agent。那个真正拥有你的上下文的东西待在一个地方,而五十个不了解你的小聊天机器人散布在你用的每个 App 里。

架构搞反了。智能应该跟着用户走,而不是住在 App 里面。

有一种更好的方式:自带 Agent(Bring Your Own Agent,BYOA)

不再是 App 说「来,这是我们的 AI 助手」,而是 App 说「带上你信任的任何 Agent,我们给它接口来操作你在这里的东西」。

BYOA 模式是以应用为中心的:App 是你去的环境,但你带着自己的智能。

实践中的样子

Polylogue 是一个可运转的案例。它是一个协作写作与思考工具,用户可以把自己的 AI Agent 接入来处理文档。

连接 Agent 时,用户收到一份指令(格式化为 SKILL.md)——可以直接交给自己的 Agent,附带一个 API Key 让 Agent 直连工作区。

Agent 可以阅读文档、回复评论、起草内容、协作——不是因为 Polylogue 自己造了 AI,而是它把环境开放给了用户信任的任何 Agent。

这种集成也越来越容易构建。MCP 和 Skills 正在标准化 Agent 如何发现和交互应用级工具——把以前需要定制 API 对接的工作变成接近即插即用。一个暴露了 MCP 服务端或像 Polylogue 一样提供 Skill 文档的 App,可以让任何兼容的 Agent 立即接入。无需定制集成。

App 专注做好文档管理和协作。智能层是用户带来的。如果你有一个积累了数月上下文的个人 Agent——了解你的写作风格、你的项目、你的偏好——这些上下文会自动出现在 Polylogue 里,因为是你的 Agent 在干活。

这跟那些试图把你锁在内建 AI 里的竞品是根本不同的设计哲学。你不需要用你的笔记去训练 Polylogue 的 AI,也不需要把所有东西都交给它。接入你现有的 Clawdbot(或其他 Agent),你立刻就能用上所有那些上下文。

为什么这条路会赢

很容易看出为什么这应该成为主流模式。

对用户来说:一个真正了解你的 Agent,跨所有 App 运作。不用再反复解释自己。不用再面对五十个从零开始的聊天机器人。你的 Agent 携带你的上下文、偏好、历史——到处都是。

对 App 来说:你不需要自己造 AI 功能、不需要跟上模型军备竞赛。你只需要暴露正确的接口、做一个好东道主。专注你真正擅长的——领域专属体验——让用户的 Agent 处理智能层。

对生态来说:最早拥抱 BYOA 的 App 会吸引那些已经有成熟 Agent 的用户。而那些往往是高级用户、早期采用者——驱动增长的人。

前路

今天,BYOA 还是高级用户的玩法。你得有一个配置好的个人 Agent,懂得怎么连接,愿意做一些配置。但管道在快速变简单。MCP 在标准化工具接口。App 开始暴露为 Agent 交互而非仅为人类交互设计的 API。

BYOA App 现在可能只吸引最前沿的早期采用者,但以 AI 领域的推进速度,说它在 2026 年底前变得普遍并不夸张。

拥抱 BYOA 的 App 和团队会发现一个反直觉的优势:放弃对 AI 层的控制,反而让你变得更有价值,而不是更没价值。

它们成为你最好的 AI 真正运作的环境,进而成为更多工作发生的地方。

  • 原文:https://x.com/nateliason/status/2018352113860927648
  • 作者:Nat Eliason
  • 发布:2026-02-02
  • 翻译:Opus

Every app is building its own AI now. An assistant here, a copilot there, a chatbot in the corner of every SaaS product you use.


It was cool a year ago, but now it's annoying. They all kinda suck compared to what we now know is possible thanks to OpenClaw.

You open Notion, and Notion's AI helps you write. You open your email client, and its AI drafts replies. Each one has its own model, its own context window, its own understanding of who you are — which is to say, almost none.

每个 App 都在造自己的 AI。这里一个助手,那里一个 Copilot,每个 SaaS 产品的角落里都塞了个聊天机器人。

Meanwhile, you probably already have an AI that does know you. Maybe it's Claude with months of conversation history. Maybe it's a personal agent like Clawdbot that has access to your files, your calendar, your memory. The thing that actually has your context is sitting in one place while fifty little chatbots that don't have it are scattered across every app you touch.

一年前还觉得挺酷,现在觉得烦了。跟我们现在知道的 OpenClaw 能做到的相比,它们都差得远。

The architecture is backwards. The intelligence should follow the user, not live inside the app.

你打开 Notion,Notion 的 AI 帮你写。打开邮件客户端,它的 AI 帮你起草回复。每个都有自己的模型、自己的上下文窗口、自己对你的理解——准确地说,几乎完全不理解你。

There's a better way: Bring Your Own Agent.

与此同时,你可能已经有了一个真正了解你的 AI。也许是积累了几个月对话历史的 Claude。也许是像 Clawdbot 这样有权访问你的文件、日历、记忆的个人 Agent。那个真正拥有你的上下文的东西待在一个地方,而五十个不了解你的小聊天机器人散布在你用的每个 App 里。

Instead of the app saying "here's our AI assistant," apps should say "bring whatever agent you trust, and we'll give it access to work with your stuff here."

架构搞反了。智能应该跟着用户走,而不是住在 App 里面。

The BYOA pattern is app-centric: the app is the environment you go to, but you bring your own intelligence with you.

有一种更好的方式:自带 Agent(Bring Your Own Agent,BYOA)

What This Looks Like in Practice

不再是 App 说「来,这是我们的 AI 助手」,而是 App 说「带上你信任的任何 Agent,我们给它接口来操作你在这里的东西」。

Polylogue is a working example. It's a collaborative writing and thinking tool where users can bring their own AI agent to interface with their documents.

BYOA 模式是以应用为中心的:App 是你去的环境,但你带着自己的智能。

When connecting an agent, the user receives instructions (formatted as a SKILL.md) they can give to their agent along with an API key so the agent can connect directly to their workspace.

实践中的样子

The agent can read documents, respond to comments, draft content, and collaborate — not because Polylogue built its own AI, but because it opens its environment to whatever agent the user trusts.

Polylogue 是一个可运转的案例。它是一个协作写作与思考工具,用户可以把自己的 AI Agent 接入来处理文档。

This kind of integration is also getting easier to build. MCPs and Skills are formalizing how agents discover and interact with app-level tools — turning what used to require custom API work into something closer to plug-and-play. An app that exposes an MCP server, or provides Skill documentation like Polylogue, gives any compatible agent immediate access. No bespoke integration required.

连接 Agent 时,用户收到一份指令(格式化为 SKILL.md)——可以直接交给自己的 Agent,附带一个 API Key 让 Agent 直连工作区。

The app focuses on being great at document management and collaboration. The intelligence layer is whatever the user brings. If you have a personal agent with months of context about your writing style, your projects, your preferences — that context shows up in Polylogue automatically, because it's your agent doing the work.

Agent 可以阅读文档、回复评论、起草内容、协作——不是因为 Polylogue 自己造了 AI,而是它把环境开放给了用户信任的任何 Agent。

This is a fundamentally different design philosophy from competitors who are trying to lock you into their built-in AI. You don't need to train Polylogue's AI on your writing preferences or give it all your notes. By plugging in your existing Clawdbot (or other agent) you immediately have access to all of that.

这种集成也越来越容易构建。MCP 和 Skills 正在标准化 Agent 如何发现和交互应用级工具——把以前需要定制 API 对接的工作变成接近即插即用。一个暴露了 MCP 服务端或像 Polylogue 一样提供 Skill 文档的 App,可以让任何兼容的 Agent 立即接入。无需定制集成。

Why This Wins

App 专注做好文档管理和协作。智能层是用户带来的。如果你有一个积累了数月上下文的个人 Agent——了解你的写作风格、你的项目、你的偏好——这些上下文会自动出现在 Polylogue 里,因为是你的 Agent 在干活。

It's easy to see why this should become the dominant pattern moving forward.

这跟那些试图把你锁在内建 AI 里的竞品是根本不同的设计哲学。你不需要用你的笔记去训练 Polylogue 的 AI,也不需要把所有东西都交给它。接入你现有的 Clawdbot(或其他 Agent),你立刻就能用上所有那些上下文。

For users: One agent that actually knows you, operating across every app you use. No more re-explaining yourself. No more fifty chatbots that each start from scratch. Your agent carries your context, your preferences, your history — everywhere.

为什么这条路会赢

For apps: You don't need to build and maintain your own AI features. You don't need to keep up with the model race. You just need to expose the right interfaces and be a good host. Focus on what you're actually good at — the domain-specific experience — and let the user's agent handle the intelligence layer.

很容易看出为什么这应该成为主流模式。

For the ecosystem: The apps that embrace this early will attract the users who already have sophisticated agents. And those tend to be the power users, the early adopters, the people who drive adoption.

对用户来说:一个真正了解你的 Agent,跨所有 App 运作。不用再反复解释自己。不用再面对五十个从零开始的聊天机器人。你的 Agent 携带你的上下文、偏好、历史——到处都是。

The Road Ahead

对 App 来说:你不需要自己造 AI 功能、不需要跟上模型军备竞赛。你只需要暴露正确的接口、做一个好东道主。专注你真正擅长的——领域专属体验——让用户的 Agent 处理智能层。

Today, BYOA is a power user pattern. You need to have a personal agent set up, understand how to connect it, be willing to do some configuration. But the plumbing is getting simpler fast. MCP is standardizing tool interfaces. Apps are starting to expose APIs designed for agent interaction rather than just human interaction.

对生态来说:最早拥抱 BYOA 的 App 会吸引那些已经有成熟 Agent 的用户。而那些往往是高级用户、早期采用者——驱动增长的人。

BYOA apps may only appeal to bleeding edge early adopters now, but with how fast the AI space is moving, it's not unreasonable to think it could become commonplace by the end of 2026.

前路

Apps and teams that embrace it will find themselves with a counterintuitive advantage: by giving up control of the AI layer, they become more valuable, not less.

今天,BYOA 还是高级用户的玩法。你得有一个配置好的个人 Agent,懂得怎么连接,愿意做一些配置。但管道在快速变简单。MCP 在标准化工具接口。App 开始暴露为 Agent 交互而非仅为人类交互设计的 API。

They become the environment where your best AI actually works, and by extension, where more work happens.

BYOA App 现在可能只吸引最前沿的早期采用者,但以 AI 领域的推进速度,说它在 2026 年底前变得普遍并不夸张。

拥抱 BYOA 的 App 和团队会发现一个反直觉的优势:放弃对 AI 层的控制,反而让你变得更有价值,而不是更没价值。

Link: http://x.com/i/article/2018347415263117312

它们成为你最好的 AI 真正运作的环境,进而成为更多工作发生的地方。

Every app is building its own AI now. An assistant here, a copilot there, a chatbot in the corner of every SaaS product you use.

It was cool a year ago, but now it's annoying. They all kinda suck compared to what we now know is possible thanks to OpenClaw.

You open Notion, and Notion's AI helps you write. You open your email client, and its AI drafts replies. Each one has its own model, its own context window, its own understanding of who you are — which is to say, almost none.

Meanwhile, you probably already have an AI that does know you. Maybe it's Claude with months of conversation history. Maybe it's a personal agent like Clawdbot that has access to your files, your calendar, your memory. The thing that actually has your context is sitting in one place while fifty little chatbots that don't have it are scattered across every app you touch.

The architecture is backwards. The intelligence should follow the user, not live inside the app.

There's a better way: Bring Your Own Agent.

Instead of the app saying "here's our AI assistant," apps should say "bring whatever agent you trust, and we'll give it access to work with your stuff here."

The BYOA pattern is app-centric: the app is the environment you go to, but you bring your own intelligence with you.

What This Looks Like in Practice

Polylogue is a working example. It's a collaborative writing and thinking tool where users can bring their own AI agent to interface with their documents.

When connecting an agent, the user receives instructions (formatted as a SKILL.md) they can give to their agent along with an API key so the agent can connect directly to their workspace.

The agent can read documents, respond to comments, draft content, and collaborate — not because Polylogue built its own AI, but because it opens its environment to whatever agent the user trusts.

This kind of integration is also getting easier to build. MCPs and Skills are formalizing how agents discover and interact with app-level tools — turning what used to require custom API work into something closer to plug-and-play. An app that exposes an MCP server, or provides Skill documentation like Polylogue, gives any compatible agent immediate access. No bespoke integration required.

The app focuses on being great at document management and collaboration. The intelligence layer is whatever the user brings. If you have a personal agent with months of context about your writing style, your projects, your preferences — that context shows up in Polylogue automatically, because it's your agent doing the work.

This is a fundamentally different design philosophy from competitors who are trying to lock you into their built-in AI. You don't need to train Polylogue's AI on your writing preferences or give it all your notes. By plugging in your existing Clawdbot (or other agent) you immediately have access to all of that.

Why This Wins

It's easy to see why this should become the dominant pattern moving forward.

For users: One agent that actually knows you, operating across every app you use. No more re-explaining yourself. No more fifty chatbots that each start from scratch. Your agent carries your context, your preferences, your history — everywhere.

For apps: You don't need to build and maintain your own AI features. You don't need to keep up with the model race. You just need to expose the right interfaces and be a good host. Focus on what you're actually good at — the domain-specific experience — and let the user's agent handle the intelligence layer.

For the ecosystem: The apps that embrace this early will attract the users who already have sophisticated agents. And those tend to be the power users, the early adopters, the people who drive adoption.

The Road Ahead

Today, BYOA is a power user pattern. You need to have a personal agent set up, understand how to connect it, be willing to do some configuration. But the plumbing is getting simpler fast. MCP is standardizing tool interfaces. Apps are starting to expose APIs designed for agent interaction rather than just human interaction.

BYOA apps may only appeal to bleeding edge early adopters now, but with how fast the AI space is moving, it's not unreasonable to think it could become commonplace by the end of 2026.

Apps and teams that embrace it will find themselves with a counterintuitive advantage: by giving up control of the AI layer, they become more valuable, not less.

They become the environment where your best AI actually works, and by extension, where more work happens.

Link: http://x.com/i/article/2018347415263117312

📋 讨论归档

讨论进行中…