🧠 阿头学 · 💬 讨论题

前置部署工程师不是速成职业，而是 AI 落地期的关键杂交岗位

这篇文章对 FDE 的岗位价值判断基本成立，但“30 天入门”明显带有招聘营销色彩，真正有价值的是它提出的 AI 落地方法论，而不是速成承诺。
打开原文 ↗

2026-05-22 原文链接 ↗

阅读简报

双语对照

完整翻译

原文

讨论归档

核心观点

FDE 的价值真实存在 文章把 FDE 定义为连接业务流程、工程实现和高层 ROI 叙事的人，这个判断是对的，因为企业买的不是模型本身，而是可验证的效率提升与风险可控的部署结果。
最强能力不是做 agent，而是判断哪里不该用 AI 文中关于 agent / 传统代码 / 人工的三分法很有操作性：规则清晰且输入多变再用 agent，规则和输入都稳定就直接写代码，依赖隐性经验就保留人工，这比“万物 agent 化”靠谱得多。
AI 落地的主线不是 demo，而是 Audit → Evals → Deployment 这套三段式方法抓住了企业部署的核心顺序：先识别高 ROI 流程，再建立可信评估，最后渐进上线；这比先做炫技原型再硬推生产环境成熟得多。
Evals 的本质是商业信任工具 文章强调 eval 不是单纯技术测试，而是帮助高管理解“这东西到底值不值钱”的证据系统，这个判断非常关键，因为多数企业 AI 项目死于信任不足，而不是模型完全不可用。
“30 天成为 FDE”说法站不住脚 作者前面把 FDE 描述成百万美元级复合人才，后面又给出 30 天路线图，这不是严格自洽的职业判断，更像降低门槛的招聘文案；更准确的说法应该是“30 天建立候选人雏形”，而不是“成为 FDE”。

跟我们的关联

对 ATou 意味着什么、下一步怎么用 ATou 如果在看 AI agent 或应用层机会，应该把注意力从“模型多强”转向“流程是否高频、是否可评估、是否能渐进放权”；下一步可以直接用文中的三分法筛项目，砍掉低频炫技型自动化。
对 Neta 意味着什么、下一步怎么用 Neta 如果在搭产品或内容框架，这篇最值得吸收的是“eval 是信任产品”的判断；下一步可以把任何 AI 方案拆成 workflow、golden set、评分维度、上线权限四层，而不是只讲功能。
对 Uota 意味着什么、下一步怎么用 Uota 如果关注组织与角色演化，这篇说明未来贵的不是纯工程师，也不是纯顾问，而是能把业务痛点翻成系统方案的人；下一步可以围绕“业务理解 + 工程实现 + 结果表达”去定义人才模型或训练标准。
对投资/公司判断意味着什么、下一步怎么用 这篇隐含一个重要判断：Applied AI 公司的护城河不只在模型接入，而在交付方法、行业 workflow 和 eval 资产；下一步看项目时应追问它是靠人海驻场挣钱，还是能把 FDE 经验产品化。

讨论引子

1. FDE 现在是长期核心岗位，还是一个会被平台化工具逐步压缩的过渡角色？ 2. 企业 AI 落地的真正瓶颈到底是模型能力不足，还是组织内部缺少能做 Audit 和 Evals 的人？ 3. “驻场”对 AI 部署到底是必要条件，还是 Palantir 式叙事被过度神化了？

读完这篇文章后，你应该会明白，为什么 Anthropic、OpenAI、Google 和其他 AI 公司都在寻找 FDE，以及你该如何抓住这波需求。

这份工作我自己做过，也亲手招聘过一些世界上最优秀的人，帮 Varick 搭起我们的 FDE 体系。我也发现，关于这个当下科技圈最火岗位的真正权威指南，其实并不存在。这篇就是那份指南。

为什么 AI 公司需要 FDE

想成为 FDE，第一步是先真正理解，为什么 AI 公司会如此迫切地需要这类人。

如果你相信智能正在变成一种商品，那么顺理成章的结论就是，唯一的竞争优势只剩下你如何使用它，以及你把它用在什么地方。甚至可以说，单有智能本身，并不存在竞争优势。因此，决定公司该如何、又该在何处使用智能，才成了最重要的角色，而这正是前置部署工程师的职责。

企业会雇佣像 Varick 这样的 Applied AI 公司，由它们派出 FDE，帮助企业把技术价值尽可能榨出来。这样做的好处是，企业获得的是一支已经做过大规模 AI 转型的团队。这会让客户比竞争对手快得多，也由此带来巨大的效率提升。

FDE 是一类能力极强的工程师。他们既能极深地理解客户的问题，也能在一个自己可能从未见过的代码库里直接写代码，还能把业务影响讲给不懂技术的决策者听，从而推动成交。这是一个价值百万美元的岗位。

这个岗位需要什么

做 FDE，意味着你得和客户在同一个现场。Palantir 的 CTO 说过，如果你不真正身处一个环境里，就不可能为那个环境打造产品。我们内部也看到了同样的情况。

FDE 这个词本身就起源于 Palantir，而他们对驻场这件事极其认真。2010 年，他们曾与阿富汗的一支特种部队合作。特种部队白天执行任务，拿到反馈后再转给 FDE，FDE 则在夜里把代码发出去。

身处现场，对于部署军事软件来说是必要的；对于部署 AI，同样如此。要想看到真正的效率提升，公司就必须从底层开始围绕 AI 重新构建。而这件事，只有通过和客户坐在一起，基于公司的专属数据和专属语境去打造定制化 agent，才有可能实现。

关于这个岗位

在我们看来，Applied AI 的 FDE 工作主要分成三部分，分别是 Audit、Evals 和 Deployment。下面逐一拆开说。

Audit： 你会驻场客户，梳理公司不同团队里的流程和工作流。比如，和营收运营团队待两周，和采购团队待一周，再和财务团队待整整一个月。

和每个团队一起工作时，你会学到几件事，他们的工作长什么样，瓶颈在哪里，以及你可能在哪些地方打造出真正有价值的 agent。

除了理解公司里每个团队的工作流，审计阶段还有一个重要任务，就是判断哪些东西该自动化，哪些不该。因为有一个界限，超过那个界限后，agent 制造的问题可能会比它解决的问题还多。

下面有三个通用原则，可以帮助你做判断。

如果一个工作流可以被提炼成规则，但输入是变化的，比如这次输入是一封邮件，下次可能是 PDF，再下一次可能是扫描图像，而且这项工作还涉及调用工具，那就适合放一个 agent 进去。如果规则和输入都可预测，代码会更快也更便宜。如果决策依赖模式识别和领域经验，那就继续保留人工处理。

如果一个 agent 一个月只跑五次，客户是拿不到好 ROI 的。你要找的是那些流程长、量又大的自动化场景。必须有足够的量，才值得做。

在构建 agent 时，不要过度使用 AI。大多数自动化任务，其实都可以用一串工具调用，再加一次对 LLM 的调用，把它当作编排层来完成。AI 用得太多，只会带来不必要的 token 成本，而这种成本在规模化之后会被不断放大，输出质量也往往更差。

Audit 阶段的最后一部分是做原型。想了解怎么构建一个 agent，可以去看 Agents 101。想把这个 agent 从演示版带到生产环境，可以去看 Agents 102。

Evals： 如果一个客户要为 AI 部署花上数百万美元，他们就必须知道它到底有没有在起作用。为此，FDE 需要构建细致的 eval。

一个好的 eval，不只是检查 agent 最终给出的答案对不对，还会验证这个 AI 的思考方式是否像人类。要做到这点，可以做两件事。

追踪人的处理步骤，并在每一步给 AI 打分。人不是一步就把问题解决掉的，而是一个多步骤过程。把这些步骤画出来，再看 AI 是否也在沿途打中了同样的检查点。

先从少量但高质量的目标结果样例开始，然后用它们去衡量其他一切。如果你在做一个客服 agent，就和真人坐在一起，先搞清楚面对某个用户问题时，什么才叫最好的回答。针对几类任务反复做几次。这样你就知道了什么叫好，也就能拿这个标准去要求 agent。

Evals 是向客户证明价值的方式。虽然人人都说公司想要 AI，但仍然有很多人怀疑它到底有没有用。一个好的 agent 评估，正是高管建立信任、相信 agent 能带来 ROI 所需要的东西。

Deployment： 避免做大规模数据迁移。更好的做法是，在现有数据层之上构建 API，比如 SharePoint 或数据库，然后在上层放一个模型作为编排器，去查询这些数据。这样能省时间、省钱，更重要的是，能让你避开拆掉现有系统这场残酷噩梦。我们的客户已经花了数百万美元、用了好几年，才迁移到他们最新的 ERP。现在他们最不想做的事，就是再换一次。

当上面的事情都完成后，再创建一个执行环境，用来安全地测试 agent。这是直接建在公司基础设施里的沙箱，这样你就能在进入生产环境之前，先运行、测试并调试 agent。

准备转到生产环境时，节奏要慢。先拿一个小工作流，让它跑通，再逐步叠加新的能力。比如，先做一个能发现 bug、做调查、并写出一张工单，总结它认为哪里出了问题的 agent。如果这个能稳定工作，再进一步给它写代码和推送 PR 的能力。

先从最小的自主单元开始，然后再逐步给它行动能力。

这就是 FDE 如何从 Audit 走到 Deployment。学会这些步骤，本身就已经是这份工作的全部内容了。

如何在 30 天内成为 FDE

通常来说，最容易在 FDE 岗位上做出成绩的人，主要来自三种背景，顾问、产品经理和软件工程师。即使你不属于这三类中的任何一种，只要照着本节末尾的 30 天路线图去做，拿到这个岗位的概率也会成倍提高。你可以一边投递和面试，一边并行完成这些事。

顾问/产品经理

如果你是顾问或产品经理，你大概率已经具备把数据翻译成 ROI 的能力。这已经是工作的一半了。但这类背景的人最大障碍，往往是缺少工程经验。

一个高质量作品集可以缓解这个问题。从下面这些 side project 里选两个，全力做深：

一个可以直接用于生产环境的 AI agent，它能完整执行你过去在工作中原本要手动完成的整套流程。它应该能调用 API，能自主记录自己的思考过程，还要有失败处理机制。
一个建立在某个数据集之上的 RAG pipeline。数据集可以按你想切入的行业来选，比如法律文件、医疗记录、财务披露等等。
一个你自己搭建的 eval framework，能够从多个维度给 agent 输出打分，比如正确性、格式、成本、延迟，并且适用于不同业务流程，比如采购、应付账款等等。
一个 MCP，让你能把 LLM 接到那些目前还不支持 AI 集成的遗留软件上。

不要把自己的理解外包给 AI。只要一步一步来，这些概念其实都能弄明白。这也是为什么标题写的是 30 天，而不是 30 分钟。

软件工程师

可以说，做 FDE 最重要的部分就是沟通。你必须把 AI 能做什么、不能做什么，翻译成一个非技术 VP 也能听懂的东西。如果做不到这一点，就做不了 FDE。

软件工程师也应该构建和顾问/产品经理部分提到的那些类似项目，但你还得把自己做出来的每一个组成部分都讲清楚。技术栈、结果、你做过哪些迭代、带来了什么业务结果。最重要的是，你一开始为什么要做这些 agent，想解决的痛点是什么，以及如果放到真实客户沟通里，这件事会怎样展开。

不论背景如何都适用的 30 天路线

如果想要更具体一点，可以照着下面这个 30 天计划来，它几乎能让你为各种情况做好准备。

Checkpoint 1（第 7 天）：

什么是 agent，以及 agent loop 是怎么运作的。去读 Anthropic 的 Building Effective Agents，然后写一个脚本，把这个 loop 跑起来，prompt → model → response → next step。
怎样让一个 agent 调用工具。用 Anthropic/OpenAI 的工具使用教程，加上两个工具调用，一个 API 调用，一个网页搜索。
怎样建立正确的 guardrails。在任何结果到达用户之前，加上输入校验、最大步数限制，以及输出过滤。
什么时候该用 context window，什么时候该用外部 memory。默认优先用 context，除非状态需要在一次运行之后继续持久存在。
什么是 audit trail，以及怎么搭。把每次 prompt、工具调用和响应都连同时间戳记下来。这能帮助你发现并标记 agent 的错误。

Checkpoint 2（第 14 天）：

怎样强制结构化输出。始终返回 JSON。去读 OpenAI 的开发者页面。
怎样把 demo 带到生产环境，以及通常会在哪里出问题。去读 Agents 102。
怎样做 checkpoint。每隔 n 步把 agent 状态保存到文件里，这样它就能从上一个 checkpoint 重新启动。

Checkpoint 3（第 21 天）：

重试逻辑和指数退避是怎么工作的。每一次外部调用都需要重试。失败后分别等待 1 秒、2 秒、4 秒、8 秒，最高封顶 16 秒。
部署 agent 时如何优化成本。三件事，用更便宜的模型处理便宜的子任务，像 Opus 这种模型只该用于推理；缓存常见 prompt；限制最大 token 数。并跟踪每次查询的成本。
怎样为 eval 构建 golden dataset。先从 20 个真实查询开始，由你自己标注完美输出。Anthropic 的 Demystifying evals for AI agents 已经把这件事讲得很完整。
多 agent pipeline 和并行架构是怎么工作的。当一个 agent 扛不住整项工作时，就把任务拆开。一个负责规划，其他负责执行，再由一个负责综合。

Checkpoint 4（最后一周）：

把上面的内容全部复习一遍，并且大声讲出来。尽可能把所有内容都和业务指标联系起来。

TLDR

FDE 是当下科技行业需求最旺盛的岗位。每家公司都需要 AI，但没人知道该怎么把它真正部署出来。

这份工作分成三个阶段，audit、evals 和 deployment。你的任务就是理解每个阶段，以及它各自存在的意义。

决定性因素是你的作品集，以及你讲述它的能力。去构建 agent、RAG pipeline、eval framework、MCP 等等，更重要的是，你必须能够自信地讲清楚，你正在构建的每样东西背后的业务使用场景是什么。

缺乏沟通能力，对 FDE 岗位来说是致命问题。如果你没法向一个非技术决策者解释 AI 能做什么、不能做什么，那就不会有部署发生。

要知道什么时候 AI 不是答案。这会帮助你赢得客户信任，更重要的是，也能让生产环境中的 agent 真正做出 ROI。

把这些步骤做好，你成功进入这个领域的概率会成倍提升。

如果你想加入旧金山增长最快的 Applied AI 公司，Varick 正在招聘。我们正在打造硅谷最精英的 FDE 团队，由 Citadel Securities 前 COO 领衔。可直接在 https://www.varickagents.com/careers 申请。

By the end of this article, you should understand why Anthropic, OpenAI, Google, and other AI companies are looking for FDEs, and how you can capitalize on this demand.

读完这篇文章后，你应该会明白，为什么 Anthropic、OpenAI、Google 和其他 AI 公司都在寻找 FDE，以及你该如何抓住这波需求。

I've done the job myself, hired some of the best in the world to build out our FDE motion at Varick, and noticed there's no real definitive guide to the hottest role in tech right now. This is that guide.

Why do AI companies need FDEs

为什么 AI 公司需要 FDE

To become an FDE, the first step is to internalize why AI companies are in desperate need of them.

想成为 FDE，第一步是先真正理解，为什么 AI 公司会如此迫切地需要这类人。

If you believe intelligence is becoming commoditized, it then follows that the only competitive edge is how and where you use it. In fact, I will go so far as to say that there is no competitive advantage in intelligence alone. Therefore, determining how and where companies use it becomes the most important role and that is the role of a forward deployed engineer.

Businesses hire an Applied AI company (like Varick) that deploys FDEs to help them get the most out of the technology. Doing this gives them access to a team that has already done large-scale AI transformations that make clients move much faster than their competitors, and by doing so, yields massive efficiency gains.

The FDE is a highly skilled engineer who can understand the customer's problems very deeply, write code into a code base they've potentially never seen before, and communicate the business impact to a non-technical decision maker to close the deal. This is a million-dollar hire.

What the role requires

这个岗位需要什么

Being an FDE requires you to be on-site with a customer. Palantir's CTO says that you cannot build products for an environment without actually being in the environment itself. We've seen the same thing internally.

The term FDE actually originated from Palantir, and they took being on-site very seriously. In 2010, they worked with the Special Forces group in Afghanistan. The Special Forces would go on the mission in the day, get feedback, and route it to the FDEs who would ship code during the night.

Being in the environment is as necessary for deploying military software as it is for deploying AI. In order to see real efficiency gains, a company needs to be rebuilt around AI from the ground up. And that is only possible through sitting with the customer and building custom agents that are engineered on company-specific data, with company-specific context.

About the role

关于这个岗位

In our view, there are three main parts of an Applied AI FDE's job: Audit, Evals, and Deployment. Let's break down each one.

在我们看来，Applied AI 的 FDE 工作主要分成三部分，分别是 Audit、Evals 和 Deployment。下面逐一拆开说。

Audit: You're onsite with a client, mapping processes/workflows in different teams within the company. For example: two weeks with rev ops, one week with procurement, and a full month with finance.

Audit： 你会驻场客户，梳理公司不同团队里的流程和工作流。比如，和营收运营团队待两周，和采购团队待一周，再和财务团队待整整一个月。

With each team you sit with, you learn a few things: what their job looks like, where the bottlenecks are, and where you might create agents that deliver value.

和每个团队一起工作时，你会学到几件事，他们的工作长什么样，瓶颈在哪里，以及你可能在哪些地方打造出真正有价值的 agent。

Along with understanding the workflows of each team in the company, an important part of the audit phase is determining what should be automated vs what shouldn't. There's a point where agents can create more problems than they solve.

Here are three general principles you can follow to help you decide.

下面有三个通用原则，可以帮助你做判断。

If a workflow can be distilled into rules but the inputs are different (one input is an email, the next could be a PDF, the next is a scanned image), and the work involves calling tools, put an agent in. If the rules and inputs are both predictable, code is faster and cheaper. If the decision needs pattern recognition and domain expertise, leave it manual.

Your clients aren't going to get good ROI if the agent runs five times a month. Look for lengthy, high-volume automations. There needs to be enough volume to matter.

如果一个 agent 一个月只跑五次，客户是拿不到好 ROI 的。你要找的是那些流程长、量又大的自动化场景。必须有足够的量，才值得做。

Don't overuse AI when building your agents. Most automation tasks can be done with a series of tool calls and just one call to an LLM as an orchestrating layer. Too much AI leads to unneeded token costs (which compound at scale) and often a lower-quality output.

The final part of the Audit phase is prototyping. See Agents 101 to learn how to build an agent, and Agents 102 to take that agent from demo to production.

Audit 阶段的最后一部分是做原型。想了解怎么构建一个 agent，可以去看 Agents 101。想把这个 agent 从演示版带到生产环境，可以去看 Agents 102。

Evals: If a customer is spending millions on an AI deployment, they need to know it's working. To do that, an FDE builds detailed evals.

Evals： 如果一个客户要为 AI 部署花上数百万美元，他们就必须知道它到底有没有在起作用。为此，FDE 需要构建细致的 eval。

A good eval doesn't just check if the final answer an agent gives you is correct, but also verifies the AI is thinking like a human would. In order to do that, do two things:

一个好的 eval，不只是检查 agent 最终给出的答案对不对，还会验证这个 AI 的思考方式是否像人类。要做到这点，可以做两件事。

Trace the human's steps and grade the AI on each one: A human doesn't solve problems in one move. It's a multi-step process. Map out those steps and see if the AI is hitting the same checkpoints along the way.

Start small with great examples of the intended outcome, then measure everything against them: If you're building a customer support agent, sit with a human and figure out what the best possible answer to a user's query is. Repeat that a few times over a few tasks. Now you know what "great" looks like and can hold the agents to that standard.

Evals prove value to the customer. While everyone says they want AI in their company, there are still many who are skeptical of whether it works. A good agent evaluation is what an executive needs to trust the agent will provide ROI.

Deployment: Avoid large-scale data migrations. Instead, build APIs over an existing data layer (SharePoint or databases) and place a model on top as an orchestrator to query through it. This saves time and money, and more importantly, saves you from the brutal nightmare of ripping out your existing systems. Our clients have spent millions of dollars and multiple years migrating to their latest ERP. The last thing they want to do is replace it again.

Once all of the above is completed, create an execution environment to test the agent safely. This is a sandbox directly in the company's infra so you can run, test, and debug the agent before hitting production.

When moving to production, start slow. Take a small workflow, get it to work, and then layer on additional capabilities. For example, start with an agent that catches bugs, investigates, and writes a ticket summarizing what it thinks went wrong. If that works, only then give it the capability of writing code and pushing a PR.

Start with the smallest unit of autonomy; only then give it the capability to take action.

先从最小的自主单元开始，然后再逐步给它行动能力。

That is how you go from an audit to deployment as an FDE. Learning these steps is the entire job in itself.

这就是 FDE 如何从 Audit 走到 Deployment。学会这些步骤，本身就已经是这份工作的全部内容了。

How to become an FDE in 30 days

如何在 30 天内成为 FDE

There are typically three backgrounds that find the most success as an FDE: Consultants, Product Managers, and Software Engineers. Even if you don't fall into any of these, following the 30-day roadmap at the end of this section will exponentially increase your chances of getting the role. Do these in parallel with applying and interviewing.

Consultants/PMs

顾问/产品经理

As a consultant or PM, you should already have the ability to translate data into ROI. That is half the job. But the biggest roadblock for someone with this background is a lack of engineering experience.

如果你是顾问或产品经理，你大概率已经具备把数据翻译成 ROI 的能力。这已经是工作的一半了。但这类背景的人最大障碍，往往是缺少工程经验。

A high-quality portfolio can mitigate this. Pick two of these side projects and go all in:

一个高质量作品集可以缓解这个问题。从下面这些 side project 里选两个，全力做深：

A production-ready AI agent that can execute an entire process that you used to do manually in your old job. It should be able to call APIs, log its thinking autonomously, and have a failure harness.

一个可以直接用于生产环境的 AI agent，它能完整执行你过去在工作中原本要手动完成的整套流程。它应该能调用 API，能自主记录自己的思考过程，还要有失败处理机制。

A RAG pipeline built on top of a dataset (choose a custom dataset for whatever industry you're trying to break into: legal docs, medical records, financial filings, etc.).

一个建立在某个数据集之上的 RAG pipeline。数据集可以按你想切入的行业来选，比如法律文件、医疗记录、财务披露等等。

An eval framework you built yourself that scores an agent's outputs across multiple dimensions (correctness, format, cost, latency) for different business processes (procurement, accounts payable, etc.).

一个你自己搭建的 eval framework，能够从多个维度给 agent 输出打分，比如正确性、格式、成本、延迟，并且适用于不同业务流程，比如采购、应付账款等等。

An MCP where you can connect an LLM to legacy software that doesn't currently support AI integration.

一个 MCP，让你能把 LLM 接到那些目前还不支持 AI 集成的遗留软件上。

Do not outsource your understanding to AI. If you take it step by step, these concepts should be quite understandable. There's a reason why this is titled 30 days, not 30 minutes.

不要把自己的理解外包给 AI。只要一步一步来，这些概念其实都能弄明白。这也是为什么标题写的是 30 天，而不是 30 分钟。

Software Engineers

软件工程师

Arguably the most important part of being an FDE is communication. You need to translate what AI can and can't do into something that makes sense to a non-technical VP. If you can't do that, you can't be an FDE.

可以说，做 FDE 最重要的部分就是沟通。你必须把 AI 能做什么、不能做什么，翻译成一个非技术 VP 也能听懂的东西。如果做不到这一点，就做不了 FDE。

SWEs should build projects similar to those mentioned in the consulting/PM section, but explain every single component of what you just built. The tech stack, results, iterations you made, business outcomes. Most importantly, you need to have a reason for building those agents in the first place: what was the pain point you're solving for, and how might this go down in a real client interaction?

30-day outline regardless of role

不论背景如何都适用的 30 天路线

For something more concrete, follow this 30-day plan that will prepare you for almost anything:

如果想要更具体一点，可以照着下面这个 30 天计划来，它几乎能让你为各种情况做好准备。

Checkpoint 1 (7 days in):

Checkpoint 1（第 7 天）：

What is an agent and how an agent loop works: read Anthropic's Building Effective Agents, then write a script that runs the loop: prompt → model → response → next step.

什么是 agent，以及 agent loop 是怎么运作的。去读 Anthropic 的 Building Effective Agents，然后写一个脚本，把这个 loop 跑起来，prompt → model → response → next step。

How to make an agent call a tool: add two tool calls (an API call and a web search) using the Anthropic/OpenAI tool use tutorials.

怎样让一个 agent 调用工具。用 Anthropic/OpenAI 的工具使用教程，加上两个工具调用，一个 API 调用，一个网页搜索。

How to build proper guardrails: add input validation, a max-step limit, and output filtering before anything reaches the user.

怎样建立正确的 guardrails。在任何结果到达用户之前，加上输入校验、最大步数限制，以及输出过滤。

When to use context window vs external memory: default to context unless state needs to persist longer than the run.

什么时候该用 context window，什么时候该用外部 memory。默认优先用 context，除非状态需要在一次运行之后继续持久存在。

What is an audit trail and how to build one: log every prompt, tool call, and response with timestamps. This helps find and flag agent errors.

什么是 audit trail，以及怎么搭。把每次 prompt、工具调用和响应都连同时间戳记下来。这能帮助你发现并标记 agent 的错误。

Checkpoint 2 (14 days in):

Checkpoint 2（第 14 天）：

How to enforce structured outputs: always return JSON. Read through OpenAI's developer page.

怎样强制结构化输出。始终返回 JSON。去读 OpenAI 的开发者页面。

How to take a demo to prod and what usually breaks: Read Agents 102.

怎样把 demo 带到生产环境，以及通常会在哪里出问题。去读 Agents 102。

How to checkpoint: save agent state every n steps to a file so it can restart from the last checkpoint.

怎样做 checkpoint。每隔 n 步把 agent 状态保存到文件里，这样它就能从上一个 checkpoint 重新启动。

Checkpoint 3 (21 days in):

Checkpoint 3（第 21 天）：

How retry logic and exponential backoff work: every external call needs retries. On failure, wait 1s, 2s, 4s, 8s, cap at 16s.

重试逻辑和指数退避是怎么工作的。每一次外部调用都需要重试。失败后分别等待 1 秒、2 秒、4 秒、8 秒，最高封顶 16 秒。

How to optimize cost when deploying agents: three things: cheaper models for cheap subtasks (Opus should be used only for reasoning), cache common prompts, cap max tokens. Track cost per query.

部署 agent 时如何优化成本。三件事，用更便宜的模型处理便宜的子任务，像 Opus 这种模型只该用于推理；缓存常见 prompt；限制最大 token 数。并跟踪每次查询的成本。

How to build a golden dataset for evals: start with 20 real queries, label the perfect output yourself. Anthropic's "Demystifying evals for AI agents" covers everything.

怎样为 eval 构建 golden dataset。先从 20 个真实查询开始，由你自己标注完美输出。Anthropic 的 Demystifying evals for AI agents 已经把这件事讲得很完整。

How multi-agent pipelines and parallel architectures work: split the job when one agent can't handle it. One plans, others execute, one synthesizes.

多 agent pipeline 和并行架构是怎么工作的。当一个 agent 扛不住整项工作时，就把任务拆开。一个负责规划，其他负责执行，再由一个负责综合。

Checkpoint 4 (Final week):

Checkpoint 4（最后一周）：

Review the above and communicate everything out loud. Tie everything you can to business metrics.

把上面的内容全部复习一遍，并且大声讲出来。尽可能把所有内容都和业务指标联系起来。

TLDR

The FDE is the most in-demand role in tech right now. Every company needs AI, but nobody knows how to deploy it.

FDE 是当下科技行业需求最旺盛的岗位。每家公司都需要 AI，但没人知道该怎么把它真正部署出来。

The job has three phases (audit, evals, deployment). Your job is to understand each phase and its purpose.

这份工作分成三个阶段，audit、evals 和 deployment。你的任务就是理解每个阶段，以及它各自存在的意义。

Your portfolio and your ability to speak about it are the deciding factors. Build agents, RAG pipelines, eval frameworks, MCPs, etc, and, most importantly, be able to confidently articulate the business use case behind everything you're building.

Lack of communication ability is a deal-breaker for the FDE role. If you can't explain what AI can and can't do to a non-technical decision maker, there won't be a deployment.

缺乏沟通能力，对 FDE 岗位来说是致命问题。如果你没法向一个非技术决策者解释 AI 能做什么、不能做什么，那就不会有部署发生。

Know when AI isn't the answer; this builds trust with the client and more importantly, ROI for the agents in production.

要知道什么时候 AI 不是答案。这会帮助你赢得客户信任，更重要的是，也能让生产环境中的 agent 真正做出 ROI。

Do these steps and your chances of breaking in will be exponentially higher.

把这些步骤做好，你成功进入这个领域的概率会成倍提升。

If you want to make the jump to the fastest-growing Applied AI company in SF, Varick is hiring. We're building out the most elite FDE team in Silicon Valley, led by a former COO of Citadel Securities. Apply directly at https://www.varickagents.com/careers.