Buyer guide / The four vendor types selling this title in 2026

AI forward deployed engineer: which of the four vendors are you actually hiring?

By May 2026 the title sits on job boards at Anthropic, OpenAI, Palantir, Salesforce, ServiceNow plus Accenture, EY, Cohere, Ramp, Rippling, and at least a dozen smaller studios. The four vendor types selling under it look identical on the first call. They behave very differently in your repo eighteen months later. This page is the buyer-side decision framework, not a career guide.

Matthew Diakonov, Written with AI

Published May 8, 202611 min read

Direct answer (verified 2026-05-08)

An AI forward deployed engineer is a senior engineer who works from inside the customer's GitHub, Slack, and standup for two to six weeks to ship a production AI agent plus an eval harness, a CI pipeline, and a runbook. As of May 2026 four structurally different vendor types sell that title: model labs (Anthropic Applied AI, OpenAI), big-firm consulting practices (Accenture's Microsoft FDE practice, EY), platform vendors (Palantir, Salesforce, ServiceNow with Accenture), and vendor-neutral studios (fde10x is one). The cleanest test for which one you are hiring is the leave-behind: which of four files (rubric.yaml, eval/cases.yaml, .github/workflows/pilot-gate.yml, runbook.md) sit in your repository, and which sit in the vendor's, when the engineer leaves at week six.

Sources reviewed: Anthropic, Forward Deployed Engineer, Applied AI, Accenture launches Microsoft FDE practice, EY launches FDE roles, The Pragmatic Engineer, Forward Deployed Engineers.

The buyer's question, in one file list

Most articles on this topic frame the question as a job description: what does an AI FDE do all day, what salary range, what tech stack. That framing is fine for engineers reading career guides. It is not the buyer's question. The buyer's question is structural: when the engagement ends, which of four files are in my repository, and which are in the vendor's? Print the list, ask the question, and the four vendor types separate themselves before any contract is signed.

four-file-test.txt

The four vendor types, side by side

Same title, four very different contracts. The dimensions that matter to a buyer are: who pays the engineer, what runs the eval at week six, where the agent code lives, what it costs to swap models after handoff, how long the engagement runs, who is actually on the call in week one, and whether there is a refund clause if the work misses the prototype rubric. The vendor-neutral studio is the column we run; the right-hand column collapses the other three for legibility.

Feature	Other three vendor types	Vendor-neutral studio
Who pays the engineer	Model lab: fee plus inference contract. Consulting: time-and-materials, often $300 to $500 per hour. Platform: fee bundled with platform license.	Fixed fee. Pre-agreed scope, refund-and-exit gate on calendar day 14.
What runs the eval at week 6	Model lab: vendor-hosted eval surface. Consulting: a Jira-tracked manual review by a senior partner. Platform: a dashboard you cannot export.	rubric.yaml + eval/cases.yaml on your main branch, run by your CI.
Where the agent code lives	Model lab: your repo, but pinned to one vendor SDK. Consulting: their repo, transferred at the end if you ask. Platform: their platform, runs only there.	Your GitHub, MIT-style internal license, no SDK lock.
Cost to swap models after handoff	Model lab: re-architect off SDK, re-build eval surface. Consulting: re-engagement at hourly rate. Platform: usually not allowed by terms.	Change one config line. Re-run rubric.yaml on the same eval set.
Engagement length	Model lab: 6 to 12 months typical. Consulting: 6 weeks to 2 years. Platform: 6 to 18 months for an initial landing.	2 weeks (prototype) to 6 weeks (production).
Who is on the call in week 1	Model lab: a Solutions Architect who hands off to an FDE. Consulting: a partner plus an analyst, FDE assigned later. Platform: a sales engineer plus an account executive.	The named senior engineer who will write the code.
Refund clause	Rare in any of the other three. T&M arrangements have no exit gate.	Prorated refund and exit if the week 2 prototype rubric is missed.

The right-hand column is honest about the tradeoffs. A model-lab AI FDE is often the right hire when the team has standardized on one vendor's API and wants white-glove support tightening prompts and agents on that surface. A big-firm consulting AI FDE is often the right hire for a multi-team rollout where the bottleneck is partner-level governance, not engineering velocity. A platform AI FDE is the right hire when the platform is already part of the stack and the agent must run inside it. A vendor-neutral studio is the right hire when the buyer wants the eval harness and the agent code in their own repo, with the model layer swappable, and a fixed-fee engagement that ends.

The four files (and why this is the only test that matters)

Every other test (technical interview, reference check, sample architecture diagram) is gameable in week one and unverifiable in week eighteen. The file-list test is verifiable on day one (ask the candidate to show you redacted versions from a prior engagement) and verifiable on the last day of the engagement (the files either landed on main or they did not). The four files on a vendor-neutral studio engagement, in the order they land:

Vendor-neutral studio leave-behind, on the client's main branch by calendar day 35

rubric.yaml. Five graded axes (faithfulness, helpfulness, completeness, tone, policy), per-axis floors, week-by-week ratchet from week 2 to week 6. Lives at the repo root.
eval/cases.yaml. At least thirty cases drawn from real production traces (not happy path), per-axis ground truth, stakes tags. The literal test set the rubric grades against.
.github/workflows/pilot-gate.yml. Runs the rubric on every pull request and on a Monday 09:00 UTC cron. The cron run is what fires the refund-and-exit clause if it goes red on calendar day 14.
runbook.md. The rollback command, the model-swap path, the cost dashboard URL, the three named humans paged at 2am. Owned by the client's on-call team after week 6.

The first of those files, rubric.yaml, carries the engagement. It defines the bar the model has to clear to ship, the per-axis floors that catch failure shapes the average would hide, and the week-by-week ratchet that walks the bar from prototype-grade in week 2 to production-grade in week 6. A working excerpt from a real engagement, lightly redacted:

rubric.yaml

A model-lab AI FDE will run substantially the same logic, but the eval surface is hosted on the lab's side, the rubric weights live in a vendor dashboard, and on the day you decide to swap models the eval surface goes with the SDK. A consulting AI FDE will run a manual review by a senior partner, which is credible the day it happens and unreproducible in month four. A platform AI FDE will hand you a dashboard you cannot export. The four-file test is the only version where the artifact survives the engagement intact, in your repo, on your CI, against your test set.

When each vendor type is actually the right hire

A buyer-side decision framework only earns trust if it admits the cases where the column you are pitching is not the right column. Four honest cases:

Hire a model-lab AI FDE when:

You have already standardized on one vendor's API for the next twelve months, the production agent will only ever run on that vendor, and the bottleneck is the long tail of prompt and agent behavior on that vendor's specific surface (tool-use semantics, structured output edge cases, specific model versions). Anthropic's Forward Deployed Engineer, Applied AI listing is explicit that the deliverables are MCP servers, sub-agents, and agent skills built against Claude. That is the right hire if you are building on Claude. It is the wrong hire if you might swap to Bedrock or an open-weight model in 2027.

Hire a big-firm consulting AI FDE when:

The bottleneck is governance and stakeholder alignment across many teams, not engineering velocity on a single agent. If the work is a 12-month rollout across three business units, two procurement organizations, and a regulator-facing risk function, a partner-led engagement with named analysts and a risk officer in the room is appropriate. If the work is shipping one production agent in 6 weeks, hourly billing on a partner-plus-analyst structure is the most expensive shape money can buy.

Hire a platform-vendor AI FDE when:

The platform is already part of the stack (Palantir Foundry, ServiceNow, Salesforce Data Cloud) and the agent has to read from or write to platform data structures. In that case the FDE who knows the platform's data model cold is worth more than the FDE who is vendor-neutral. The trade is that the agent leaves with the platform; it does not run elsewhere.

Hire a vendor-neutral studio AI FDE when:

The leave-behind matters more than the engagement length. You want the rubric, the eval cases, the CI gate, and the runbook in your own repo, with the model swappable behind the SDK and no platform license to keep paying for. You are willing to commit to a written one-pager from a free scoping call, a fixed fee, and a refund-and-exit gate on calendar day 14 if the prototype rubric does not land. fde10x is one of these.

Why the role exploded in 2024 and 2025

Foundation models commoditized faster than the application layer that uses them. By 2024, the choice between Claude, GPT, Gemini, and an open-weight model on Bedrock or Vertex was a swap, not a strategy. The model labs realized the application layer was the moat, and the cheapest way to win it was an engineer inside the customer's repo for a few weeks. The customers responded by asking for the same shape without the vendor binding. The vendor-neutral studio split off in 2025. The big consulting firms then standardized their own version under the same title, and the platform vendors followed.

800%

“Monthly job listings for forward deployed engineers grew more than 800 percent year over year, with hires across at least Anthropic, OpenAI, Ramp, Salesforce, Palantir, Commure, Matta, Gecko Robotics, Lindy and John Deere.”

The Pragmatic Engineer, 2025

The growth number is interesting. The harder fact is that the title fragmented while the listings exploded. A buyer who reads the title alone cannot tell which of the four contracts they are about to sign. The file-list test is the way to read past the title.

What the buyer should do on the first call

A short script that works against any of the four vendor types. The point is not to pitch the studio answer; the point is to surface which column you are actually buying so the rest of the conversation is honest:

Open with the file-list question. "When the engineer leaves at week six, which of these four files are in our repo and which are in yours? rubric.yaml, eval/cases.yaml, .github/workflows/pilot-gate.yml, runbook.md."
Ask for redacted versions of those four files from a prior engagement, on the call. A vendor-neutral studio will share them. A model-lab FDE will share a vendor-hosted eval surface and a sub-agent skill bundle. A consulting team will share a slide deck. A platform vendor will share a dashboard screenshot. The artifact you see is the one you will get.
Ask what happens if the week 2 prototype rubric does not land. The vendor-neutral answer is a prorated refund and the code stays with you. The other three usually have no exit gate; the engagement continues at the agreed rate or the agreed license.
Ask which engineer will be on the call in week one. If the answer is a Solutions Architect, an account executive, or a partner with an analyst, then the named senior engineer who writes the code is not the person you are buying.
Ask what swapping models costs after handoff. The vendor-neutral answer is one config line and a re-run of rubric.yaml on the same eval set. The model-lab answer is a re-architecture off the SDK. The platform answer is usually a contract amendment.

Ten minutes of those five questions will tell you which of the four columns you are sitting in. The rest of the engagement is mostly mechanical from there.

Want a vendor-neutral AI FDE in your repo on calendar day 1?

Sixty minutes with the engineer who would own the build. You leave with a written one-pager: the four leave-behind files named, the production outcome, the rubric, and a fixed fee. The week 2 cancel-and-refund clause is in the MSA before any code is written.

Frequently asked questions

What is an AI forward deployed engineer in one sentence?

An AI forward deployed engineer is a senior engineer who works from inside the customer's GitHub, Slack, and standup for two to six weeks to ship a production AI agent plus an eval harness, a CI pipeline, and a runbook. The deliverable is merged code in the customer's repository, not a recommendation document or a dashboard.

How is an AI FDE different from a regular forward deployed engineer?

Functionally, the role is the same: an engineer embedded in the customer's environment shipping production code. The 'AI' qualifier signals two things. First, the work is an LLM-powered agent or pipeline, not a Palantir-style data platform integration. Second, the eval harness is the load-bearing artifact (more than the code itself), because the agent's behavior is graded probabilistically against a rubric instead of unit-tested deterministically. An AI FDE who cannot write rubric.yaml is not yet an AI FDE.

Who actually sells AI FDE engagements in 2026?

Four kinds of vendor. Model labs (Anthropic Applied AI, OpenAI's FDE team) bring an engineer who pins your inference to their vendor SDK. Big-firm consulting practices (Accenture's Microsoft FDE practice, EY's FDE roles) run hourly time-and-materials with a partner plus analyst. Platform vendors (Palantir, ServiceNow plus Accenture for agentic AI, Salesforce AI FDE roles) bundle the engineer with a platform license that you keep paying for. Vendor-neutral studios (fde10x is one) sell a fixed-fee engagement with a refund-and-exit gate on calendar day 14 and four leave-behind files in your repo.

How do I tell which vendor type I am about to hire?

Ask one question on the first call: 'When the engineer leaves at week six, which of these four files are in our GitHub repository, and which are in yours?' The four files are rubric.yaml, eval/cases.yaml, .github/workflows/pilot-gate.yml, and runbook.md. If the honest answer is all four in your repo with the model swappable behind the SDK, you are hiring a vendor-neutral studio. If the answer involves a hosted eval surface or a vendor SDK lock, you are hiring a model lab. If the answer is hourly billing for a partner plus analyst, you are hiring consulting. If the answer requires a platform license to keep running, you are hiring a platform vendor.

How much does an AI FDE cost in 2026?

Model lab AI FDEs run on the lab's enterprise contract, where the engineering hours are bundled with an inference commitment that often clears six figures per quarter. Big-firm consulting practices run on time-and-materials, typically $300 to $500 per hour for a senior engineer, with the full engagement landing in the low to mid six figures. Platform vendors bundle the engineer with the platform's annual license. Vendor-neutral studios price the engagement as a fixed fee for a 2-week prototype or a 6-week production engagement, with a written one-pager produced from a free scoping call before any contract is signed.

What technical skills does an AI FDE need?

Production fluency in Python plus one of TypeScript or Go, comfortable reading a stranger's repo on day one, eval discipline (ragas plus a custom rubric plus human review on every engagement), MCP-native and A2A-compatible orchestration, Docker plus Kubernetes plus one of AWS, GCP, or Azure, and at least one of LangGraph, Pydantic AI, or a hand-rolled agent DAG. The harder requirement is judgment: an engineer who calls a feature done before the rubric score lands, or who folds when the customer's product manager pushes back on which case to grade against, is not yet an AI FDE.

How long is an AI FDE engagement?

Two to six weeks for a vendor-neutral studio engagement focused on a single production agent. Six to twelve months for a model-lab engagement, because the lab also wants to land its inference contract and the engineer's job extends past the agent into model-selection consulting. Six weeks to two years for a big-firm consulting engagement, depending on whether it is a focused scope or a multi-team rollout. Six to eighteen months for a platform vendor engagement, because the goal includes platform adoption, not just the agent.

What is an AI FDE not?

An AI FDE is not a sales engineer (whose week ends with a signed deal), not a solutions architect (whose week ends with a recommendation document), and not a staff-aug contractor (who takes a Jira ticket and writes the code without owning the eval). The seniority bar is the same as a senior software engineer at a strong product team. The deliverable is merged code, a passing rubric, and a runbook the customer's on-call team owns; not a deck.

What if my team already has senior MLEs? Do we still need an AI FDE?

Often no. The pattern is most useful when the team has a board-level AI mandate, a deadline in the next two to six weeks, and a hiring gap on senior MLEs that cannot be closed in time. A team with three senior MLEs already shipping production agents does not need a vendor-neutral studio AI FDE. They might still want a model-lab AI FDE for a few days of pair-programming on a vendor-specific edge case.

Is an AI FDE the right call for a Series A startup, or only for enterprises?

Both, but for different reasons. For $2B to $20B revenue enterprises, the AI FDE is a way to ship a board-mandated agent on a deadline without waiting six months to fill a senior MLE role. For Series A AI-native startups (10 to 40 person team, $8M to $25M raised), the AI FDE is a way to land the next eval jump before a better-funded competitor ships it, without diluting equity or burning founder time on a hire that takes a quarter. For pre-seed or staff-aug-shaped buyers, the model is a poor fit; the engagement assumes named senior engineers in a real GitHub repo, not a shared Slack channel and a Jira board.

Related guides on the FDE role

Definition

What is a forward deployed engineer (the role, the three flavors, and the four-file test)

The structural definition of the role. Where it came from, how it fragmented in 2024 to 2025, and the file-name question that tells you which flavor you are about to hire.

Read

History