Build vs buy, with the receipts
A demo-grade copilot is a weekend project. A production-grade agent β one that acts on your API for authenticated end users, in their language, without leaking data between tenants β is not. Here is the math, with named sources.
What production-grade actually includes
The gap between the weekend demo and something you can put in front of customers is exactly the part that never shows up in the demo:
- End-user authenticationThe agent must act as the signed-in user β token verification, scopes, step-up confirmation. Teams report the OAuth takes longer than the tool logic.
- Tenant isolationEvery query, cache, log, and metric scoped per customer β and provable, because your buyers' security reviews will ask.
- Action safetyWrite and destructive actions need server-enforced confirmation gates. An agent that deletes without asking generates the tickets it was meant to prevent.
- Multilingual, including RTLYour end users do not all read English. Prompts, UI, and error states in every market language, with right-to-left layouts that actually mirror.
- Evals and regressionsModel and prompt changes need an evaluation harness, or every upgrade is a gamble taken in production.
- Security patching and upkeepStreaming protocols, MCP spec revisions, dependency advisories β the maintenance starts the day the demo ends.
The published failure data
- 95%
- of enterprise GenAI pilots fail to reach production.MIT NANDA, Aug 2025
- 17% β 42%
- abandonment of AI initiatives, year over year.S&P Global, 2025
- 40%
- of agentic AI projects forecast to be canceled by 2027.Gartner, Jun 2025
- 57%
- of buyers expect AI ROI within three months.G2, 2025
Industry consensus puts the build threshold near ~1M conversations per year β below it, buying is the defensible call. Most product teams sit well below it.
The in-house math
An in-house build starts at one AI engineer ($200K/yr loaded, if you can hire one) plus the months before anything customer-safe ships. Syncanix starts at $199/mo, runs the same week, and every allowance is published.
Three ways vendors charge for the same conversation
Pricing model shapes matter more than list prices: the same end user, the same month, three very different bills.
| Model | How it bills | A heavy userβs month | Bill predictability |
|---|---|---|---|
| Per resolution | Each AI-resolved conversation billed β Intercom publishes Fin at $0.99 per resolution. | 12 resolved conversations bill 12 times β $11.88 for that one user at the published rate. | Scales with conversation volume; retry storms and clarification loops show up on the invoice. |
| Per conversation (custom contracts) | Enterprise per-conversation contracts β Decagon does not publish pricing. | Depends on the contract; volume terms are negotiated per deal. | Predictable only after a negotiation; not self-serve. |
| MAEU (Syncanix) | A monthly active end user counts once, however many conversations they have. Tiers from $199/mo with published message and action allowances. | 12 conversations from one user is still one MAEU. Caps are published; overage is prepaid credit packs, never a surprise invoice. | Computable on a napkin β users Γ tier, with visible caps. |
Published list prices as of mid-2026; the arithmetic is illustrative. Resolution rates and conversation depth vary by product β this compares pricing mechanics, not outcomes.