UX4Tech · Frontier AI Advisory

Governedby design.

SAP just shipped the most powerful enterprise AI platform of the decade. Whether it creates value or chaos comes down to one thing the brochure never mentions.

Scroll

A UX4Tech perspective on realizing the full promise of SAP & Microsoft AI

Sapphire 2026 · The Promise

The AI operating system for the enterprise has arrived.

At Sapphire 2026, SAP unveiled AI Foundation — positioned as the AI operating system for SAP business AI. A neuro-symbolic platform grounded in the deepest enterprise knowledge graph ever assembled, paired with two purpose-built foundation models and an agentic layer that has moved from copilot to genuine autonomy.

Tables in the knowledge graph

Business data fields

Specialized AI agents shipped

Joule Skills

Joule Studio's Agent Builder reached general availability in January 2026. The Generative AI Hub now serves the world's leading models — GPT‑5.2, Gemini 3.0 Pro, Claude Opus 4.6 — through one governed, sovereign, security-controlled layer. Principal propagation is enforced by default. Citations are surfaced. Grounding is neuro-symbolic.

This is a genuinely remarkable platform, built with serious attention to governance, security, and data sovereignty. SAP has done extraordinary architectural work to make enterprise AI safe by design. The capability is real. The promise is real.

And that is exactly why the next part matters.

The turn

Capability has just outpaced the way most enterprises govern it.

The platform is ready. The question is whether the discipline on the customer's side of the table keeps pace. History suggests it rarely does — and the data is sobering.

The $66 billion problem

What the brochure never shows you.

Across more than 5,400 large IT projects studied by McKinsey and the University of Oxford, the cumulative cost overrun reached $66 billion — more than the GDP of Luxembourg. The averages underneath that number are worth pausing on.

Average budget overrun

Less value than predicted

ERP projects miss their objectives

Narrow the lens to enterprise resource planning and the picture is consistent. S/4HANA implementations take, on average, 30% longer than anticipated. Practitioners who review large programs from the inside report overruns of 40–60% as routine — a $4M plan crossing $7M by go-live, enterprise programs passing $20M without blinking.

It is essential to separate two things in these numbers. The first is the technology — modern SAP and Microsoft platforms are extraordinary, and the customers who realize their value generate transformative outcomes. The second is the delivery model — the human, procedural, and governance layer that decides whether potential becomes actual.

The data above is a story about the delivery model. Not the technology.

Predicted value vs. delivered value — the average large IT program

100%

Value
predicted

44%

Value
delivered

The 56% gap is, per McKinsey, a failure of strategy and value — not of project management. Organizations have become highly efficient at building systems that do not yet deliver what the platform is fully capable of.

Why the gap keeps appearing

When a large program falls short, the post-mortem rarely reveals bad luck. It reveals a small set of recurring, documented, avoidable delivery-model patterns. We know this because the disputes that reach court leave a paper trail. A few matters of public record — firm names withheld, case details preserved:

$172M

A leading medical device maker (2025). Delivery fees ran ~36% above what was represented; the complaint cited governance and business-readiness concerns overridden under schedule pressure.

$585M

A major US utility. Stabilization ran ~$30M/month for two years — about 150% of the original implementation cost — amid data-conversion shortcuts and a test regime that kept reporting “green” while the business absorbed real impact.

$30M

A US county government. Alleged the program was staffed with consultants lacking sufficient platform experience — far from the senior expertise represented at sale.

$600M

A global tire manufacturer. A defect-laden go-live; the integrator's defense was that the customer insisted on going live before the system was ready. Governance failure cutting both ways.

Across these cases the recurring patterns concern the delivery model, not the platforms underneath. The most cited is the bait-and-switch: senior expertise wins the deal, then rotates off once the ink dries, leaving the work to staff who were never part of the design conversation. The architect in the sales meeting is, too often, a marketing artifact rather than the person accountable for the outcome.

In the AI Foundation era, every one of these risks no longer adds. It multiplies.

A program that misses its value case produces a costly transformation. A program that misses its value case with autonomous agents in flight produces a costly transformation with agents acting on incomplete grounding inside it. Agentic capability that can adjust approval thresholds, post documents, and route exceptions raises the stakes of every governance gap that used to be merely expensive.

SAP has built strong safeguards — neuro-symbolic grounding, principal propagation, citation surfacing, model optionality. The customer's job is to exercise them. That is precisely where the right partner earns its place.

The UX4Tech answer

Reference architecture, and a governed eyes view.

UX4Tech is built to be a different kind of partner — not by promising a mythical perfect architect, but by changing what the work depends on. Two disciplines define how we advise, in close alignment with SAP's own Enterprise Architecture Reference Library and Microsoft's Cloud Adoption Framework.

Discipline 01

Reference architecture

We never design from a blank page. Every client decision maps to a validated, known-good pattern — SAP clean-core, BTP extension models, Joule Skill and agent templates, Generative AI Hub usage patterns. Deviation is allowed, but justified in writing. Design intelligence becomes institutional, not person-dependent — the most effective antidote to the bait-and-switch.

Discipline 02

A governed eyes view

Every architecture is examined through a governance and risk lens before, during, and after build. Segregation of duties. Evidence-based sign-off. Independent review separated from the build team. Agents bounded by explicit scopes and human-in-the-loop checkpoints. Not a compliance afterthought — the lens we look through from the first conversation.

Below: six of the most expensive, best-documented value-realization failure categories — mapped to the principle that prevents each, and how our model operationalizes it. The first five are the traditional failure curve. The sixth is the new one the Foundation era invites.

Staffing integrity — defeating the bait-and-switch

The pattern

Senior expertise sells; less experienced staff deliver. Scarce practitioners get rotated to the next sales cycle the moment the contract closes.

How we mitigate

Because design decisions anchor to a documented reference library, senior expertise is encoded, not rotated out. The same expertise that informed the design walks into the room — not a junior consultant carrying senior slides.

Governed eyes: every decision is traceable to a named reference pattern — not an individual's best guess under deadline pressure.

Clean-core discipline — customization with a conscience

The pattern

Excessive customization multiplies complexity and brittleness. In the AI era, the equivalent is building bespoke agents for problems SAP's 40+ shipped agents already solve elegantly.

How we mitigate

Our architectures bias structurally toward standard functionality and clean-core extension on BTP. SAP's published clean-core guidance is our default; every deviation justifies itself before it is accepted.

Governed eyes: a customization register with risk-rated justifications and a gate that must clear before any core modification — code or agent — proceeds.

Data & testing rigor — no green lights over a broken system

The pattern

The most insidious failure: a system that reports success while failing. Generative models compound it — fluent, confident output that won't announce its own uncertainty unless designed to.

How we mitigate

Reference test architectures that mirror real business scenarios; designs proven in sandbox before they touch production. Agents that ground against verified sources, return citations, and decline rather than confabulate — exercising the safeguards SAP built.

Governed eyes: the reviewer is never the builder. “The agent answered” is never accepted as proof “the agent answered correctly.”

Value assurance — closing the 56% gap

The pattern

Systems get built; value does not always arrive. A strategy shortcoming more than an execution one — and the AI era risks reproducing it on a higher base spend.

How we mitigate

Each reference architecture pairs with an explicit value hypothesis. Our AI Readiness Audit sets a measurable baseline before spend is committed. Agent deployments get adoption, exception-rate and outcome KPIs from day one.

Governed eyes: a value register tracked from design to delivery. The gate question is never “are we busy?” — it is “is the value still on track?”

Governance & oversight — the discipline that ties it together

The pattern

Even experienced teams fall short when governance is not defined and enforced. Readiness concerns overridden under schedule pressure — a governance breakdown expressed as a delivery failure.

How we mitigate

Governance is not a phase; it is the lens. Our heritage is security and risk — we apply a CISO's discipline to the program itself, not only the system being built. Risk, controls, SoD and evidence are first-class architectural concerns.

Governed eyes: the standing to flag the readiness gap before go-live — rather than billing the remediation after.

AI architecture discipline — when generative AI joins the digital core

The new frontier

Shadow agents drifting outside the authorization model. Outputs taken on trust when grounding wasn't verified. Consumption drift. Model lock-in. Authorization sprawl. Agent-to-agent trust without a framework. SAP does its part; the customer governance model must do its part too.

The principle — four questions, answered in writing

What data does this agent ground against, and is it appropriate for this use case?
What is this agent authorized to do, and how is that scope enforced?
What is the human-in-the-loop boundary for material actions?
How is this agent's output audited — and how would we explain it to a regulator?

How we mitigate

Our AI reference architectures carry explicit patterns for each: grounding-source registers, scoped authorization templates, human-in-the-loop checkpoints, audit-by-design logging. We treat the Generative AI Hub as a governed shared resource and Joule Studio as a provisioned capability — preserving the democratization SAP intends while wrapping it in the maturity each regulatory profile demands.

Governed eyes: agents inventoried, scopes reviewed, outputs sampled, costs monitored, grounding catalogued. When a regulator asks how you know an agent's decision was correct and in-scope — there is a documented answer.

The honest pitch

The bar isn't a perfect architect who never appears on real projects. It's the actual industry mean — and that bar has been low for two decades.

We won't claim our model is flawless. No model is. We track our known unknowns and disclose them in scope letters — rather than discovering them in remediation invoices. Measured against the honest baseline, a disciplined, governed, reference-architecture approach isn't a gamble. It's the correction the delivery side of the industry has needed.

The platforms are readyThe patterns existThe discipline is the difference

Start with a Governed-Design Review

About UX4Tech

UX4Tech is an SAP partner advising enterprise customers on SAP modernization, AI Foundation adoption, and the governance disciplines required to make both stick. Our practice combines deep SAP architecture experience, a CISO-grade security and risk lens, and an AI-augmented delivery model that keeps senior expertise present from first conversation to final sign-off. We deliver to reference architectures aligned with SAP's published guidance, document our reasoning, and stand behind the outcomes. When the work is novel, we engage subject-matter experts on demand to validate the judgments that warrant a second pair of eyes — without the overhead, or the rotation risk, of a traditional bench.

References

McKinsey & Company / University of Oxford — Delivering large-scale IT projects on time, on budget, and on value (2012). 5,400+ projects; 45% avg budget overrun, 7% schedule overrun, 56% value deficit, $66B cumulative overrun.
Industry research on SAP S/4HANA implementations — ~30% timeline overruns on average; majority of executives reporting budget and quality misses (Horváth and others, 2025).
Gartner — ERP implementation objective-failure rate (55–75%), cited across industry analyses.
Public-record enterprise implementation disputes (firm names withheld; details preserved in general terms): the $172M S/4HANA dispute (2025); the major utility's ~$585M stabilization and settlement; the US county government's ~$30M dispute; the global tire manufacturer's $600M dispute. Matters of public record, widely analyzed in IT and ERP trade press.
SAP — Business AI Release Highlights, Q1 2026. Joule Studio Agent Builder GA; Generative AI Hub model additions; Joule GA in Datasphere; Enterprise Architecture Reference Library integration.
SAP — Sapphire 2026 announcements. AI Foundation as the AI operating system for SAP business AI; 452,000 tables, 7.3M data fields; SAP-RPT-1 and SAP-ABAP-1 foundation models.
SAP Community — 2026 Roadmap for Joule for Developers; independent analysis, SAP Joule Agentic AI 2026 (40+ agents, 2,400+ Skills, principal propagation).