Private AI · Data Sovereignty

Private AI vs ChatGPT Enterprise: what UK SMBs need to know about data sovereignty

Q: Does ChatGPT Enterprise train on my data?

No. OpenAI contractually commits that ChatGPT Enterprise and Team plan prompts and outputs are not used to train their models. Retention is typically 30 days with zero-retention available on request. The protection is contractual, not architectural — which is important if a contract or regulator requires stronger isolation.

Q: Is Microsoft Copilot data kept in the UK?

Yes if configured. Set your Microsoft 365 tenant region to United Kingdom and confirm EU Data Boundary is enabled. Copilot processing then stays within the EU, with UK regions (UK South/West) as the primary location for most UK tenants.

Q: When is a self-hosted private AI actually necessary?

When a client contract explicitly prohibits third-party AI processing, when a regulator requires evidence of physical data location, when handling legally privileged material, or when your IP is the asset (unpublished research, trade secrets). Most UK SMBs are well-served by commercial tools; self-hosted becomes the right call at the edges.

Q: How much does a private AI deployment cost for a UK SMB?

Typical build cost for a 25–100 user private AI on Azure UK region is £15,000–£40,000, with monthly run costs of £400–£2,000 depending on GPU hours and model size. On-premise appliance builds run £25,000–£60,000 plus electricity and maintenance.

Q: Which option is best for a London law firm?

Most 5–50 fee-earner firms start with Copilot for Microsoft 365 (integrates with Outlook/Word/SharePoint) for general productivity, plus Azure OpenAI in their own tenant for any privileged contract review or document analysis. This keeps client data within a UK/EU boundary, contractually ring-fenced from training, and under audit.

By Hak, VantagePoint Networks · Published 18 April 2026 · 10 min read

"Private AI" has become a marketing term, which means it has almost stopped meaning anything. Every vendor now claims their product is "private", "enterprise-grade" and "your data stays yours" — yet the definitions, the storage locations, and the training-data usage vary wildly between OpenAI, Microsoft, Anthropic, Google, and the growing set of self-hosted options.

If you run a UK SMB — especially in law, financial services, healthcare or professional services — "where does my data actually go?" is not a theoretical question. It's the one your clients, your insurers, your regulators, and eventually your auditors will ask. This article gives you an honest comparison so you can answer it.

The four real options in 2026

Strip away the marketing and there are essentially four delivery models you can pick for AI in a UK SMB. Each has a defensible use case — the trap is assuming they're interchangeable.

Option	Where data lives	Training on your data?	Typical cost (25 users)	Best for
Copilot for M365	Microsoft UK/EU data boundary, same tenant as your M365	No — contractual commitment	£~7,800/yr + base M365	Firms already on M365 Business Premium / E3, document-heavy work
ChatGPT Enterprise / Team	OpenAI (US, with EU residency option)	No — contractual	£~5,500–7,800/yr	Teams needing GPT-4/5-class reasoning outside M365
Claude for Work	Anthropic (US, EU region available)	No — contractual	£~6,000–7,800/yr	Long-document analysis, careful drafting, complex reasoning
Self-hosted private AI	Your infrastructure (Azure/UK DC/on-prem)	No — you control it	£15k–£60k build + £400–£2k/mo run	Privileged data, regulated firms, IP-sensitive work

Two things jump out. First, all four options say they don't train on your data — but the legal mechanism differs. Microsoft and Anthropic rely on contract plus architectural separation. Self-hosted removes the question entirely. Second, self-hosted is dramatically more expensive up-front, but has a very different long-run cost curve if your team is large or your use is heavy.

What "your data stays yours" actually means

When a vendor says your data isn't used to train their models, they typically mean four things — and you should confirm all four in writing before signing:

No training on prompts or outputs. Your chats don't become future model weights. This is the strongest and most universally respected commitment.
No human review of your content. Some providers keep the right to human-review flagged content for abuse detection. Ask whether you can opt out, and whether reviewers are UK/EU based.
Limited retention window. How long are your prompts and outputs kept on the provider's systems? 30 days is typical; some now offer zero-retention on request.
Data residency. Where physically does the processing happen? "EU Data Boundary" for Microsoft and "EU region" for Anthropic / OpenAI are real options, but they must be switched on — they are rarely the default.

If any of those four commitments feel vague in a vendor's own documentation, assume the commercial protection is weaker than marketing suggests. Press for plain-English clarification from your account manager, and get it in the MSA — not a blog post.

The case for sticking with a commercial provider

Most UK SMBs — including most I work with in law and finance — are genuinely well-served by Copilot, Claude for Work, or ChatGPT Enterprise. Here's when a commercial tool is the right answer:

Your data classification is commercial-sensitive but not privileged — internal plans, quotes, marketing copy, management reports.
You are processing your own intellectual property, not client material under NDA.
Your client contracts don't explicitly prohibit third-party processing.
Your regulator or insurer has given clear guidance that commercial AI with a UK/EU data-boundary commitment is acceptable.
Your team is under 150 and usage patterns don't justify the build cost of a private deployment.

In this zone, Copilot's integration with Outlook, Word, Excel and SharePoint is usually worth more than the marginal sovereignty gain from going private. The operational win beats the architectural purity.

The case for going private

Self-hosted or dedicated-tenant AI becomes the right call when one or more of the following applies:

You handle legally privileged communications — solicitor/client, in-house counsel memos, witness statements, regulatory investigations.
You operate under client contracts that forbid third-party processing — increasingly common in M&A, litigation, defence, and government work.
Your IP is the asset — proprietary research, algorithm development, unpublished manuscripts, trade secrets. You don't want any version of it sitting in a US provider's logs, even under contract.
You're regulated in a way that requires evidence of physical data location — some FCA firms, Ministry of Defence suppliers, NHS trusts.
You have 200+ heavy users and the per-seat economics of commercial tools have stopped making sense.

In those scenarios, a self-hosted deployment isn't over-engineering — it's the minimum viable answer to a client, insurer or regulator question.

What a private AI actually looks like

When I deploy a private AI for a London SMB, it's usually one of three architectures:

1. Azure OpenAI in your own tenant

Lowest friction if you're already on Microsoft. You get GPT-4/5-class models running inside your Azure subscription, in UK South or UK West region. Contractually, Microsoft doesn't see your prompts. Practically, this is "dedicated tenancy" not "self-hosted", but it passes the sovereignty test for the vast majority of UK regulators.

2. Open-weight models on Azure / AWS UK regions

Models like Llama 3, Mistral, or Qwen running on GPU VMs in UK-regional Azure or AWS. You operate it. You patch it. No vendor sees the prompts. This is "private" in the strictest sense, and the right answer when a client contract explicitly forbids third-party model providers.

3. On-premise appliance

For firms that already have a server room and a compliance reason to avoid public cloud entirely, a dedicated GPU box running an open-weight model is now commercially viable. Build cost is higher (£25k–£60k typical); running cost is effectively electricity plus maintenance.

Rule of thumb

If a single one of your largest client contracts prohibits "third-party AI processing", a private deployment probably pays for itself in the first year — not in compute, but in the client revenue it preserves.

The honest trade-offs of going private

Private AI sales pitches rarely discuss what you give up. Let's be direct:

Model quality. Open-weight models in 2026 are excellent but not yet at parity with GPT-5 or Claude 4.x on the hardest reasoning tasks. For summarisation, drafting and RAG-based Q&A, the gap is negligible. For frontier reasoning, it's still visible.
Update cadence. OpenAI and Anthropic ship improvements weekly. Your self-hosted model is frozen at the version you deployed, unless you budget for quarterly upgrades.
Operational overhead. Someone has to patch, monitor, and back it up. If you don't have an internal IT capability, budget for a managed service, or the TCO advantage disappears.
Integration work. Copilot is one click away from Outlook. A private model talking to your document store needs a RAG layer, authentication, audit logging and a chat interface. That's 6–10 weeks of work, not an afternoon.

A decision flow you can actually use

Read your top five client contracts. Any explicit prohibition on third-party AI processing? If yes → private is the floor.
Check your regulator's guidance. SRA, FCA, ICO have all published positions. If private or dedicated-tenant is named as acceptable → follow it.
Classify your usage. What fraction of prompts will involve privileged or client-confidential material? Under 10% → commercial tool plus strict policy. Over 30% → private.
Count heavy users. Under 50 → commercial almost always wins. Over 200 → run the economics; private frequently wins.
If you're still unsure → pilot both for 60 days on a small group and measure.

The worst decision is the one driven by whoever shouted loudest at the last management meeting. The second-worst is signing an enterprise contract because your account manager offered a discount. Do the five-question version above; most firms reach a defensible answer in an afternoon.

Frequently asked questions

Does ChatGPT Enterprise train on my data?

No. OpenAI contractually commits that Enterprise and Team prompts and outputs are not used to train their models. Retention is typically 30 days; zero-retention is available on request. Protection is contractual, not architectural — important if a contract or regulator requires stronger isolation.

Is Microsoft Copilot data kept in the UK?

Yes if configured. Set your tenant region to United Kingdom and enable EU Data Boundary. Copilot processing then stays within the EU, with UK South/West as primary for UK tenants.

When is a self-hosted private AI actually necessary?

When a client contract forbids third-party AI processing, when a regulator demands physical-location evidence, when handling legally privileged material, or when your IP is the asset. Most SMBs don't need it; some can't operate without it.

How much does a private AI deployment cost for a UK SMB?

Typical build for a 25–100 user private AI on Azure UK is £15,000–£40,000; monthly run costs £400–£2,000. On-premise appliance builds run £25,000–£60,000 plus electricity and maintenance.

Which option is best for a London law firm?

Most 5–50 fee-earner firms start with Copilot for M365 for general productivity, plus Azure OpenAI in their own tenant for privileged contract review. Client data stays within UK/EU, ring-fenced from training, and fully audited.

Not sure whether private AI is a pitch or a real requirement for your firm?

Free 30-minute strategy call. I'll walk you through the decision flow with your actual contracts and client mix — no slide deck, no sales pitch.

Free 30-min Strategy Call

Purpose	Data Used
Responding to enquiries & providing consultations	Name, email, phone, message
Delivering agreed IT services	Name, email, company details
Improving our website experience	Analytics, cookies
Legal & regulatory compliance	As required by law
Fraud prevention & site security	IP address, usage data

Type	Purpose	Required?
Essential	Cookie & theme preferences. Required for site functionality.	Always active
Analytics	Understanding visitor behaviour to improve the site.	Consent required

Private AI vs ChatGPT Enterprise: what UK SMBs need to know about data sovereignty

The four real options in 2026

What "your data stays yours" actually means

The case for sticking with a commercial provider

The case for going private

What a private AI actually looks like

1. Azure OpenAI in your own tenant

2. Open-weight models on Azure / AWS UK regions

3. On-premise appliance

The honest trade-offs of going private

A decision flow you can actually use

Frequently asked questions

Does ChatGPT Enterprise train on my data?

Is Microsoft Copilot data kept in the UK?

When is a self-hosted private AI actually necessary?

How much does a private AI deployment cost for a UK SMB?

Which option is best for a London law firm?

Keep reading

Microsoft Copilot rollout for London SMBs: how to deploy without leaking client data

Cyber Essentials Plus for London law firms and finance SMBs

Not sure whether private AI is a pitch or a real requirement for your firm?