North Ark
← All articles

AI on your data · 12 July 2026 · 8 min read

How to connect your company documents to AI without leaking data

Staff want ChatGPT on company files. IT wants no oversharing. The safe path is permissions-aware retrieval, tenant readiness, and phased rollout — not blocking AI until competitors pass you.

Shadow AI happened because blocking public ChatGPT did not give staff a safe alternative on company data. The fix is not another policy PDF — it is an approved path: classified corpora, Entra-scoped access, logging, and training on what not to paste into public tools.

North Ark implements that path with FC-19/FC-27 readiness first, then FC-26 RAG or FC-25 agents when the use case is clear.

Map data before models

List libraries, shares, and retention classes. Remove or archive stale open shares. Apply sensitivity labels where they reflect reality.

Define which corpora are in scope for AI — not "all of SharePoint."

Permissions follow the user

Retrieval must filter by user identity — same as opening a file in SharePoint. Test with accounts at different role levels; attempt to retrieve HR content as a standard user and confirm denial.

Service accounts for indexing need read-only scope on in-scope libraries only.

Logging and human escalation

Log prompts and retrieved sources for audit. Escalate low-confidence answers to humans. Ban pasting client PII into unapproved tools — give an approved internal assistant instead.

Fixed-price entry points

FC-27 combines Copilot readiness and M365 security baseline in one SOW. FC-07 orders Copilot vs RAG vs automation. See AI on your data and Shadow AI checklist for self-assessment.

Frequently asked questions

Is it safe to use ChatGPT on internal docs?
Public ChatGPT uploads leave your control. Use enterprise Copilot, private Azure OpenAI, or a deployed RAG assistant with contract and data residency terms that match your obligations — not consumer tools for confidential files.
How long until staff can use an internal assistant?
Readiness: one week (FC-19/FC-27). Pilot RAG or agent: 4–6 weeks (FC-26/FC-25). Trying to skip readiness usually delays launch when security finds overshared libraries mid-build.
What about client confidential data?
Exclude client matter libraries until labelled and permissioned. Many firms run assistants on internal SOPs and templates first — not full matter files — until governance sign-off.

Next step

connect documents to AI safely — ready to act?

Copilot + security pack — $4,990

Or book a free fit call