// PII Crawler vs Microsoft Purview

A scanner you own, not a tenant feature you license.

Microsoft Purview is the data governance and compliance suite built into Microsoft 365 and Azure: classification, sensitivity labels, DLP, retention, and eDiscovery, administered from the cloud and licensed through your tenant. PII Crawler is a standalone binary that scans files, shares, and databases on any machine, with no tenant and no cloud. They overlap on PII detection. They diverge on what you have to buy into.

Last reviewed May 2026 · based on publicly available information.
PII Crawler
  • $497 one-time license, perpetual
  • Single binary · Mac · Windows · Linux
  • No tenant · no Azure subscription · no E5
  • Air-gapped · 0 B outbound during scan
  • Files, network shares, SQL databases
Try free → no signup
Microsoft Purview
  • Licensed via Microsoft 365 E5 / E5 Compliance
  • Azure consumption for Data Map scanning
  • Cloud-administered · tied to your tenant
  • Classification, labels, DLP, retention, eDiscovery
  • Strongest inside the Microsoft estate (M365, Azure)
Public marketing as of May 2026.
// the fundamental difference

One governs the Microsoft estate. The other scans whatever you point it at.

Purview was built to govern and protect data across Microsoft 365 and Azure: sensitivity labels that travel with Office documents, DLP enforced in Exchange, SharePoint, OneDrive, Teams, and on Windows endpoints, plus retention, eDiscovery, audit, and a compliance-posture dashboard. It's administered from the cloud, and its licensing rides on Microsoft 365 (typically E5) and Azure consumption. If your organization lives in Microsoft 365, it's the native answer.

PII Crawler answers a narrower, ecosystem-agnostic question: "Where is PII sitting on these files, shares, and databases — including the Linux server, the legacy NAS, the Postgres instance, and the air-gapped subnet — and can I get an answer without a tenant or a cloud?"

If your data and your stack are Microsoft, Purview's native integration is hard to beat. If you need to scan the systems Purview doesn't reach cleanly, or you'd rather not depend on a tenant and a licensing tier to find PII, PII Crawler is the simpler tool.

// side by side

How they compare on the things that matter to a buyer.

PII Crawler
Microsoft Purview
Cost & commitment
Pricing model
$497 one-time, perpetual license
Bundled into Microsoft 365 E5 / E5 Compliance · Azure consumption for Data Map
Renewals
None — the binary is yours
Tied to your Microsoft 365 subscription
Cost as you grow
Flat · unlimited users & scans
Per-user licensing + per-scan Azure consumption
Already paying for it?
n/a · standalone purchase
Yes, if you already license E5 / E5 Compliance
Deployment & data flow
Architecture
Single signed binary · no agent · no daemon
Cloud-administered service in your Microsoft tenant
Microsoft tenant / Azure subscription required
No · runs entirely standalone
Yes · it is part of Microsoft 365 / Azure
Where data is processed
On the machine running the scan
In your Microsoft cloud tenant
Air-gapped capable
Yes · 0 B outbound during scan
No · cloud-administered by design
Time to first scan
Under a minute
Configure portal, policies, roles, and scan sources first
Remote / isolated machine workflow
scp binary · ssh · TUI · no cloud needed
Source must be reachable and registered with the tenant
Discovery coverage
Local files (PDF, Office, CSV, archives)
Yes · with OCR
Via registered sources, not arbitrary local disk
Network shares (SMB / NFS)
Yes
Limited · via Data Map source registration
SQL databases
Postgres / MySQL / SQL Server · sampled in memory
Azure SQL & registered sources via Data Map
Microsoft 365 (Exchange, SharePoint, OneDrive, Teams)
Not yet · use database / export workflows
Yes · native, a core strength
Cross-platform standalone (Linux, Mac, non-MS stacks)
Yes · no Microsoft dependency
Centered on the Microsoft ecosystem
Detection approach
Regex + NER (en_core_web_lg) · 30+ PII types
Sensitive Information Types + trainable classifiers
Governance & compliance
Sensitivity labels & persistent classification
No · point-in-time findings export
Yes · labels travel through Office apps
DLP enforcement across M365
No · reports findings, you act on them
Yes · Exchange, SharePoint, Teams, endpoints
Retention / records management
No
Yes · data lifecycle management
eDiscovery, audit, insider risk
No
Yes · part of the suite
Compliance-posture dashboard
CSV / JSON exports out of the box
Yes · Compliance Manager
Operations & integration
CI/CD integration
CLI emits JSON / CSV · --exit-code-on flag fails builds
Possible via Graph API; not the primary motion
Reports for GDPR / HIPAA / CCPA
CSV / JSON exports out of the box
Prebuilt assessments & reporting
Support
Email · fast · founder-led
Microsoft support plans · partner ecosystem
Trust
Source of compliance evidence
Verifiable on your own host (tcpdump the binary)
Microsoft attestations · Service Trust Portal
If you stop paying
Binary keeps working forever
Purview features turn off with the subscription
Comparisons reflect publicly available information about Microsoft Purview as of May 2026, plus our own product. Microsoft and Microsoft Purview are trademarks of the Microsoft group of companies. PII Crawler is not affiliated with Microsoft.
// pick the right one

We genuinely think one of these is wrong for you.

Pick Microsoft Purview if
  • Your data and workflows live in Microsoft 365 and Azure, and you want native classification and labels that travel through Office apps.
  • You need DLP enforced across Exchange, SharePoint, OneDrive, Teams, and Windows endpoints.
  • You need retention, records management, eDiscovery, audit, or insider risk as part of one compliance program.
  • You already license Microsoft 365 E5 (or E5 Compliance) and want to use what you're paying for.
  • You want a compliance-posture dashboard (Compliance Manager) across Microsoft services.
  • You have an admin team to configure the Purview portal, policies, and roles.
Pick PII Crawler if
  • You need to scan files, shares, and databases that aren't in Microsoft 365 — Linux servers, legacy NAS, Postgres/MySQL, Mac machines.
  • You don't have (or don't want to depend on) an E5 license or Azure subscription just to discover PII.
  • Your security review says nothing sensitive leaves the network, and you need an air-gapped scan a cloud-administered service can't do.
  • You want a one-time price you can expense, not licensing tied to per-user M365 tiers and Azure consumption.
  • You need an answer this week without configuring a tenant, policies, and roles.
  • You want PII checks embedded in CI/CD, or a quick standalone audit on a single box.
Try PII Crawler free → no signup
// FAQ

Questions buyers ask us about Microsoft Purview.

Purview is excellent for data that lives inside Microsoft 365 and Azure. PII Crawler is for the data it doesn't reach cleanly: non-Microsoft file servers, Linux boxes, legacy NAS, arbitrary SQL databases, and air-gapped subnets — plus fast standalone scans without configuring Purview's scanning or paying Azure Data Map consumption. Many teams use both.
No — that's Purview's home turf, and it does it natively. PII Crawler scans local files, network shares, and SQL databases. For PII inside Microsoft 365 content, Purview is the right tool.
No. PII Crawler is a standalone binary with no tenant, no cloud, and no subscription. That's the point: you can scan a machine that has nothing to do with Microsoft, including one with no internet access at all.
No. PII Crawler reports where PII is and exports the findings; it doesn't classify-and-label documents or enforce DLP policies. If you need labeling that travels through Office apps and DLP enforced across Microsoft services, Purview is built for exactly that.
Yes — that's a design goal. PII Crawler has no cloud dependency, so it runs on isolated subnets and air-gapped hosts. Purview is cloud-administered through your Microsoft tenant, so it can't scan an environment cut off from Microsoft's cloud.
Yes, and they complement each other well. Run Purview for the Microsoft estate — M365 content, labeling, DLP, retention — and reach for PII Crawler for everything outside it: non-Microsoft servers, databases, and air-gapped environments. The CSV / JSON exports drop cleanly into a broader workflow.
// the math

$497 once. Not a per-user E5 tier.

Microsoft Purview (typical)
E5+ /user/mo
bundled into Microsoft 365 E5 + Azure consumption
×Requires Microsoft 365 E5 / E5 Compliance licensing
×Azure consumption billing for Data Map scans
×Per-user pricing scales with headcount
×Tied to your tenant · cloud-administered
PII Crawler vs · $200 OFF
$497 $697 once
paid for itself the day you ran it
Unlimited users · machines · scans
No tenant · no Azure · no E5 required
Air-gapped on your hardware
Mac · Windows · Linux + CLI + TUI
First scan in under 60 seconds
Buy license → $497
14-day refund · no questions asked
// download

Run it on a real share before you decide.

Full trial. No credit card. Runs on your laptop or server.
macOS
darwin-arm64
piicrawler-cli-macos-arm.zip
Download ↓
Windows
win-x64 · signed
piicrawler-cli-windows-signed.zip
Download ↓
Linux
linux-x64
piicrawler-cli-linux.tar.gz
Download ↓