Every cloud DLP tool wants to ingest your files to protect them. PII Crawler scans your files, network shares, and databases on your own hardware — surfacing the SSNs, names, and card numbers hiding where SaaS tools can’t reach. Outbound traffic during a scan: 0 bytes.
PII leaks into places you never put it. Every SaaS scanner marketed at you wants to ingest your files into its cloud to find PII — the opposite of what your security team signed up for.
Pattern matching layered with named-entity recognition, reading inside the places PII hides: slide XML, spreadsheet cells, archives, and the images embedded in PDFs. Every match shows the text around it, so you can judge it without opening the file.
Layered detection keeps your report from being 80% false positives. But a real scan still surfaces matches that look like PII and aren’t — test fixtures, sample data, a vendor SDK. Dismiss them at the right scope and they stay dismissed: verdicts are saved with the scan, so a rerun never undoes your review.
123-45-6789 → fp · all 400 occurrences cleared
vendor/ → ignored, recursively
piicrawler export 42 --exclude-fp
Decommissioned laptops, server disks, and backup drives walk out the door full of customer records. A factory reset or quick format isn’t proof. Scan each disk before it goes to the recycler or resale, surface any PII a wipe missed, and keep the report as your record of disposal.
piicrawler scan /mnt/decom-disk-04 --out before.csv
shred / blancco / drive destruction
piicrawler scan /mnt/decom-disk-04 --out after.csv
No agents to install, no daemons to register. scp the single static binary to any host, ssh in, and run ./piicrawler — start a scan and review the results without leaving your terminal. Or skip the UI entirely: ./piicrawler scan / > report.json.
That's not a marketing claim. Run it offline and watch the network counter stay at zero.
PII Crawler is built by Eligian Labs — an independent team, now shipping the product’s second generation, and the same people who answer [email protected]. There’s no VC and no data business behind it: the revenue is the license, so the product has no reason to phone home. If it finds something sensitive, we never know.
“I appreciate your ongoing development of the product and super fast support!” — Jon S., customer
$497 to find the PII first. The average data breach costs $4.44M (IBM, 2025) — and it starts with a file nobody knew was there.
Nothing leaves your network. No account, no credit card. The download is the full product for 7 days.
tar -xzf piicrawler-cli-linux.tar.gz && sudo mv piicrawler /usr/local/bin/
piicrawler demo
piicrawler scan ~/share --out report.csv