
Frequently Asked Questions

40 questions about the legal document redaction suite, answered with data.

Zero-Knowledge Authentication

How do I verify a SaaS vendor uses true zero-knowledge encryption and cannot access my data?

Argon2id key derivation runs entirely in the browser/app (64MB memory, 3 iterations). AES-256-GCM encryption happens before any data leaves the device. The server never receives the plaintext password or the derived encryption key. Even a full anonym.legal server breach would yield only encrypted blobs without the keys to decrypt them. Example: A compliance officer at a German health insurer needs to process patient complaint logs using a cloud anonymization tool. GDPR Article 32 requires appropriate technical measures. The insurer's DPO will not approve any tool that transmits unencrypted PII or holds encryption keys server-side. Zero-knowledge architecture removes this blocker from the vendor assessment process entirely.
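The client-side flow above can be sketched in a few lines. This is an illustrative model only: it uses `hashlib.scrypt` from the Python standard library as a stand-in for Argon2id (which has no stdlib implementation), and all names and parameters are assumptions, not anonym.legal's actual code.

```python
import hashlib
import os

def derive_key(password: str, salt: bytes) -> bytes:
    # Memory-hard KDF runs on the client. The product described above uses
    # Argon2id (64MB memory, 3 iterations); stdlib scrypt stands in here.
    return hashlib.scrypt(password.encode(), salt=salt,
                          n=2**14, r=8, p=1, dklen=32)

salt = os.urandom(16)   # random per-account salt; safe to store server-side
key = derive_key("correct horse battery staple", salt)

# The server only ever sees the salt and ciphertext produced with `key`;
# the password and the derived key never leave the device.
assert derive_key("correct horse battery staple", salt) == key  # deterministic
assert derive_key("wrong password", salt) != key
```

Because the derivation is deterministic, the same password and salt always reproduce the same key on the device, so the server can store ciphertext without ever being able to derive the key itself.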

My company processes PHI. Can we use cloud anonymization tools, or do we need on-premise only?

Zero-knowledge design means original text is never stored on anonym.legal servers. European data storage (Hetzner EU data centers). The tool processes anonymization logic without retaining the source documents. This removes the primary blocker for HIPAA-covered entity adoption. Example: A hospital system's IT security team is evaluating tools for clinical documentation anonymization before sharing with a research partner. The HIPAA Privacy Officer needs to demonstrate compliance under 45 CFR 164.514. anonym.legal's zero-knowledge architecture means the BAA covers a tool that provably cannot expose PHI.

SaaS breaches are up 300%. How can I trust any cloud tool with PII?

Zero-knowledge architecture means a full anonym.legal server compromise provides attackers with AES-256-GCM ciphertext without the keys to decrypt it. Combined with EU-based data storage and ISO 27001 controls, this provides the strongest possible breach impact minimization. Example: A CISO at a German insurance company is reviewing their 2025 vendor risk posture after the industry-wide SaaS breach surge. They require all PII-handling vendors to demonstrate cryptographic data isolation. anonym.legal's zero-knowledge design is included in the approved vendor list specifically because a server breach cannot expose policyholder data.

How do I know the PII anonymization tool I'm using isn't storing my sensitive data on their servers where it could be breached?

Argon2id (64MB memory, 3 iterations) key derivation runs entirely in the browser/desktop client. The derived AES-256-GCM key never leaves the device. anonym.legal servers receive only encrypted ciphertext and cannot decrypt it even with full database access. 24-word BIP39 recovery phrase enables key recovery without server involvement. Example: A CISO at a German health insurer is evaluating anonymization tools for GDPR compliance. Their procurement checklist requires proof that the vendor cannot access patient data. anonym.legal's zero-knowledge architecture satisfies Article 25 (Privacy by Design) and allows the CISO to tell the DPA: "even if the vendor is breached, our data is cryptographically inaccessible."

After the LastPass breach, can I trust any cloud service with my company's sensitive data?

Zero-knowledge authentication with open architecture documentation. The 24-word BIP39 recovery phrase is the only way to restore access, meaning even anonym.legal staff cannot reset accounts or access user data. Session management with remote logout prevents persistent access after device loss. Example: A CISO at a 500-person law firm is reviewing vendor security after their password manager vendor suffered a breach. They need to demonstrate to their malpractice insurer that all tools handling client data use verified zero-knowledge architecture. anonym.legal's client-side encryption approach allows the CISO to demonstrate that even a complete server compromise would not expose client communication data.
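The BIP39 mechanism referenced above is a published standard: the mnemonic phrase is stretched into a 64-byte seed with PBKDF2-HMAC-SHA512, with no server round-trip. A minimal stdlib sketch follows; the phrase here is a placeholder for illustration, not a checksum-valid 24-word mnemonic.

```python
import hashlib
import unicodedata

def bip39_seed(mnemonic: str, passphrase: str = "") -> bytes:
    # Per the BIP39 spec: seed = PBKDF2-HMAC-SHA512(mnemonic,
    # "mnemonic" + passphrase, 2048 iterations, 64 bytes).
    norm = lambda s: unicodedata.normalize("NFKD", s).encode()
    return hashlib.pbkdf2_hmac("sha512", norm(mnemonic),
                               norm("mnemonic" + passphrase), 2048, dklen=64)

# Placeholder phrase (a real recovery phrase is 24 checksum-valid words).
phrase = ("abandon ability able about above absent absorb abstract "
          "absurd abuse access accident")
seed = bip39_seed(phrase)

assert len(seed) == 64
assert bip39_seed(phrase) == seed       # same phrase always restores the same seed
assert bip39_seed(phrase, "x") != seed  # an optional passphrase changes the key
```

This is why staff cannot reset an account: the seed, and every key derived from it, exists only where the phrase is entered.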

How do I pass a security questionnaire for a vendor that handles our sensitive documents?

Zero-knowledge authentication + ISO 27001 certification provides the strongest possible answer to vendor security questionnaire (VSQ) encryption questions. anonym.legal can truthfully state that server compromise yields no usable plaintext data. Example: A Fortune 500 financial services company is adding anonym.legal to their approved vendor list. Their vendor risk team sends a 150-question security questionnaire. The zero-knowledge architecture allows the anonym.legal team to answer encryption, key management, and data access questions definitively, shortening the approval cycle from months to weeks.

How do we pass vendor security assessments faster without sharing our encryption architecture documentation every time?

ISO 27001 certification provides the baseline framework. Zero-knowledge architecture documentation answers the specific question of server-side data access. DPIA completion satisfies GDPR Article 35 requirements. The combination dramatically shortens procurement cycles for regulated industries. Example: A procurement officer at a Fortune 500 financial services firm needs to onboard an anonymization tool for their data science team within Q4. anonym.legal's ISO 27001 certificate + zero-knowledge architecture documentation + completed security questionnaire template allows the CISO to approve the vendor without a full custom assessment, saving 6-8 weeks.

Office Add-in (Word & Excel)

The DOJ's Epstein files showed that PDF black-box redaction can be reversed with copy-paste. Are Word documents safer?

Office Add-in performs true PII replacement within the Word document itself. Text is permanently replaced with tokens, redacted marks, or anonymized placeholders. The original text is not hidden; it is gone from the document. Formatting (fonts, styles, bold, italic) is preserved. Headers, footers, and comments are processed. Full undo support for iterative review. Example: A government agency's legal team must produce 3,000 documents in response to a litigation hold. Previous productions using PDF black-highlighting were challenged when opposing counsel discovered the highlighting was reversible. anonym.legal's Word Add-in is deployed for the document review team. True text replacement ensures no underlying data remains. The production withstands forensic examination.

Our legal team spends 2-3 days manually redacting Word documents for each discovery production. Is there a faster way?

Word Add-in works natively inside Microsoft Word, with no conversion required. Preserves all formatting: fonts, styles, bold, italics, tables, headers, footers, footnotes, and comments. Supports per-entity operator configuration (different handling for names vs. SSNs vs. dates). Full undo support for iterative review. Reduces 2-3 days of manual work to hours. Example: A litigation boutique law firm handles 15 major matters annually, each requiring productions of 5,000-50,000 documents. Manual redaction was costing $400,000/year in paralegal and associate time. anonym.legal's Word Add-in reduces redaction time by 85%, saving $340,000 annually. The attorneys retain control through the review and approval workflow.

We need to anonymize Excel spreadsheets with 100,000 rows of employee data. Does existing redaction software handle structured data?

Excel Add-in processes spreadsheets natively. Cell-level PII detection across all visible and hidden sheets. Handles up to 100,000 rows, depending on plan. Preserves spreadsheet structure and formulas. Per-entity configuration allows different handling for names (replace with pseudonym) vs. SSNs (replace with X's) vs. phone numbers (mask with partial display). Example: A German manufacturing company's HR department must share 50,000 employee records with an external compensation consultant. GDPR requires anonymization before sharing with third parties. The Excel file contains 37 columns including names, salaries, addresses, and performance ratings. anonym.legal's Excel Add-in processes the full dataset in minutes, anonymizing all PII fields while preserving the spreadsheet structure for analysis.
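Per-entity operator configuration can be pictured as a table mapping each entity type to its own transformation. The sketch below uses toy regex detectors in place of the real NLP engine (name detection, for instance, needs more than a regex); the operator functions mirror the replace-with-X's and partial-mask behaviors described above.

```python
import re

# Per-entity operator table: each entity type gets its own handling.
# The regexes are illustrative stand-ins for the real detection engine.
OPERATORS = {
    "SSN":   (re.compile(r"\b\d{3}-\d{2}-\d{4}\b"), lambda v: "XXX-XX-XXXX"),
    "PHONE": (re.compile(r"\b\d{3}-\d{3}-\d{4}\b"), lambda v: "***-***-" + v[-4:]),
}

def anonymize_cell(value: str) -> str:
    # Apply every configured operator to a single cell value.
    for rx, op in OPERATORS.values():
        value = rx.sub(lambda m, op=op: op(m.group(0)), value)
    return value

row = ["Jane Doe", "123-45-6789", "555-867-5309", "Engineering"]
assert [anonymize_cell(c) for c in row] == \
    ["Jane Doe", "XXX-XX-XXXX", "***-***-5309", "Engineering"]
```

Non-PII cells pass through unchanged, which is what preserves the spreadsheet's analytical structure.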

How do I redact sensitive data in Word documents without destroying the formatting?

Word Add-in works natively inside Microsoft Office. No export or conversion. Formatting is preserved at the paragraph, character, and style level. Bold names remain bold after anonymization. Table structures are preserved. Headers and footers are processed without disrupting page layout. The result is a properly formatted document ready for immediate use. Example: A UK law firm specializing in employment tribunals must produce witness statements with names and identifying information anonymized per court order. Previous attempts using PDF redaction tools destroyed the document formatting, requiring manual reconstruction. anonym.legal's Word Add-in preserves formatting exactly: the anonymized statement looks professionally formatted and is court-ready without additional work.

FOIA requests requiring redaction of thousands of Word documents are creating backlogs. What automation tools help?

Office Add-in processes Word documents natively with automation support. Batch processing (1-5,000 files via Desktop App) enables volume handling. Per-entity configuration allows agency-specific redaction rules (FOIA exemption B6 for personal information, B7 for law enforcement). Presets allow FOIA staff to apply consistent configurations across the entire request. Example: A federal agency's FOIA office receives a request for 8,000 Word documents related to a policy decision. With 5,638 FOIA staff processing 1.5 million requests annually (about 266 requests per staff member per year), each staff member has roughly one day per request. anonym.legal's batch-capable Word Add-in processes all 8,000 documents in hours, with human review focused on edge cases rather than every document.

What Word redaction tools preserve styles, tables, and tracked changes during PII removal?

The Office Add-in operates directly within the Word document object model; no conversion to intermediate format. PII entities are detected in text runs, paragraphs, headers, footers, footnotes, and comments. Anonymization is applied in-place with full formatting preservation. Ctrl+Z undo reverts any change. This is architecturally distinct from all redaction tools that work at the rendered-document level. Example: A partner at a 50-person law firm needs to redact a 200-page merger agreement before sharing with regulatory authorities. The document contains 15 defined terms that include party names, 47 cross-references to those defined terms, and tables with financial figures linked to party identities. anonym.legal's Office Add-in detects all name instances (including in defined term contexts), applies consistent pseudonymization, and preserves all formatting, reducing a 6-hour manual redaction task to 15 minutes.

How do I anonymize PII in Excel spreadsheets that have thousands of rows of customer data without losing the structure?

The Office Add-in processes Excel at the cell level, supporting up to 100,000 rows and 20MB files. Per-entity operator configuration allows different handling for different entity types within the same spreadsheet. The full undo capability allows recovery if a formula column is accidentally flagged. Example: A data analyst at a retail company preparing customer purchase history for an external marketing analytics vendor. The 50,000-row Excel file contains customer names, emails, and loyalty IDs alongside purchase amounts and product categories. anonym.legal's Excel add-in replaces names and emails with pseudonyms while hashing loyalty IDs for referential integrity, allowing the analytics vendor to track behavior patterns without accessing real identities.

Desktop Application (Offline Processing)

We have air-gapped workstations for classified work. Is there a PII anonymization tool that works completely offline?

Desktop App built on Tauri 2.0 + Rust processes everything locally. After initial installation, no internet connection is required. All NLP models are embedded. The encrypted local vault stores configuration and presets. No data leaves the device at any point. Available on Windows, macOS, and Linux. Example: A defense contractor processing ITAR-controlled technical documents needs to anonymize them before sharing with a foreign partner under a license exception. All processing must occur on cleared workstations with no internet access. anonym.legal's Desktop App is installed on the air-gapped workstations, processes the documents locally, and produces ITAR-compliant anonymized outputs without any network connectivity.

GDPR data sovereignty rules say our data can't leave Germany. How do we use cloud tools without violating this?

Desktop App processes all data locally. Nothing leaves the device. For organizations that also need cloud features, anonym.legal's web platform uses EU-based Hetzner data centers with zero-knowledge architecture. The Desktop App serves organizations with the strictest local-only requirements. Example: A German federal government agency must anonymize citizen complaint data before sharing with an external research institute. BfDI guidance prohibits processing on non-government infrastructure. anonym.legal's Desktop App runs on agency workstations: all processing is local, no data traverses external networks, and the audit log is maintained in the local encrypted vault.

Our hospital's cybersecurity team won't approve any cloud-based PHI processing tools. What desktop alternatives exist?

Desktop App provides cloud-quality anonymization (Presidio-based NLP with 48 languages and 260+ entity types) in a locally-installed application. No cloud connectivity required. Healthcare-specific entity types (MRN, NPI, DEA, health plan IDs) included. All 18 HIPAA Safe Harbor identifiers supported. Example: A mid-size regional hospital's clinical informatics team wants to create a research-ready dataset from their EHR. The CISO refuses to approve cloud processing of PHI. anonym.legal Desktop App is deployed on clinical informatics workstations. The team processes de-identified notes locally with the same accuracy as cloud tools, satisfying both security requirements and research quality requirements.

We need to batch-process 5,000 documents locally without uploading them to any cloud. Is that possible?

Desktop App batch processing supports 1-5,000 files per batch depending on plan. Parallel execution (1-5 concurrent files) for throughput. Mixed format support in a single batch. ZIP packaging for processed files. CSV/JSON export with processing metadata. Progress tracking and error handling. Example: A clinical research organization is building a de-identified dataset from 50,000 patient consultation notes. The hospital's IRB requires that processing occur on-site. anonym.legal's Desktop App processes the notes in 10 batches of 5,000, running overnight. The next morning, 50,000 de-identified files and a processing metadata log are ready for transfer to the research team.

How do I anonymize documents on a trading floor where data cannot leave the internal network?

Desktop App works completely offline after installation. Finance-specific entity types (IBAN, SWIFT, BIC, account numbers, routing numbers, cryptocurrency addresses) are pre-built. Batch processing handles volume. Encrypted local vault stores configurations and presets securely on-device. Example: A proprietary trading firm's compliance team must submit anonymized trade reports to a financial regulator. Reports contain client account numbers, trader names, and position sizes. All workstations have external internet blocked. anonym.legal's Desktop App processes reports locally, replaces client IDs with tokens, and produces regulator-ready outputs without external connectivity.

We have a fully air-gapped network and cannot use any cloud-based tools. What PII anonymization options exist for air-gapped deployments?

The Tauri 2.0-based Desktop Application runs entirely offline after download. No network calls are made during processing. The local encrypted vault (AES-256-GCM + Argon2id) stores configurations and encryption keys without cloud sync. Batch processing supports 1-5,000 files depending on plan tier. All processing occurs on local hardware; no data ever leaves the device. Example: A data scientist at a defense contractor needs to de-identify personnel records before sharing with a FOIA-requesting journalist. The contractor's network is air-gapped under ITAR requirements. anonym.legal's Desktop App runs on the air-gapped machine, processes the DOCX files in batch, and produces redacted documents, all without any external network communication.

Our legal team says patient data cannot leave our premises under any circumstances. What tools work completely locally?

The Desktop Application architecture (Tauri 2.0 + Rust) has been independently verified to make no network calls during document processing. The local vault stores all configuration and keys. Processing via the Presidio sidecar runs entirely on the local machine. This architecture can be verified by network monitoring tools during security assessment. Example: A compliance officer at a Swiss private bank needs to anonymize client correspondence before sharing with an external auditor. Swiss banking secrecy law (Article 47 Banking Act) prohibits disclosure of client information to unauthorized parties, including cloud service providers not covered by explicit consent. anonym.legal's Desktop Application processes the correspondence locally, producing anonymized documents that can be safely shared with the auditor without triggering banking secrecy obligations.

Reversible Encryption (UNIQUE Tokens)

We anonymized documents for sharing, but now legal needs the originals for discovery. How do we get them back?

AES-256-GCM reversible encryption preserves the mathematical relationship between the anonymized token and the original value. With the client-held encryption key, any anonymized document can be fully restored to its original content. Without the key, the anonymized version is computationally indistinguishable from a permanently redacted document. Legal teams share encrypted versions; produce originals when required using the retained key. Example: A pharmaceutical company shares clinical trial data with external statisticians using anonym.legal's encrypted anonymization. Two years later, the FDA requests original patient records as part of a drug safety review. The company restores the original data using their retained encryption key: no spoliation, no missing records, full regulatory compliance. The statisticians' encrypted copies remain protected throughout.

We de-identified patient data for research, but now need to contact specific patients based on research findings. How?

Reversible encryption creates a protected pseudonymization layer. The research dataset uses encrypted tokens. The decryption key is held by the designated data custodian. When re-contact is clinically justified and IRB-approved, the custodian decrypts the specific participant records to enable follow-up. The broader dataset remains protected; only the specific authorized decryption is performed. Example: A European oncology research center conducts a 5,000-patient study using anonym.legal's encrypted anonymization. Mid-study analysis reveals a subgroup of 47 participants showing markers for an aggressive cancer variant. The ethics committee approves re-contact. The data custodian uses the retained encryption key to identify the 47 real patients. Those patients are contacted; 23 are found to have actionable findings. The remaining 4,953 participants' data remains fully protected.

We anonymized documents to share with outside counsel, but now we need to produce the originals in discovery. How do we recover the original data?

Reversible encryption using AES-256-GCM generates deterministic encrypted tokens from original PII. The key is held only by the user. "John Smith" becomes "[ENC:x9f3a...]" consistently throughout the document, maintaining referential integrity. When authorized de-anonymization is needed (discovery production, audit verification, research follow-up), the user applies their key and all tokens restore to originals. The Chrome Extension auto-decrypts AI responses, so working with encrypted data is transparent in the AI workflow. Example: A compliance officer at a pharmaceutical company shares clinical trial data with a contract research organization (CRO). All patient identifiers are encrypted with a company-held key. The CRO analyzes anonymized data. When the FDA requests original patient records for audit, the compliance officer applies the key and produces the originals in minutes, with a cryptographic audit trail proving chain of custody.
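Deterministic, reversible tokenization can be illustrated with a toy SIV-style keyed codec. This is not anonym.legal's implementation and omits the authentication tag a real AES-256-GCM construction provides; it only demonstrates the two properties the answer relies on: determinism (the same value always yields the same token) and key-holder-only reversal.

```python
import base64
import hashlib
import hmac

def _keystream(key: bytes, nonce: bytes, length: int) -> bytes:
    # Expand an HMAC-SHA256 counter into a keystream of the needed length.
    out, counter = b"", 0
    while len(out) < length:
        out += hmac.new(key, nonce + counter.to_bytes(4, "big"),
                        hashlib.sha256).digest()
        counter += 1
    return out[:length]

def tokenize(key: bytes, value: str) -> str:
    pt = value.encode()
    # SIV-style: derive the nonce from the plaintext so tokens are deterministic.
    nonce = hmac.new(key, pt, hashlib.sha256).digest()[:12]
    ct = bytes(a ^ b for a, b in zip(pt, _keystream(key, nonce, len(pt))))
    return "[ENC:" + base64.urlsafe_b64encode(nonce + ct).decode() + "]"

def detokenize(key: bytes, token: str) -> str:
    raw = base64.urlsafe_b64decode(token[5:-1])  # strip "[ENC:" and "]"
    nonce, ct = raw[:12], raw[12:]
    return bytes(a ^ b for a, b in zip(ct, _keystream(key, nonce, len(ct)))).decode()

key = b"\x01" * 32
t1 = tokenize(key, "John Smith")
assert t1 == tokenize(key, "John Smith")      # deterministic: same value, same token
assert detokenize(key, t1) == "John Smith"    # reversible only with the key
```

Determinism is what preserves referential integrity across a document, and the key-only reversal is what makes later discovery production possible without retaining plaintext copies.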

Our external auditors need to verify the original data behind our redacted financial reports. How do we handle this?

Reversible encryption allows selective de-anonymization. The finance team shares encrypted anonymized reports. Auditors working under formal engagement can be given decryption capability for their audit period. After audit completion, the key can be rotated: previous encrypted copies remain protected, and auditors cannot retroactively access records outside their engagement. Example: A private equity firm shares portfolio company financial data with an external audit firm for annual review. Client company names and deal terms are encrypted before sharing. During audit, the engagement partner receives temporary decryption access for the audit period. After the audit opinion is issued, key rotation removes that access. Former employees of the audit firm cannot access the data after their tenure.

Anonymous employee surveys revealed a serious harassment allegation. We need to follow up but can't identify who filed it. What should we do?

Reversible encryption allows HR to run "conditionally anonymous" surveys. Responses are encrypted before storage. The decryption key is held by a designated HR executive (or third-party ombudsman). When a response contains a serious allegation meeting predefined criteria (e.g., physical harassment, legal violations), the authorized party can decrypt that specific response to identify the reporter and initiate formal investigation. Example: A 2,000-employee manufacturing company's annual culture survey captures an allegation of serious misconduct by a senior executive. The response is encrypted. The company's third-party ombudsman reviews the allegation and determines it meets the threshold for de-anonymization under the company's published survey policy. The ombudsman decrypts the specific response, contacts the reporter through a formal protected channel, and initiates an independent investigation. All other responses remain permanently anonymized.

We use AI to process customer queries but need to restore original names for the final response. How does token mapping work across AI interactions?

Session-based token mapping maintains consistent anonymization within a conversation. The same customer name always maps to the same token within a session. Auto-decrypt in Chrome Extension responses restores real names in AI outputs before display. Persistent token mapping is also available for longer-lived workflows. Example: A German insurance company's AI-powered claims processing system processes customer complaint emails. Customer names, policy numbers, and claim amounts are anonymized before Claude processes the emails. Claude drafts a response using the anonymized tokens. anonym.legal's auto-decrypt restores original customer information in Claude's draft before it is displayed to the claims handler. The handler sends the final response with real customer names. GDPR compliance is maintained throughout.
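Session-based token mapping itself is simple to picture: a per-session dictionary guarantees the same value always yields the same token, and the inverse map drives auto-decrypt on the AI's draft. A minimal sketch, with illustrative names and token format:

```python
class SessionTokenMap:
    """Per-session mapping: the same real value always gets the same token."""
    def __init__(self):
        self._fwd = {}   # real value -> token
        self._rev = {}   # token -> real value (drives auto-decrypt)
        self._n = 0

    def anonymize(self, value: str) -> str:
        if value not in self._fwd:
            self._n += 1
            token = f"[PERSON_{self._n}]"
            self._fwd[value] = token
            self._rev[token] = value
        return self._fwd[value]

    def restore(self, text: str) -> str:
        # Auto-decrypt: substitute tokens back before showing the AI draft.
        for token, value in self._rev.items():
            text = text.replace(token, value)
        return text

session = SessionTokenMap()
assert session.anonymize("Anna Weber") == "[PERSON_1]"
assert session.anonymize("Anna Weber") == "[PERSON_1]"  # consistent within session
draft = "Dear [PERSON_1], your claim has been approved."
assert session.restore(draft) == "Dear Anna Weber, your claim has been approved."
```

The AI only ever sees tokens; the inverse map never leaves the user's side of the session.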

We de-identified patient data for a research study. Now we need to re-contact participants for a follow-up. How do we identify them?

Reversible encryption generates consistent tokens (deterministic AES-256-GCM): "Patient_001" maps to the same encrypted token throughout all study records. The research team holds the key. Re-identification for follow-up requires the key holder to decrypt. All decrypt events are logged. This satisfies both the IRB requirement for controlled re-identification capability and the HIPAA Safe Harbor requirement for de-identified data sharing.

Multi-Format Document Support

PDF redaction is a specific problem: tools that just put a black box over text aren't truly redacting it; the text is still there in the PDF layer. How do we ensure true redaction?

PDF redaction removes detected PII from the document's text layer rather than just applying a visual overlay. The redacted output PDF contains no underlying text for the anonymized entities, only the visual redaction marks. This provides genuine, court-admissible redaction rather than cosmetic redaction. The difference is verifiable: a text extraction tool applied to an anonym.legal-redacted PDF will return empty strings for redacted regions. Example: A government agency's legal department was filing court documents with "redacted" PII that opposing counsel could extract via copy-paste, the same technique that exposed the DOJ Epstein documents. After discovering this vulnerability, they switched to anonym.legal for all court filing preparation. Verification protocol: every redacted document is text-extracted before filing to confirm no underlying PII remains. Zero copy-paste PII exposures since adoption.
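The verification protocol in the example reduces to a simple check: extract the PDF's text layer (with a tool such as pdftotext or a PDF library's text extraction, an assumption here) and confirm that no known PII string survives. A sketch of the check itself:

```python
def verify_redaction(extracted_text: str, pii_values: list[str]) -> list[str]:
    # Return any PII strings still recoverable from the extracted text layer.
    return [v for v in pii_values if v in extracted_text]

# Cosmetic redaction: a black box is drawn, but the text layer is intact.
cosmetic = "Agreement between John Smith and Acme GmbH"
# True redaction: the text layer no longer contains the entity at all.
true_redaction = "Agreement between [REDACTED] and Acme GmbH"

assert verify_redaction(cosmetic, ["John Smith"]) == ["John Smith"]  # fails review
assert verify_redaction(true_redaction, ["John Smith"]) == []        # passes
```

Running this check on every document before filing is what catches the copy-paste failure mode described above.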

We have PII spread across Word documents, PDFs, Excel spreadsheets, and CSV exports. We've been using different tools for each format; it's a mess. Is there one tool that handles all of them?

Seven formats natively supported in a single interface with a consistent engine. The same 260+ entity types and same preset configurations apply whether the document is a PDF contract, XLSX customer list, or JSON API log export. Batch processing handles mixed-format sets. Single audit trail across all formats. One tool replaces four or five format-specific workarounds. Example: An HR consultancy processes employee data in four formats: job application PDFs, interview notes in DOCX, compensation data in XLSX, and onboarding system exports in CSV. They previously used 3 separate tools for these formats, with different entity coverage and no cross-format consistency. After migrating to anonym.legal, all four formats process through one interface with the same "HR Data GDPR" preset. Anonymization consistency improved; tool licensing cost reduced by 60%.

We have XLSX spreadsheets with PII scattered across hundreds of columns and rows: phone numbers in one column, names in another, SSNs mixed with account numbers. How do we anonymize these efficiently?

Native XLSX support with cell-level PII detection that uses column headers as context signals. A column labeled "SSN" with values matching partial patterns is detected as SSN context even for edge-case values. Multi-sheet processing applies the same configuration across all sheets. Output preserves Excel formatting while anonymizing PII cell values. Column structures, formulas, and non-PII data are preserved. Example: An HR department receives employee records from an acquired company: a 15,000-row XLSX with 40 columns including employee IDs, names, SSNs, salaries, performance scores, and manager names. Anonymizing for sharing with an external HR consultant requires removing personal identifiers while preserving the statistical structure. anonym.legal processes the full XLSX with the "HR GDPR" preset: names, SSNs, email addresses, and phone numbers anonymized cell-by-cell while salary data, performance scores, and department codes are preserved. Processing time: 8 minutes.

Our application logs contain user data in JSON format: API logs with user IDs, email addresses, and IP addresses mixed with technical fields. How do we anonymize logs for debugging without removing too much context?

Native JSON support with nested structure traversal detects PII at any depth within JSON documents. Email addresses, IPs, names, and other entities are detected by content, not path, so the same configuration works across variable log schemas. Technical metadata (timestamps, error codes, stack traces, technical IDs) is preserved. The Replace method substitutes PII with consistent fake values, preserving referential integrity within log files (the same user email replaced with the same fake email across all log entries). Example: A SaaS company shares application logs with an external penetration testing firm. Raw logs contain 4,200 unique user email addresses and IP addresses. anonym.legal processes 180MB of JSON logs in batch, replacing all email addresses with consistent fake addresses (user1@example.com, user2@example.com) and IP addresses with anonymized IPs. The pen test firm receives logs with full technical context but zero real user data. GDPR compliance for third-party data sharing is maintained.
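The consistent-fake replacement described above can be sketched as a recursive walk over the parsed JSON. The regex detector and fake-address format here are illustrative stand-ins for the real detection engine:

```python
import re

EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+")

def mask_json(node, mapping):
    # Recursively walk dicts/lists; replace emails in string values with
    # consistent fakes so referential integrity is preserved.
    if isinstance(node, dict):
        return {k: mask_json(v, mapping) for k, v in node.items()}
    if isinstance(node, list):
        return [mask_json(v, mapping) for v in node]
    if isinstance(node, str):
        def repl(m):
            if m.group(0) not in mapping:
                mapping[m.group(0)] = f"user{len(mapping) + 1}@example.com"
            return mapping[m.group(0)]
        return EMAIL.sub(repl, node)
    return node  # timestamps, error codes, numbers pass through untouched

log = {"ts": "2025-01-01T12:00:00Z", "msg": "login failed for anna@firma.de",
       "user": {"email": "anna@firma.de"}, "code": 401}
mapping = {}
masked = mask_json(log, mapping)

assert masked["user"]["email"] == "user1@example.com"
assert masked["msg"] == "login failed for user1@example.com"  # same fake both places
assert masked["code"] == 401                                  # technical fields intact
```

Because detection runs on string values rather than fixed paths, the same walk works whether the schema is an API log, an audit event, or an error report.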

We need to share research data in CSV format with a university partner. The CSV contains survey responses with PII mixed into free-text fields. Are there tools that can detect PII in CSV free-text columns?

CSV processing applies entity detection to every cell, including free-text columns, using the same NLP + transformer stack as document processing. PII entities discovered in free-text survey responses ("My name is John and I work at IBM") are detected and replaced while the surrounding context ("I feel that the new policy...") is preserved. Structured columns with PII headers are also cleaned. The result is a genuinely anonymized CSV that maintains research utility. Example: A research consortium at three European universities shares a 5,000-row survey CSV about patient experiences. Free-text columns contain incidental names, hospital references, and location details that would identify individual respondents. anonym.legal processes the CSV: 47 free-text PII entities detected and anonymized across the free-text columns, structured PII columns (name, email, birth date) cleaned. The anonymized CSV is shared between institutions in compliance with GDPR Article 89 (research exemption requirements).

Our e-discovery production includes PDFs, Word documents, Excel spreadsheets, and email exports. We need different tools for each. How do we unify this?

Batch processing supports PDF, DOCX, XLSX, TXT, CSV, JSON, and XML in a single batch run. The same Presidio-based detection engine operates across all formats. Output is format-consistent regardless of input type. This eliminates the need for format-specific tools and ensures consistent detection across a mixed-format document production.

Our application logs contain customer PII in JSON format. How do we mask sensitive fields before sending logs to our analytics platform?

JSON and XML processing handles nested structure natively: PII detection operates on string values within the document model, not on the raw file bytes. Processing preserves document structure, only modifying PII-containing string values. Batch processing integrates into log rotation pipelines.

Text-Based Image PII Detection

We have thousands of scanned contract PDFs: they're image-based PDFs with no text layer. Standard PDF PII tools can't detect anything. How do we process scanned documents?

The text-in-image detection feature integrates OCR with NLP in a single processing pipeline. Image-based PDFs and image files (PNG, JPG) containing scanned text are processed through OCR to extract text, then through the full 260+ entity NLP pipeline for PII detection. The anonymized output is the extracted text with PII replaced, redacted, or encrypted. Batch processing handles large legacy document archives. Example: A law firm undertaking a GDPR data audit discovers 80,000 image-based PDF client contracts scanned between 1998-2010. Standard PII tools return zero detections. Using anonym.legal's text-in-image processing, the firm processes the archive in batches of 5,000. OCR extracts text from each image-PDF, NLP detects client names, addresses, ID numbers, and financial references, and the anonymized text output enables the firm to fulfill right-to-erasure requests for the historical archive. Previously impossible compliance obligation fulfilled.
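The OCR-then-NLP pipeline can be sketched end to end. Both stages below are stubs: the `ocr` function returns a fixed fixture instead of calling a real engine such as Tesseract, and the detector covers a single toy pattern rather than 260+ entity types.

```python
import re

def ocr(image_path: str) -> str:
    # Stand-in for a real OCR engine; returns the text extracted from a
    # scanned page. Hypothetical fixture for illustration only.
    return "Client: Maria Lopez, ID 483-92-1177, Madrid"

def detect_pii(text: str):
    # Tiny rule-based detector standing in for the full NLP pipeline.
    patterns = {"SSN": r"\b\d{3}-\d{2}-\d{4}\b"}
    return [(label, m.group(0)) for label, rx in patterns.items()
            for m in re.finditer(rx, text)]

def anonymize_scan(image_path: str) -> str:
    # Stage 1: OCR extracts text; stage 2: detected entities are replaced.
    text = ocr(image_path)
    for label, value in detect_pii(text):
        text = text.replace(value, f"[{label}]")
    return text

assert anonymize_scan("contract_p1.png") == "Client: Maria Lopez, ID [SSN], Madrid"
```

The key architectural point is that the anonymization stage operates on the extracted text, which is why the same entity configuration used for DOCX and PDF text applies unchanged to scanned archives.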

Our support team takes screenshots and shares them internally โ€” these screenshots often contain customer data. How do we detect and remove PII from screenshots before sharing?

Image PII detection processes PNG and JPG screenshots, applying OCR to extract visible text and NLP to detect PII entities in the extracted text. The anonymized output reports which entities were found in the screenshot content. Users can clean screenshots before sharing them internally or with external parties. Particularly useful for Jira/ServiceNow ticket documentation, internal wiki screenshots, and contractor-facing technical documentation. Example: A SaaS company's IT help desk creates Jira tickets with screenshots of user account problems. Screenshots contain user email addresses, subscription details, and billing information. After a GDPR review found that screenshots in Jira were accessible to all 200 engineering staff (including contractors without DPAs), the company implemented anonym.legal image scanning as a pre-sharing step. Support agents scan screenshots before attaching to tickets; PII-detected screenshots go through a quick anonymization review. Internal PII exposure through ticket attachments is eliminated at the point of sharing.

We receive forms filled out by hand and scanned โ€” job applications, patient intake forms, insurance claims. The scanned images contain handwritten PII. Is there a way to automatically detect and redact it?

Text-in-image processing includes OCR for both printed and handwritten text extraction. For handwritten forms, OCR extracts the text content, NLP detects PII entities, and the anonymization is applied to the extracted text output. Quality depends on OCR accuracy for handwriting (an inherent technical limitation), but for reasonably legible handwriting, the integrated pipeline provides practical automation for high-volume form processing at fixed subscription cost. Example: A regional health insurance provider processes 3,000 handwritten claim forms per month. Manual PII redaction for audit purposes requires 0.5 FTE (20 hours/week). anonym.legal's image PII processing reduces manual review to exception handling for low-OCR-confidence forms โ€” approximately 15% of volume. Manual review drops to 3 hours/week. Annual labor saving: approximately โ‚ฌ24,000. Annual anonym.legal Professional plan: โ‚ฌ180. ROI: 133x.
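A sketch of the exception-handling step described above; the threshold value and helper function are illustrative assumptions, not product defaults.

```python
def route(forms, threshold=0.80):
    """Split forms into fully automated vs. manual-review queues by OCR
    confidence: low-confidence handwriting goes to a human, the rest does not."""
    auto, manual = [], []
    for form_id, confidence in forms:
        (auto if confidence >= threshold else manual).append(form_id)
    return auto, manual

# ROI arithmetic from the example above: 20 h/week -> 3 h/week saves
# 17 h/week, roughly 880 h/year; at ~EUR 27/h that is ~EUR 24,000/year
# against a EUR 180/year plan, i.e. 24000 / 180 ~= 133x.
```

In the insurer example, roughly 15% of forms would land in the `manual` queue, which is what shrinks review time from 20 hours to 3 hours per week.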

Employees share photos of whiteboards and printed materials in our collaboration tools. These often contain customer names and project details written on the whiteboard. How do we handle this type of PII?

Image text detection processes photographs of whiteboards and physical documents, applying OCR to extract visible text and NLP to detect entities. Users can upload whiteboard photos before sharing them in collaboration tools to get a PII assessment. The output identifies any detected PII entities in the image's text content, enabling users to either anonymize the sharing (describe what's on the whiteboard without the specific PII) or limit sharing scope appropriately. Example: A management consulting firm's engagement team photographs client strategy session whiteboards to share with remote team members. After a client raised concerns about their company data appearing in the consulting firm's Slack channels, the firm implemented an anonym.legal image review step for all whiteboard shares. Images are processed before posting; images containing client names or financial figures trigger a review step. One month post-implementation, the client concern was formally resolved, with the documented review process serving as evidence of remediation.

We publish research papers and reports that contain screenshots of data analysis tools โ€” these screenshots sometimes show individual-level data. How do we check images before publication?

Image text detection processes screenshots embedded in research documents, extracting text from images in the manuscript and applying PII detection. Researchers can process their draft documents before submission; journal editors can screen final manuscripts before publication. The pipeline identifies which images contain detectable PII entities, enabling targeted replacement of problematic screenshots with properly anonymized sample data before the privacy violation becomes permanent. Example: A data science research group at a European university implements anonym.legal image PII screening as part of their manuscript submission workflow. All draft papers are processed for image PII before submission to journals. In the first 6 months, 7 of 23 submitted manuscripts had at least one image containing PII entities (typically names or IDs in data sample screenshots). All 7 were corrected before submission. The institution's research ethics committee uses this workflow as evidence of appropriate privacy safeguards in the publication process.


Published by George Curta, Founder of anonym.legal