Achieve Safe Harbor de-identification by automatically detecting and redacting all 18 HIPAA identifiers. Protect PHI with 99.7% accuracy and comprehensive audit trails.
Safe Harbor de-identification requires removal of these 18 specific identifier types. Our system detects and redacts all of them automatically.
Full name, last name, first name initials
All geographic subdivisions smaller than state
All elements of dates except year
All telephone numbers
All facsimile numbers
Social Security numbers
Medical record numbers
Health plan beneficiary numbers
Account numbers
Certificate or license numbers
Vehicle identifiers and serial numbers
Device identifiers and serial numbers
Web Universal Resource Locators
Internet Protocol address numbers
Biometric identifiers including fingerprints
Full face photographic images
Any other unique identifying characteristic
All 18 PHI identifiers detected and redacted
Patient names, relatives, employers, and household members detected with cultural awareness.
All geographic subdivisions smaller than state including street, city, ZIP code (first 3 digits excepted).
All dates except year for ages under 90. Birth dates, admission dates, discharge dates, death dates.
Phone numbers, fax numbers, email addresses, and other electronic contact identifiers.
SSN, MRN, health plan ID, account numbers, certificate/license numbers, and device IDs.
Biometric identifiers, photos, and any other unique identifying characteristic.
Safe Harbor compliant workflow
Securely submit medical records, clinical notes, or any documents containing PHI.
AI scans for all 18 HIPAA identifiers with context-aware accuracy.
Apply Safe Harbor compliant redaction to all detected identifiers.
Receive de-identified document with audit trail and compliance certificate.
Get started with just a few lines of code
import requests
api_key = "your_api_key"
url = "https://api.redactionapi.net/v1/redact"
data = {
"text": "John Smith's SSN is 123-45-6789",
"redaction_types": ["ssn", "person_name"],
"output_format": "redacted"
}
response = requests.post(url,
headers={"Authorization": f"Bearer {api_key}"},
json=data
)
print(response.json())
# Output: {"redacted_text": "[PERSON_NAME]'s SSN is [SSN_REDACTED]"}
const axios = require('axios');
const apiKey = 'your_api_key';
const url = 'https://api.redactionapi.net/v1/redact';
const data = {
text: "John Smith's SSN is 123-45-6789",
redaction_types: ["ssn", "person_name"],
output_format: "redacted"
};
axios.post(url, data, {
headers: { 'Authorization': `Bearer ${apiKey}` }
})
.then(response => {
console.log(response.data);
// Output: {"redacted_text": "[PERSON_NAME]'s SSN is [SSN_REDACTED]"}
});
curl -X POST https://api.redactionapi.net/v1/redact \
-H "Authorization: Bearer your_api_key" \
-H "Content-Type: application/json" \
-d '{
"text": "John Smith's SSN is 123-45-6789",
"redaction_types": ["ssn", "person_name"],
"output_format": "redacted"
}'
# Response:
# {"redacted_text": "[PERSON_NAME]'s SSN is [SSN_REDACTED]"}
The Health Insurance Portability and Accountability Act (HIPAA) Privacy Rule establishes national standards for protecting individuals' medical records and other personal health information. For organizations that need to use or share health data while protecting patient privacy, HIPAA provides two de-identification methods: Safe Harbor and Expert Determination.
The Safe Harbor method, which our automated system implements, requires removal of 18 specific types of identifiers and requires that the covered entity have no actual knowledge that the remaining information could identify an individual. This provides a clear, rules-based approach to de-identification that can be automated reliably.
Safe Harbor offers a prescriptive approach: remove the 18 specified identifiers, and the data is considered de-identified. This works well for most use cases and can be fully automated. Expert Determination, in contrast, requires a qualified statistical expert to determine that the risk of identification is very small. While potentially preserving more data utility, it requires expensive expert involvement and is difficult to scale.
For most organizations, Safe Harbor provides the right balance of compliance certainty, cost efficiency, and automation potential. Our system implements Safe Harbor with 99.7% accuracy, enabling high-volume de-identification while maintaining compliance.
Healthcare documents present unique challenges for PHI detection. Clinical notes contain medical terminology that must be preserved while removing identifying information. Dictated reports may have non-standard formatting. Handwritten annotations require OCR. Family history sections mention relatives. Contextual references ("the 45-year-old diabetic patient") can be identifying.
Our healthcare-specific AI models understand these nuances. Trained on millions of medical documents, they recognize clinical contexts, understand medical terminology, and identify PHI even when embedded in complex medical narratives. The result is accurate de-identification that preserves clinical utility.
HIPAA's date rules merit special attention. All dates directly related to an individual must be removed, including birth date, admission date, discharge date, date of death, and all ages over 89. However, the year can generally be retained for individuals under 90. Our system intelligently handles these nuances, applying appropriate redaction rules based on context and patient age when known.
Geographic data smaller than state must be removed, with special rules for ZIP codes. The initial three digits of a ZIP code may be retained if the geographic unit contains more than 20,000 people. Our system implements these rules automatically, masking or removing geographic information appropriately based on population thresholds.
HIPAA compliance requires documentation of de-identification efforts. Our system generates comprehensive audit trails documenting: each detected PHI element, the identifier category, redaction method applied, confidence score, and processing timestamp. This documentation supports your compliance program and provides evidence for audits.
As a service processing PHI on behalf of covered entities, we execute Business Associate Agreements (BAAs) establishing our obligations under HIPAA. Our infrastructure implements all required Security Rule safeguards including encryption at rest and in transit, access controls, audit logging, and breach notification procedures. We undergo regular third-party security assessments to verify compliance.
RedactionAPI has transformed our document processing workflow. We've reduced manual redaction time by 95% while achieving better accuracy than our previous manual process.
The API integration was seamless. Within a week, we had automated redaction running across all our customer support channels, ensuring GDPR compliance effortlessly.
We process over 50,000 legal documents monthly. RedactionAPI handles it all with incredible accuracy and speed. It's become an essential part of our legal tech stack.
The multi-language support is outstanding. We operate in 30 countries and RedactionAPI handles all our documents regardless of language with consistent accuracy.
Trusted by 500+ enterprises worldwide





The 18 HIPAA Safe Harbor identifiers are: (1) Names, (2) Geographic data smaller than state, (3) Dates except year, (4) Phone numbers, (5) Fax numbers, (6) Email addresses, (7) Social Security numbers, (8) Medical record numbers, (9) Health plan beneficiary numbers, (10) Account numbers, (11) Certificate/license numbers, (12) Vehicle identifiers, (13) Device identifiers, (14) Web URLs, (15) IP addresses, (16) Biometric identifiers, (17) Full-face photos, (18) Any other unique identifying number.
Safe Harbor is one of two HIPAA-approved de-identification methods. It requires removal of 18 specific identifiers plus assurance that the remaining information cannot identify an individual. Our automated process removes all 18 identifiers and documents the de-identification for compliance purposes.
Yes, we generate comprehensive audit documentation including: list of detected PHI types, redaction methods applied, confidence scores, processing timestamps, and compliance certificates. This documentation supports your HIPAA compliance program and audit requirements.
Yes, we process all common healthcare document formats including PDF medical records, HL7 messages, FHIR resources, CDA documents, clinical notes, discharge summaries, lab results, and scanned documents with handwritten notes. Our OCR handles poor quality scans common in healthcare.
Our HIPAA profile achieves 99.7% accuracy across all 18 identifier types. We use healthcare-specific AI models trained on millions of medical documents to understand clinical terminology and context. For critical applications, human-in-the-loop review options are available.
Yes, our platform is fully HIPAA compliant. We execute Business Associate Agreements (BAAs) with covered entities. Our infrastructure meets all HIPAA Security Rule requirements including encryption, access controls, audit logging, and breach notification procedures.