Combined Entities
In addition to the entities described in the Eduction Grammar Reference, the IDOL PII Package includes grammar files that contain "combined" entities. These files are named combined_*.ecr (or combined_*_cjkvt.ecr for Japan) and the entities match addresses, dates, driving license numbers, and so on, from multiple countries.
- The entities that end in
/allmatch data for any supported non-CJKVT country or language. - The entities that end in
/all_cjkvtmatch data for any supported CJKVT country. - The entities that end in
/gdprmatch data for any supported country or language subject to GDPR.
For example:
- Using
pii/address/allfromcombined_address.ecrmatches a postal address from any non-CJKVT country. This is similar to using theaddress.ecrgrammar file and extractingpii/address/??. - Using
pii/address/gdprfromcombined_address.ecrmatches a postal address from any country subject to GDPR. This is similar to using theaddress.ecrgrammar file and extractinggdpr/address/??. - Using
pii/date/dob/context/allfromcombined_date.ecrmatches a date of birth written numerically or using words in any language. This is similar to using thedate.ecrgrammar file and extractingpii/date/dob/context/???.
The combined (/all, /all_cjkvt and /gdpr) entities provide a significant improvement in processing speed when you extract matches for all countries or languages.
The combined grammar files might produce fewer matches, because (by default) only a single match is returned in cases where the same characters in the input text would match multiple countries or languages.
TIP: If you need all matches, you can turn on the AllowMultipleResults configuration option. This option slows down the matching process because it does not stop after a single match, but is generally still faster than using the individual grammars.
| File | Entity |
|---|---|
| combined_address.ecr | pii/address/all |
| pii/address/gdpr | |
| pii/address/streetlocation/context/all | |
| pii/address/streetlocation/context/gdpr | |
| pii/address/city/context/all | |
| pii/address/city/context/gdpr | |
| pii/address/country/context/all | |
| pii/address/country/context/gdpr | |
| pii/address/postcode/context/all | |
| pii/address/postcode/context/gdpr | |
| combined_address_cjkvt.ecr | pii/address/all_cjkvt |
| pii/address/address1/context/all_cjkvt | |
| pii/address/region/context/all_cjkvt | |
| pii/address/country/context/all_cjkvt | |
| pii/address/postcode/context/all_cjkvt | |
| combined_banking.ecr |
pii/banking/account_number/context/all |
| pii/banking/account_number/context/gdpr | |
| pii/banking/account_number/landmark/all | |
| pii/banking/account_number/landmark/gdpr | |
| pii/banking/account_number/nocontext/all | |
| pii/banking/account_number/nocontext/gdpr | |
| pii/banking/context/all | |
| pii/banking/context/gdpr | |
| pii/banking/iban/context/all | |
| pii/banking/iban/context/gdpr | |
| pii/banking/iban/landmark/all | |
| pii/banking/iban/landmark/gdpr | |
| pii/banking/iban/nocontext/all | |
| pii/banking/iban/nocontext/gdpr | |
| pii/banking/landmark/all | |
| pii/banking/landmark/gdpr | |
| pii/banking/nocontext/all | |
| pii/banking/nocontext/gdpr | |
| pii/banking/roll_number/context/all | |
| pii/banking/roll_number/context/gdpr | |
| pii/banking/roll_number/landmark/all | |
| pii/banking/roll_number/landmark/gdpr | |
| pii/banking/roll_number/nocontext/all | |
| pii/banking/roll_number/nocontext/gdpr | |
| pii/banking/routing_number/context/all | |
| pii/banking/routing_number/context/gdpr | |
| pii/banking/routing_number/landmark/all | |
| pii/banking/routing_number/landmark/gdpr | |
| pii/banking/routing_number/nocontext/all | |
| pii/banking/routing_number/nocontext/gdpr | |
| pii/banking/swiftcode/context/all | |
| pii/banking/swiftcode/context/gdpr | |
| pii/banking/swiftcode/landmark/all | |
| pii/banking/swiftcode/landmark/gdpr | |
| pii/banking/swiftcode/nocontext/all | |
| pii/banking/swiftcode/nocontext/gdpr | |
| combined_banking_cjkvt.ecr | pii/banking/account_number/context/all_cjkvt |
| pii/banking/account_number/landmark/all_cjkvt | |
| pii/banking/account_number/nocontext/all_cjkvt | |
| pii/banking/context/all_cjkvt | |
| pii/banking/landmark/all_cjkvt | |
| pii/banking/nocontext/all_cjkvt | |
| pii/banking/routing_number/context/all_cjkvt | |
| pii/banking/routing_number/landmark/all_cjkvt | |
| pii/banking/routing_number/nocontext/all_cjkvt | |
| pii/banking/swiftcode/context/all_cjkvt | |
| pii/banking/swiftcode/landmark/all_cjkvt | |
| pii/banking/swiftcode/nocontext/all_cjkvt | |
| combined_date.ecr | pii/date/dob/context/all |
| pii/date/dob/landmark/all | |
| pii/date/dob/context/gdpr | |
| pii/date/dob/landmark/gdpr | |
| pii/date/nocontext/all | |
| pii/date/nocontext/gdpr | |
| combined_date_cjkvt.ecr | pii/date/dob/context/all_cjkvt |
| pii/date/dob/landmark/all_cjkvt | |
| pii/date/nocontext/all_cjkvt | |
| combined_driving.ecr | pii/driving/context/all |
| pii/driving/nocontext/all | |
| pii/driving/landmark/all | |
| pii/driving/context/gdpr | |
| pii/driving/nocontext/gdpr | |
| pii/driving/landmark/gdpr | |
| combined_driving_cjkvt.ecr | pii/driving/context/all_cjkvt |
| pii/driving/nocontext/all_cjkvt | |
| pii/driving/landmark/all_cjkvt | |
| combined_health.ecr | pii/health/ehic/context/gdpr |
| pii/health/ehic/nocontext/gdpr | |
| pii/health/ehic/landmark/gdpr | |
| pii/health/id/context/all | |
| pii/health/id/nocontext/all | |
| pii/health/id/landmark/all | |
| pii/health/id/context/gdpr | |
| pii/health/id/nocontext/gdpr | |
| pii/health/id/landmark/gdpr | |
| combined_health_cjkvt.ecr | pii/health/id/context/all_cjkvt |
| pii/health/id/nocontext/all_cjkvt | |
| pii/health/id/landmark/all_cjkvt | |
| combined_medical_terms.ecr | pii/medical_terms/all |
| pii/medical_terms/gdpr | |
| pii/medical_terms/blood_test/all | |
| pii/medical_terms/blood_test/gdpr | |
| pii/medical_terms/disability/social_security/all | |
| pii/medical_terms/disability/social_security/gdpr | |
| pii/medical_terms/disease_condition/all | |
| pii/medical_terms/disease_condition/gdpr | |
| pii/medical_terms/drug_brand/all | |
| pii/medical_terms/drug_brand/gdpr | |
| pii/medical_terms/drug_generic/all | |
| pii/medical_terms/drug_generic/gdpr | |
| pii/medical_terms/lab_test/all | |
| pii/medical_terms/lab_test/gdpr | |
| pii/medical_terms/lifestyle/all | |
| pii/medical_terms/lifestyle/gdpr | |
| pii/medical_terms/medication/all | |
| pii/medical_terms/medication/gdpr | |
| pii/medical_terms/specialty/all | |
| pii/medical_terms/specialty/gdpr | |
| pii/medical_terms/surgical_procedure/all | |
| pii/medical_terms/surgical_procedure/gdpr | |
| combined_medical_terms_cjkvt.ecr | pii/medical_terms/all_cjkvt |
| combined_name.ecr | pii/name/all |
| pii/name/gdpr | |
| combined_name_cjkvt.ecr | pii/name/all_cjkvt |
| pii/name/latin/all_cjkvt | |
| pii/name/cjkvt/all_cjkvt | |
| combined_national_id.ecr | pii/id/context/all |
| pii/id/nocontext/all | |
| pii/id/landmark/all | |
| pii/id/context/gdpr | |
| pii/id/nocontext/gdpr | |
| pii/id/landmark/gdpr | |
| combined_national_id_cjkvt.ecr | pii/id/context/all_cjkvt |
| pii/id/nocontext/all_cjkvt | |
| pii/id/landmark/all_cjkvt | |
| combined_passport.ecr | pii/passport/context/all |
| pii/passport/nocontext/all | |
| pii/passport/landmark/all | |
| pii/passport/context/gdpr | |
| pii/passport/nocontext/gdpr | |
| pii/passport/landmark/gdpr | |
| combined_passport_cjkvt.ecr | pii/passport/context/all_cjkvt |
| pii/passport/nocontext/all_cjkvt | |
| pii/passport/landmark/all_cjkvt | |
| combined_postcode.ecr | pii/postcode/context/all |
| pii/postcode/nocontext/all | |
| pii/postcode/landmark/all | |
| pii/postcode/context/gdpr | |
| pii/postcode/nocontext/gdpr | |
| pii/postcode/landmark/gdpr | |
| combined_postcode_cjkvt.ecr | pii/postcode/context/all_cjkvt |
| pii/postcode/nocontext/all_cjkvt | |
| pii/postcode/landmark/all_cjkvt | |
| combined_telephone.ecr | pii/telephone/context/all |
| pii/telephone/nocontext/all | |
| pii/telephone/landmark/all | |
| pii/telephone/context/gdpr | |
| pii/telephone/nocontext/gdpr | |
| pii/telephone/landmark/gdpr | |
| combined_telephone_cjkvt.ecr | pii/telephone/context/all_cjkvt |
| pii/telephone/nocontext/all_cjkvt | |
| pii/telephone/landmark/all_cjkvt | |
| combined_tin.ecr | pii/tin/context/all |
| pii/tin/nocontext/all | |
| pii/tin/landmark/all | |
| pii/tin/context/gdpr | |
| pii/tin/nocontext/gdpr | |
| pii/tin/landmark/gdpr | |
| combined_tin_cjkvt.ecr | pii/tin/context/all_cjkvt |
| pii/tin/nocontext/all_cjkvt | |
| pii/tin/landmark/all_cjkvt | |
| combined_voter_id.ecr | pii/voter_id/context/all |
| pii/voter_id/nocontext/all | |
| pii/voter_id/landmark/all |