You can use the Eduction task to redact information in documents.
To enable redaction, set the configuration parameter RedactedOutput=True. To control what replaces the redacted text, set the configuration parameter RedactionOutputString (a replacement value) or RedactionReplacementCharacter (a replacement character).
For example, the following configuration redacts addresses contained in a document's DRECONTENT or ADDRESS fields:
[ImportTasks]
Post0=Eduction:EductionSettings

[EductionSettings]
ResourceFiles=C:\Autonomy\IDOLServer\Eduction\address_gb.ecr
SearchFields=DRECONTENT,ADDRESS
RedactedOutput=True
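To illustrate the replacement parameters, the following sketch extends the same task with RedactionOutputString. The value [ADDRESS REMOVED] is an arbitrary placeholder chosen for illustration, not a documented default, and the // comment lines assume the usual IDOL configuration comment syntax.

```
[ImportTasks]
Post0=Eduction:EductionSettings

[EductionSettings]
ResourceFiles=C:\Autonomy\IDOLServer\Eduction\address_gb.ecr
SearchFields=DRECONTENT,ADDRESS
RedactedOutput=True
// Illustrative value only: the string written in place of each match.
RedactionOutputString=[ADDRESS REMOVED]
// Alternative (assumed to be used instead of the line above):
// RedactionReplacementCharacter=*
```

With a setting like this, the redacted output would be expected to contain the configured string in place of each matched address. Check the parameter reference for your version before relying on either option.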
The fields specified by SearchFields are not modified. CFS places the redacted text in fields with a _REDACTED suffix. For example:
#DREFIELD ADDRESS="Cambridge Business Park, Cowley Road, Cambridge, CB4 0WZ"
#DREFIELD ADDRESS_REDACTED="[redacted]"
The Eduction task also adds the value, offset, and score for any matched entities to the document. For example:
#DREFIELD /offset="298"
#DREFIELD /score="1"
#DREFIELD /value="Cambridge Business Park, Cowley Road, Cambridge, CB4 0WZ"