The ExtractMetadata task extracts metadata from the file associated with a document. This task extracts a subset of the metadata obtained by standard KeyView filtering. It is faster than standard KeyView filtering and does not extract the file content.
When documents are ingested, CFS automatically extracts metadata. Do not use this task unless you have set the fields AUTN_NO_FILTER and AUTN_NO_EXTRACT on a document and want to extract basic metadata only.
The ExtractMetadata task is configured as a Pre task. Specify the name of the section that contains settings for the task. For example:
[ExtractMetadata] Pre0=Lua:scripts/nofilter.lua Pre1=ExtractMetadata:ExtractMetadataSettings [ExtractMetadataSettings] FieldnamePrefix=FIELD_ ReservedFieldnames=Reserved1,Reserved2
The Pre0 task runs a Lua script that adds the fields AUTN_NO_FILTER and AUTN_NO_EXTRACT to documents. Adding these fields prevents KeyView from filtering the documents and extracting subfiles.
The Pre1 task runs the ExtractMetadata task using the settings contained in the [ExtractMetadataSettings] section of the CFS configuration file.
The FieldnamePrefix parameter specifies a prefix for the names of the metadata fields that are added to the document. The ReservedFieldnames parameter specifies a comma-separated list of field names that the task must not use. If the task needs to add a metadata field with one of the specified names, it prefixes the name with an underscore. For example, with the settings specified above, the task would not add a field named FIELD_Reserved1. Instead, the task would add _FIELD_Reserved1.
|
|