Extract

Extraction is an unattended activity which performs data extraction on folders in a Batch.

Remarks

This activity can only run with Folder scope. The folders to be extracted must be classified, and the assigned content type must have a Data Model.

Inherits from: Unattended Activity

Properties

The following 7 properties are defined.

Property Name Description
General
Default Content Type Type: Content Type

The content type to be assigned to unclassified folders.

Flag Invalid Items Type: Boolean, Default: False

If set to true, folders will be flagged if the extracted data elements contain any validation errors.

Purge OCR Details Type: Boolean, Default: False

If set to true, ocr details and values will be purged after extraction has been performed. Purging ocr details may be necessary to reduce the size of the index data file stored on Batch Folders. Extremely large index data files can degrade performance when displaying the Index Panel.

Preserve Values Type: Boolean, Default: False

If set to true, existing field values on the document for which no extracted value is found will be preserved.

Processing Options
Error Disposition Type: IssueDisposition, Default: Flag, Log

Determines what happens when an error occurs processing an activity.

Maximum Consecutive Errors Type: Int32, Default: 0

The maximum number of consecutive errors, after which a critical stop will be raised. A critical stop will cause services to stop running.

Concurrency Mode Type: ConcurrencyMode, Default: Multiple

Specifies the parallel processing mode for this activity. Can be one of the following values:

  • Multiple - Multiple instances can run concurrently.
  • PerMachine - Only a single instance can run per machine.
  • Single - Only a single instance can run per Grooper repository.
This value determines the type of Thread Pool on which the activity can be executed.

See Also

Content Type

Used By

Batch Folder - Apply Activity, Batch Process Step