Classify

Performs automated document classification using training and/or rules defined in a Content Model.

Remarks

This activity attempts to assign a Document Type to each Batch Folder object in scope. The set of possible document types and the method of classication are defined in a Content Model. To configure and use this activity, the following prerequisites must be performed:

In cases where a document cannot be classified confidently, the Default Content Type specified on the Content Model will be assigned. If the Content Model has no Classify Method assigned, then the Default Content Type will be assigned to all documents.

Inherits from: Unattended Activity

Properties

The following 8 properties are defined.

Property Name Description
General
Content Model Scope Type: Content Type

The Content Model or Content Category containing the set of allowed Document Types.

Classification Level Type: ClassificationLevel, Default: DocType

The level within the Content Model where the set of allowed content types exist. Can be one of the following values:

In most cases, documents should be classified as document types. In some cases, however, it may be convenient to classify documents into a category rather than as a specific document type. In such cases, a numeric level can be specified, which indicates a specific number of levels below the Content Model Scope.

Output Level Type: ClassificationLevel, Default: DocType

Sets the level that classification will be output. Can be one of the following values:

Model Refresh Interval Type: Int32, Default: 60, Range: 0 - 2147483647

The interval (in seconds) at which content model information will be refreshed from the repository. Controls how frequently a service running this activity will check for changes to the content model, such as new document types or additional training. A value of 0 will disable automatic refresh, and newly-created document types will not be recognized until services are restarted.

Supress Candidate List Type: Boolean, Default: False

If set to true, disables saving of the candidate list. By default, the classification process saves a list of a potential classification candidates for each document. This list is only useful if a review step using the Classification Viewer control is included in the Batch Process.

Processing Options
Error Disposition Type: IssueDisposition, Default: Flag, Log

Determines what happens when an error occurs processing an activity.

Maximum Consecutive Errors Type: Int32, Default: 0

The maximum number of consecutive errors, after which a critical stop will be raised. A critical stop will cause services to stop running.

Concurrency Mode Type: ConcurrencyMode, Default: Multiple

Specifies the parallel processing mode for this activity. Can be one of the following values:

  • Multiple - Multiple instances can run concurrently.
  • PerMachine - Only a single instance can run per machine.
  • Single - Only a single instance can run per Grooper repository.
This value determines the type of Thread Pool on which the activity can be executed.

See Also

Content Type

Used By

Batch Folder - Apply Activity, Batch Process Step