Geometric

Defines a rectangular region using anchors, and extracts the character data bounded by the region as a Section Instance.

Remarks

Geometric section extraction is performed as follows:

  1. The Main Extractor establishes the initial region.
  2. Each edge of the region can then be adjusted using text anchors, page boundaries, or manual rules.
  3. The result can then be expanded to detected lines which bound the region..
NOTE: This method cannot be used in cases where the section content spans multiple pages.

Inherits from: Section Extract Method

Properties

The following 6 properties are defined.

Property Name Description
General
Main Extractor Type: Embedded Extractor

The extractor to be used to identify the initial region for each Section Instance. Each instance returned by this extractor will become a Section Instance in the output. Please note that any results returned by this extractor which span multiple pages will be discarded.

Left Adjustment Type: Edge Adjustment

Defines an optional adjustment for the left edge of each output instance.

Top Adjustment Type: Edge Adjustment

Defines an optional adjustment for the top edge of each output instance.

Right Adjustment Type: Edge Adjustment

Defines an optional adjustment for the right edge of each output instance.

Bottom Adjustment Type: Edge Adjustment

Defines an optional adjustment for the bottom edge of each output instance.

Line Detect Limit Type: Border, Default: 0

If this property is set, line detection will be enabled, and the section region will be expanded to nearby lines. Line detection is performed after the initial region has been established using the Main Extractor and edge adjustments. This property defines the maximum distance from each edge of the initial region to a detected line. Any empty value disables line detection.

See Also

Border, Edge Adjustment, Embedded Extractor

Used By

Data Section