Go to DCV Architecture
The following diagram shows how sections connect to each other to form tasks:
flowchart LR;
A[1.Region Pre-Detection]-->C[2.1.Shared Detection]
C---D[2.2.Barcode Localization]
C---E[2.2.Text-line Localization]
C---F[2.2.Document Detection]
D---G[3.Barcode Decoding]
E---H[3.Text-line Recognition]
F---I[3.Document Normalization]
style A fill:#f96,stroke:#333,stroke-width:4px
In this article, we’ll discuss the first section of a task - Region Pre-Detection:
Section 1 - Region Pre-Detection
The purpose of this section is to reduce the overall time cost of the task by finding regions of interest (ROIs) and thus ignoring other parts of the image during subsequent processing. It is not required for the task and can be skipped in many cases.
Constituting Stages
This section consists of multiple stages:
- Cropping: to crop out the original ROI specified by the user. If not specified, return the original image as a whole.
- Down-scaling: to down-scale a massive image.
- Grayscaling: to convert a colour image to grayscale.
- Transforming: to transform a grayscale image.
- Pre-Detecting: to pre-detect the regions exhibit specific features.
Output and Parameters
Each of the five stages has its own output (known as an intermediate result) and a specific parameter that can regulate the operation:
Stage | Intermediate Result Type | Related Parameter |
---|---|---|
Cropping | IRUT_COLOUR_IMAGE |
Location |
Down-scaling | IRUT_SCALED_DOWN_COLOUR_IMAGE |
ScaleDownThreshold |
Grayscaling | IRUT_GRAYSCALE_IMAGE |
ColourConversionModes |
Transforming | IRUT_TRANSFORMED_GRAYSCALE_IMAGE |
GrayscaleTransformationModes |
Pre-Detecting | IRUT_PREDETECTED_REGIONS |
RegionPredetectionModes |