Go to DCV Architecture
The following diagram shows how sections connect to each other to form tasks:
flowchart LR;
A[1.Region Pre-Detection]-->C[2.1.Shared Detection]
C---D[2.2.Barcode Localization]
C---E[2.2.Text-line Localization]
C---F[2.2.Document Detection]
D---G[3.Barcode Decoding]
E---H[3.Text-line Recognition]
F---I[3.Document Normalization]
style H fill:#f96,stroke:#333,stroke-width:4px
In this article, we’ll discuss the section Text-line Recognition which is usually the 3rd section of a “Recognize-Text-Lines” task.
Section 3 - Text-line Recognition
The purpose of this section is to recognize the text from the text-line areas identified in the previous section “Text-line Localization”.
Constituting Stages
This section consists of the following stages:
- Cropping: to cut out the text-line areas based on text-line localization results. This results in one or multiple colour images.
- Grayscaling: to convert the colour image(s) to grayscale.
- Transforming: to transform the grayscale image(s).
- Text-line-recognizing: to recognize the text.
Output and Parameters
Each of these stages has its own output (known as an intermediate result) and usually a specific parameter that can regulate the operation:
Stage | Intermediate Result Type | Related Parameter |
---|---|---|
Cropping | IRUT_COLOUR_IMAGE |
N/A |
Grayscaling | IRUT_GRAYSCALE_IMAGE |
ColourConversionModes |
Transforming | IRUT_TRANSFORMED_GRAYSCALE_IMAGE |
GrayscaleTransformationModes |
Text-line-recognizing | IRUT_RECOGNIZED_TEXT_LINES |
DictionaryPath DictionaryCorrectionThresholds StringLengthRange StringRegExPattern |