Tag: ocr
-
In this tutorial, you will learn how to build a professional desktop application for reading Machine Readable Zones (MRZ) from passports and ID cards with portrait and document detection capabilities. By the end of this guide, you’ll have a fully functional GUI application built with PySide6 and powered by Dynamsoft...
-
When dealing with bulk document scanning, one of the biggest challenges is making scanned documents searchable and organized. Unlike digital-born documents, scanned images are essentially pictures—you can’t search them, copy text from them, or organize them by content. This is where Optical Character Recognition (OCR) becomes invaluable. In this comprehensive...
-
In today’s digital workplace, the ability to quickly scan, process, and understand documents is crucial. Whether you’re digitizing invoices, processing legal documents, or managing medical records, having an efficient document workflow can save hours of manual work. In this comprehensive tutorial, you’ll learn how to build a modern web-based document...
-
Machine Readable Zone (MRZ) recognition is a critical feature in applications that require scanning and processing identity documents, such as passports and visas. On Android devices, implementing accurate MRZ recognition can significantly enhance the efficiency and user experience of such apps. In this article, we will explore how to build...
-
In today’s digital world, automating document processing has become essential for businesses across various industries. Whether you’re building a hotel check-in system, airport kiosk, identity verification platform, or government service portal, the ability to quickly and accurately extract information from passports and ID cards can significantly improve user experience and...
-
Organizations of all sizes face a common challenge: efficiently managing the vast documents that power their operations. Manual data entry is a persistent bottleneck—time-consuming, error-prone, and costly—that hurts productivity and increases expenses. Optical Character Recognition (OCR) technology solves this by converting scanned documents into fully searchable, editable text for modern...
-
When scanning barcodes in production (warehouse, logistics, healthcare, retail), image quality issues like blur, motion, glare, low contrast, or physical damage can cause even strong barcode algorithms to fail. Fortunately, most linear (1D) barcodes also include human-readable text. That text becomes a powerful assist layer: if the barcode fails, OCR...
-
Apple’s vision framework provides APIs for performing computer vision tasks such as face detection, document boundary detection, and text extraction. In this tutorial, we’ll explore how to leverage these APIs and Dynamsoft MRZ SDK to build a robust iOS ID scanner app. What you’ll build: A SwiftUI iOS app that...
-
In today’s digital workplace, document scanning and text recognition are vital capabilities for many business applications. In this tutorial, you’ll learn how to build a Windows document scanner application with Optical Character Recognition (OCR) using: .NET 8 C# Dynamic Web TWAIN REST API Windows.Media.Ocr API (Windows built-in OCR engine) By...
-
In the digital age, PDFs are one of the most common formats for sharing and archiving documents. They preserve layout, fonts, and images across platforms. However, not all PDFs are created equal. While many contain actual text data, others—especially scanned documents—are essentially just images wrapped in a PDF container. This...
-
Building a web-based OCR (Optical Character Recognition) application has never been easier with modern JavaScript libraries. In this comprehensive tutorial, we’ll create a powerful OCR app that can process images, multi-page TIFFs, and PDFs, converting them into searchable PDF documents - all running entirely in the browser with free tools....
-
SEMI (Semiconductor Equipment and Materials International) font is a special dot matrix font used for marking silicon wafers. In this tutorial, we’ll walk through building a Python application to recognize these specialized markings using Dynamsoft Capture Vision SDK. What you’ll build: A cross-platform Python script that loads a custom SEMI...
-
The Dynamsoft MRZ Scanner JavaScript SDK is a powerful, high-level wrapper built on top of the lower-level Dynamsoft Capture Vision API, designed to help developers add Machine Readable Zone (MRZ) recognition to any web application with ease. It is open-source and provides a clean and customizable API for scanning MRZ...
-
VIN stands for vehicle identification number. It is a unique 17-character code to identify individual vehicles. The VIN contains information about your car’s make, model, and year, as well as the country where it was manufactured and other important details. You may find the VIN label on your car’s dashboard,...
-
VIN stands for vehicle identification number. It is a unique 17-character code to identify individual vehicles. The VIN contains information about your car’s make, model, and year, as well as the country where it was manufactured and other important details. You may find the VIN label on your car’s dashboard,...
-
The Machine Readable Zone (MRZ) is a section on passports, IDs, visas, and other travel documents that encodes key personal information, such as the holder’s name, nationality, document number, date of birth, gender, and document expiration date, in a standardized format. In this article, you’ll learn how to use the...
-
When scanning documents via scanners, we may get misoriented document images. The use of automatic document feeding makes it happen more often. We can use image processing to detect the orientation. There are many ways to do this. For example, in Latin script text, ascenders are more likely to occur...
-
In this article, we continue our series on parcel scanning technologies by delving into the world of C++. Building on our previous JavaScript tutorial, we will explore how to implement barcode scanning and OCR text extraction using C++. This guide will cover setting up the development environment, integrating necessary libraries,...
-
In today’s fast-paced world, efficiently managing and tracking parcels is crucial for businesses. The use of 1D and 2D barcodes has revolutionized the way parcels are labeled, scanned, and processed. Additionally, Optical Character Recognition (OCR) technology has enabled the digitization of text on parcels, facilitating automated data extraction and processing....
-
Identity documents are adopted in many countries to prove a person’s identity. They usually have a card design around 8.6 cm x 5.4 cm in size with three-line MRZ, like ID cards in Germany and the Netherlands. Dutch ID Card Specimen: In this article, we are going to talk about...
-
In the previous article, we built a JavaFX demo app to scan documents using Dynamic Web TWAIN Service’s REST API. The demo app can scan documents via protocols like TWAIN, WIA, SANE, and ICA and save the documents into a PDF file using PDFBox. In this article, we are going...
-
MRZ stands for “machine-readable zone”. It is usually at the bottom of an identity page for machines to read its info like document type, name, nationality, date of birth, sex and expiration date, etc. A dataset of MRZ images is needed to train an OCR engine of MRZ or evaluate...
-
Full-text search is a process of examining all of the words in every stored document to find documents meeting a certain criteria (e.g. text specified by a user). It is possible for the full-text-search engine to directly scan the contents of the documents if the number of documents is small....
-
Paper documents are still ubiquitous in our everyday work, like invoices, checks and printed books. It takes effort to manage them. With Dynamic Web TWAIN, we can build a web application to scan documents from document scanners. But further steps are required to better manage the documents and OCR is...