AI in ID Card Verification: Text Extraction, Auto Cropping Image, and Evaluation

AI technologies, particularly OCR, can enhance the identity verification process. Understanding the workings of OCR helps us to understand better its capabilities in automating text extraction, image cropping, and image quality evaluation. All of these leads to a new, faster, and more accurate approach of identity verification.
March 1, 2024

Have you ever thought about why the process of verifying identity works so fast nowadays? Traditionally, Know Your Customer (KYC), which is a procedure of identifying and verifying customers usually used by financial companies or institutions, was executed manually by a human personnel. At that time, The verifying process was far from simple and could take minutes if not hours. You need to go to the office in person, queue, wait for a staff member to inspect the documents, answer their questions, let them compare your answers and real life appearance with the documents, etc.

However, with AI technologies and their capabilities in understanding language and image processing, verifying a person’s identity by their ID card or document is no longer a hassle! One of the most influential technologies that can help us is called Optical Character Recognition (OCR).

What is OCR?

Optical Character Recognition (OCR) is one of many AI systems that can automate data extraction from scanned documents, photographs of texts, and image-only PDFs. Machines can’t comprehend a photo like how humans see shapes and read text from pictures. However, with OCR technology, the system can recognize text in images and convert them into machine-readable digital files which can also be edited and analyzed by other softwares.

ekyc indonesia ocr ktp

How Does OCR Work?

Understanding the mechanisms behind OCR can help us to better comprehend its precise contributions to the e-KYC process. In general, the operational framework of OCR involves four key steps:

  1. Image acquisition

In the first stage, OCR scans the image and converts the document into binary data. This step is crucial as it distinguishes various brightness levels within the image, classifying light areas as background and dark areas as text, or vice versa.

  1. Pre-processing

The OCR software also takes charge of improving the quality of acquired images by utilizing a series of its image processing capabilities. OCR can refine images by correcting skewed images, eliminating noise through despeckling, and cleaning up boxes and lines. In addition, an OCR program can also analyze the structure of a document, dividing the page into elements such as blocks of texts, tables, or images.

  1. Text recognition

After identifying the text areas, the third and most crucial step is the text recognition. This stage involves the use of two primary OCR algorithms: pattern matching and feature extraction. Pattern matching compares character images (glyphs) with stored examples of similar font and scale. This is a method particularly suitable for known font documents. On the other hand, feature extraction breaks down glyphs into distinctive features like angled lines, crossed lines, or curves to identify the best match. This approach offers adaptability when faced with specific letters or numbers with unique features.

  1. Post-processing

Finally, following successful text recognition, the system converts the recognized text data into a coherent digital file. This last step ensures that the extracted information can be easily utilized for further analysis or integration into various digital systems.

id card ktp photo ai

How Does OCR Help with ID Card Verification?

The integration of OCR technology in e-KYC mainly focus on their text and image processing abilities:

  1. Text information Extraction

OCR can automate the extraction of crucial text information from ID cards and documents, such as name, date of birth, home address, occupation, etc. This automation accelerates the process of cross checking data and minimizes the likelihood of manual errors that may arise during typing or transcribing.

  1. Auto Cropping Images

As previously stated, OCR has the ability to process images up to a certain extent, with auto cropping falling within its capabilities. Through the automated cropping of photos and signatures from ID cards, OCR plays a crucial role in streamlining face matching and signature verification processes, ultimately saving time and improving precision.

  1. Image Quality Evaluation

Another advantage stemming from OCR and its image processing capabilities is its ability to conduct a thorough evaluation of the image quality. This includes the assessment of various quality parameters such as:

  • Blurred: determining if the image is blurry or clear
  • Bright: determining whether the image is excessively bright or not
  • Dark: determining whether the image is excessively dark or not
  • Flash: detecting if the image contains a reflection from the camera flash
  • Cropped: identifying whether the image has been cropped or if any part is cut out
  • Rotated: identifying whether the image has been rotated or not
  • Copy: assessing whether the photographed ID is a real product or a photocopy
  • Taken from screen: assessing whether the photographed ID is a real product or an image captured from the screen
  • ID card: validating whether the image is a real ID card photo or not

By ensuring that submitted images meet the required standards, OCR significantly enhances the overall reliability and accuracy of the overall e-KYC process.

All in all, the integration of AI into the e-KYC process is predominantly driven by OCR technology, specifically in automating text extraction and image cropping. Additionally, OCR also enables a quick evaluation of the quality of the photo submitted by the user. The capabilities of OCR have introduced a new, faster, and more accurate approach to the overall identity verification process, particularly for photos and signatures in ID card verification.

ekyc ojk

  1. Amazon Web Service: What Is OCR (Optical Character Recognition)?
    https://aws.amazon.com/what-is/ocr/
  2. IBM Cloud Education: What Is Optical Character Recognition (OCR)?
    https://www.ibm.com/blog/optical-character-recognition/

Written by Jessica Donnyson
contact us

Ready to accelerate your digital transformation?

Send us an email, and we will answer your questions regarding our products and services.
Contact Us