Műegyetemi Digitális Archívum

Rethinking Numerical Table Recognition: A Transparent Algorithmic Solution for Specific OCR Problems

Date

Type

Könyvfejezet

Language

en

Reading access rights:

Open access

Rights Holder

Szerző

Conference Date

2025-02-03

Conference Place

Budapest

Conference Title

3rd Workshop on Intelligent Infocommunication Networks, Systems and Services

ISBN, e-ISBN

978-963-421-982-8

Container Title

3rd Workshop on Intelligent Infocommunication Networks, Systems and Services

Version

Post print

Faculty

Faculty of Electrical Engineering and Informatics

First Page

1

Subject (OSZKAR)

optical character recognition
artificial intelligence
artificial neural networks
image processing
table ocr

Gender

Konferenciacikk

University

Budapest University of Technology and Economics

OOC works

Abstract

Optical Character Recognition (OCR) is a well-established technology for the recognition of printed and handwritten text/numbers. However, its application to tabular data remains limited, with existing solutions often being costly and/or underperforming, especially when applied to archival data. These challenges stem from the fact that many OCR models are not optimized to handle the unique structural and stylistic characteristics of historical tables. For instance, nineteenth- and twentieth-century Hungarian price tables frequently feature unconventional formatting, such as midline decimal points, irregular separators, and the absence of dividing lines between cells, all of which hinder existing OCR solution's performance. To overcome these limitations, we present a transparent and customizable solution tailored for tables, for which existing softwares are inefficient. The algorithm processes table images by dividing them into cells, even when explicit dividing lines are absent, and accurately identifies decimal points, separators, and numerical values. Evaluation on a dataset consisting of historical price tables demonstrated the efficacy of our approach. Our custom digit recognition network achieved a test accuracy of 99.3%, while the complete system delivered a cell-level accuracy of 97.5% across 40 test images. These results confirm the reliability of our solution for handling tabular data, even with unique properties. Our method not only addresses the challenges of processing archival tables, but also provides a transparent and adaptable framework for broader applications. It has significant potential for practical applications in archives and libraries, and could also inspire advancements in other fields, where available solutions struggle.

Description

Keywords