Home > Research > Publications & Outputs > Text information extraction in images and video
View graph of relations

Text information extraction in images and video: a survey

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published
Close
<mark>Journal publication date</mark>05/2004
<mark>Journal</mark>Pattern Recognition
Issue number5
Volume37
Number of pages21
Pages (from-to)977-997
Publication StatusPublished
<mark>Original language</mark>English

Abstract

Text data present in images and video contain useful information for automatic annotation, indexing, and structuring of images. Extraction of this information involves detection, localization, tracking, extraction, enhancement, and recognition of the text from a given image. However, variations of text due to differences in size, style, orientation, and alignment, as well as low image contrast and complex background make the problem of automatic text extraction extremely challenging. While comprehensive surveys of related problems such as face detection, document analysis, and image & video indexing can be found, the problem of text information extraction is not well surveyed. A large number of techniques have been proposed to address this problem, and the purpose of this paper is to classify and review these algorithms, discuss benchmark data and performance evaluation, and to point out promising directions for future research.