Recognition of telops in Arabic news broadcasting

Seiya Iwata, Wataru Oyama, Tetsushi Wakabayashi, Fumitaka Kimura

Research output: Contribution to journalArticle

Abstract

The authors have conducted studies on Arabic telop recognition to develop a system for video retrieval by keyword to index and edit Arabic broadcast programs received daily and stored in a big database. This paper describes a dedicated OCR for recognizing low resolution telop in video images. A telop recognition system consisting of text line extraction, word segmentation and segmentation-recognition of words is developed and the performance was experimentally evaluated using datasets of frame images extracted from AlJazeera broadcasting programs. Character recognition of moving telop is difficult due to combing noise caused by the interlacing of scan-lines. A technique to detect and eliminate the combing noise to correctly recognize the moving telop is proposed. This paper also proposes a technique based on insertion operation with minimum edit distance between successive two telops to connect them. The method to connect the moving telops is necessary for automatic language translation. The proposed method using edit distance for bi-gram sequence of telops (Method-B) is shown to be robust to recognition error of characters and successfully connect the telops.

Original languageEnglish
Pages (from-to)1668-1676
Number of pages9
JournalIEEJ Transactions on Electronics, Information and Systems
Volume136
Issue number12
DOIs
Publication statusPublished - Jan 1 2016
Externally publishedYes

Fingerprint

Optical character recognition
Character recognition
Broadcasting

All Science Journal Classification (ASJC) codes

  • Electrical and Electronic Engineering

Cite this

Recognition of telops in Arabic news broadcasting. / Iwata, Seiya; Oyama, Wataru; Wakabayashi, Tetsushi; Kimura, Fumitaka.

In: IEEJ Transactions on Electronics, Information and Systems, Vol. 136, No. 12, 01.01.2016, p. 1668-1676.

Research output: Contribution to journalArticle

Iwata, Seiya ; Oyama, Wataru ; Wakabayashi, Tetsushi ; Kimura, Fumitaka. / Recognition of telops in Arabic news broadcasting. In: IEEJ Transactions on Electronics, Information and Systems. 2016 ; Vol. 136, No. 12. pp. 1668-1676.
@article{3ab86cb51d2843a99a03d0ab0e935dee,
title = "Recognition of telops in Arabic news broadcasting",
abstract = "The authors have conducted studies on Arabic telop recognition to develop a system for video retrieval by keyword to index and edit Arabic broadcast programs received daily and stored in a big database. This paper describes a dedicated OCR for recognizing low resolution telop in video images. A telop recognition system consisting of text line extraction, word segmentation and segmentation-recognition of words is developed and the performance was experimentally evaluated using datasets of frame images extracted from AlJazeera broadcasting programs. Character recognition of moving telop is difficult due to combing noise caused by the interlacing of scan-lines. A technique to detect and eliminate the combing noise to correctly recognize the moving telop is proposed. This paper also proposes a technique based on insertion operation with minimum edit distance between successive two telops to connect them. The method to connect the moving telops is necessary for automatic language translation. The proposed method using edit distance for bi-gram sequence of telops (Method-B) is shown to be robust to recognition error of characters and successfully connect the telops.",
author = "Seiya Iwata and Wataru Oyama and Tetsushi Wakabayashi and Fumitaka Kimura",
year = "2016",
month = "1",
day = "1",
doi = "10.1541/ieejeiss.136.1668",
language = "English",
volume = "136",
pages = "1668--1676",
journal = "IEEJ Transactions on Electronics, Information and Systems",
issn = "0385-4221",
publisher = "The Institute of Electrical Engineers of Japan",
number = "12",

}

TY - JOUR

T1 - Recognition of telops in Arabic news broadcasting

AU - Iwata, Seiya

AU - Oyama, Wataru

AU - Wakabayashi, Tetsushi

AU - Kimura, Fumitaka

PY - 2016/1/1

Y1 - 2016/1/1

N2 - The authors have conducted studies on Arabic telop recognition to develop a system for video retrieval by keyword to index and edit Arabic broadcast programs received daily and stored in a big database. This paper describes a dedicated OCR for recognizing low resolution telop in video images. A telop recognition system consisting of text line extraction, word segmentation and segmentation-recognition of words is developed and the performance was experimentally evaluated using datasets of frame images extracted from AlJazeera broadcasting programs. Character recognition of moving telop is difficult due to combing noise caused by the interlacing of scan-lines. A technique to detect and eliminate the combing noise to correctly recognize the moving telop is proposed. This paper also proposes a technique based on insertion operation with minimum edit distance between successive two telops to connect them. The method to connect the moving telops is necessary for automatic language translation. The proposed method using edit distance for bi-gram sequence of telops (Method-B) is shown to be robust to recognition error of characters and successfully connect the telops.

AB - The authors have conducted studies on Arabic telop recognition to develop a system for video retrieval by keyword to index and edit Arabic broadcast programs received daily and stored in a big database. This paper describes a dedicated OCR for recognizing low resolution telop in video images. A telop recognition system consisting of text line extraction, word segmentation and segmentation-recognition of words is developed and the performance was experimentally evaluated using datasets of frame images extracted from AlJazeera broadcasting programs. Character recognition of moving telop is difficult due to combing noise caused by the interlacing of scan-lines. A technique to detect and eliminate the combing noise to correctly recognize the moving telop is proposed. This paper also proposes a technique based on insertion operation with minimum edit distance between successive two telops to connect them. The method to connect the moving telops is necessary for automatic language translation. The proposed method using edit distance for bi-gram sequence of telops (Method-B) is shown to be robust to recognition error of characters and successfully connect the telops.

UR - http://www.scopus.com/inward/record.url?scp=85000925663&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85000925663&partnerID=8YFLogxK

U2 - 10.1541/ieejeiss.136.1668

DO - 10.1541/ieejeiss.136.1668

M3 - Article

VL - 136

SP - 1668

EP - 1676

JO - IEEJ Transactions on Electronics, Information and Systems

JF - IEEJ Transactions on Electronics, Information and Systems

SN - 0385-4221

IS - 12

ER -