Recognition and transition frame detection of Arabic news captions for video retrieval

Seiya Iwata, Wataru Oyama, Tetsushi Wakabayashi, Fumitaka Kimura

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

The authors have conducted studies on recognizing Arabic news captions to develop a system for video retrieval to index and edit Arabic broadcast programs daily received and stored in big database. This paper describes a dedicated OCR for recognizing low resolution news captions in video images. News caption recognition system consisting of text line extraction, word segmentation and segmentation-recognition of words is developed and the performance was experimentally evaluated using datasets of frame images extracted from AlJazeera broadcasting programs. Character recognition of moving news caption is difficult due to combing noise yielded by the interlacing of scan lines. A technique to detect and eliminate the combing noise to correctly recognize the moving news caption is proposed. This paper also proposes a technique based on inter-frame text difference to detect transition frame of still news captions. The technique to detect transition frames is necessary for efficient video retrieve and play. The proposed technique is experimentally tested and shown to be robust to quick motion of the background and is able to detect the transition frame correctly with the F-measure higher than 90%. When compared with the ABBY FineReader 11 ® commercial OCR the dedicated OCR improves the recall of the Arabic characters in AlJazeera broadcasting news from 70.74% to 95.85% for non-interlaced moving news captions and from 23.82% to 96.29% for interlaced moving news captions.

Original languageEnglish
Title of host publication2016 23rd International Conference on Pattern Recognition, ICPR 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages4005-4010
Number of pages6
ISBN (Electronic)9781509048472
DOIs
Publication statusPublished - Apr 13 2017
Externally publishedYes
Event23rd International Conference on Pattern Recognition, ICPR 2016 - Cancun, Mexico
Duration: Dec 4 2016Dec 8 2016

Other

Other23rd International Conference on Pattern Recognition, ICPR 2016
CountryMexico
CityCancun
Period12/4/1612/8/16

Fingerprint

Optical character recognition
Broadcasting
Character recognition

All Science Journal Classification (ASJC) codes

  • Computer Vision and Pattern Recognition

Cite this

Iwata, S., Oyama, W., Wakabayashi, T., & Kimura, F. (2017). Recognition and transition frame detection of Arabic news captions for video retrieval. In 2016 23rd International Conference on Pattern Recognition, ICPR 2016 (pp. 4005-4010). [7900260] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICPR.2016.7900260

Recognition and transition frame detection of Arabic news captions for video retrieval. / Iwata, Seiya; Oyama, Wataru; Wakabayashi, Tetsushi; Kimura, Fumitaka.

2016 23rd International Conference on Pattern Recognition, ICPR 2016. Institute of Electrical and Electronics Engineers Inc., 2017. p. 4005-4010 7900260.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Iwata, S, Oyama, W, Wakabayashi, T & Kimura, F 2017, Recognition and transition frame detection of Arabic news captions for video retrieval. in 2016 23rd International Conference on Pattern Recognition, ICPR 2016., 7900260, Institute of Electrical and Electronics Engineers Inc., pp. 4005-4010, 23rd International Conference on Pattern Recognition, ICPR 2016, Cancun, Mexico, 12/4/16. https://doi.org/10.1109/ICPR.2016.7900260
Iwata S, Oyama W, Wakabayashi T, Kimura F. Recognition and transition frame detection of Arabic news captions for video retrieval. In 2016 23rd International Conference on Pattern Recognition, ICPR 2016. Institute of Electrical and Electronics Engineers Inc. 2017. p. 4005-4010. 7900260 https://doi.org/10.1109/ICPR.2016.7900260
Iwata, Seiya ; Oyama, Wataru ; Wakabayashi, Tetsushi ; Kimura, Fumitaka. / Recognition and transition frame detection of Arabic news captions for video retrieval. 2016 23rd International Conference on Pattern Recognition, ICPR 2016. Institute of Electrical and Electronics Engineers Inc., 2017. pp. 4005-4010
@inproceedings{dbeae7853ee04fda87cd9ca27d79dd3c,
title = "Recognition and transition frame detection of Arabic news captions for video retrieval",
abstract = "The authors have conducted studies on recognizing Arabic news captions to develop a system for video retrieval to index and edit Arabic broadcast programs daily received and stored in big database. This paper describes a dedicated OCR for recognizing low resolution news captions in video images. News caption recognition system consisting of text line extraction, word segmentation and segmentation-recognition of words is developed and the performance was experimentally evaluated using datasets of frame images extracted from AlJazeera broadcasting programs. Character recognition of moving news caption is difficult due to combing noise yielded by the interlacing of scan lines. A technique to detect and eliminate the combing noise to correctly recognize the moving news caption is proposed. This paper also proposes a technique based on inter-frame text difference to detect transition frame of still news captions. The technique to detect transition frames is necessary for efficient video retrieve and play. The proposed technique is experimentally tested and shown to be robust to quick motion of the background and is able to detect the transition frame correctly with the F-measure higher than 90{\%}. When compared with the ABBY FineReader 11 {\circledR} commercial OCR the dedicated OCR improves the recall of the Arabic characters in AlJazeera broadcasting news from 70.74{\%} to 95.85{\%} for non-interlaced moving news captions and from 23.82{\%} to 96.29{\%} for interlaced moving news captions.",
author = "Seiya Iwata and Wataru Oyama and Tetsushi Wakabayashi and Fumitaka Kimura",
year = "2017",
month = "4",
day = "13",
doi = "10.1109/ICPR.2016.7900260",
language = "English",
pages = "4005--4010",
booktitle = "2016 23rd International Conference on Pattern Recognition, ICPR 2016",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

TY - GEN

T1 - Recognition and transition frame detection of Arabic news captions for video retrieval

AU - Iwata, Seiya

AU - Oyama, Wataru

AU - Wakabayashi, Tetsushi

AU - Kimura, Fumitaka

PY - 2017/4/13

Y1 - 2017/4/13

N2 - The authors have conducted studies on recognizing Arabic news captions to develop a system for video retrieval to index and edit Arabic broadcast programs daily received and stored in big database. This paper describes a dedicated OCR for recognizing low resolution news captions in video images. News caption recognition system consisting of text line extraction, word segmentation and segmentation-recognition of words is developed and the performance was experimentally evaluated using datasets of frame images extracted from AlJazeera broadcasting programs. Character recognition of moving news caption is difficult due to combing noise yielded by the interlacing of scan lines. A technique to detect and eliminate the combing noise to correctly recognize the moving news caption is proposed. This paper also proposes a technique based on inter-frame text difference to detect transition frame of still news captions. The technique to detect transition frames is necessary for efficient video retrieve and play. The proposed technique is experimentally tested and shown to be robust to quick motion of the background and is able to detect the transition frame correctly with the F-measure higher than 90%. When compared with the ABBY FineReader 11 ® commercial OCR the dedicated OCR improves the recall of the Arabic characters in AlJazeera broadcasting news from 70.74% to 95.85% for non-interlaced moving news captions and from 23.82% to 96.29% for interlaced moving news captions.

AB - The authors have conducted studies on recognizing Arabic news captions to develop a system for video retrieval to index and edit Arabic broadcast programs daily received and stored in big database. This paper describes a dedicated OCR for recognizing low resolution news captions in video images. News caption recognition system consisting of text line extraction, word segmentation and segmentation-recognition of words is developed and the performance was experimentally evaluated using datasets of frame images extracted from AlJazeera broadcasting programs. Character recognition of moving news caption is difficult due to combing noise yielded by the interlacing of scan lines. A technique to detect and eliminate the combing noise to correctly recognize the moving news caption is proposed. This paper also proposes a technique based on inter-frame text difference to detect transition frame of still news captions. The technique to detect transition frames is necessary for efficient video retrieve and play. The proposed technique is experimentally tested and shown to be robust to quick motion of the background and is able to detect the transition frame correctly with the F-measure higher than 90%. When compared with the ABBY FineReader 11 ® commercial OCR the dedicated OCR improves the recall of the Arabic characters in AlJazeera broadcasting news from 70.74% to 95.85% for non-interlaced moving news captions and from 23.82% to 96.29% for interlaced moving news captions.

UR - http://www.scopus.com/inward/record.url?scp=85019138531&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85019138531&partnerID=8YFLogxK

U2 - 10.1109/ICPR.2016.7900260

DO - 10.1109/ICPR.2016.7900260

M3 - Conference contribution

AN - SCOPUS:85019138531

SP - 4005

EP - 4010

BT - 2016 23rd International Conference on Pattern Recognition, ICPR 2016

PB - Institute of Electrical and Electronics Engineers Inc.

ER -