Abstract
The authors have conducted studies on recognizing Arabic news captions to develop a system for video retrieval to index and edit Arabic broadcast programs daily received and stored in big database. This paper describes a dedicated OCR for recognizing low resolution news captions in video images. News caption recognition system consisting of text line extraction, word segmentation and segmentation-recognition of words is developed and the performance was experimentally evaluated using datasets of frame images extracted from AlJazeera broadcasting programs. Character recognition of moving news caption is difficult due to combing noise yielded by the interlacing of scan lines. A technique to detect and eliminate the combing noise to correctly recognize the moving news caption is proposed. This paper also proposes a technique based on inter-frame text difference to detect transition frame of still news captions. The technique to detect transition frames is necessary for efficient video retrieve and play. The proposed technique is experimentally tested and shown to be robust to quick motion of the background and is able to detect the transition frame correctly with the F-measure higher than 90%. When compared with the ABBY FineReader 11 ® commercial OCR the dedicated OCR improves the recall of the Arabic characters in AlJazeera broadcasting news from 70.74% to 95.85% for non-interlaced moving news captions and from 23.82% to 96.29% for interlaced moving news captions.
Original language | English |
---|---|
Title of host publication | 2016 23rd International Conference on Pattern Recognition, ICPR 2016 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 4005-4010 |
Number of pages | 6 |
ISBN (Electronic) | 9781509048472 |
DOIs | |
Publication status | Published - Apr 13 2017 |
Externally published | Yes |
Event | 23rd International Conference on Pattern Recognition, ICPR 2016 - Cancun, Mexico Duration: Dec 4 2016 → Dec 8 2016 |
Other
Other | 23rd International Conference on Pattern Recognition, ICPR 2016 |
---|---|
Country/Territory | Mexico |
City | Cancun |
Period | 12/4/16 → 12/8/16 |
All Science Journal Classification (ASJC) codes
- Computer Vision and Pattern Recognition