Character-independent font identification

Daichi Haraguchi, Shota Harada, Kenji Iwana Brian, Yuto Shinahara, Seiichi Uchida

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

There are a countless number of fonts with various shapes and styles. In addition, there are many fonts that only have subtle differences in features. Due to this, font identification is a difficult task. In this paper, we propose a method of determining if any two characters are from the same font or not. This is difficult due to the difference between fonts typically being smaller than the difference between alphabet classes. Additionally, the proposed method can be used with fonts regardless of whether they exist in the training or not. In order to accomplish this, we use a Convolutional Neural Network (CNN) trained with various font image pairs. In the experiment, the network is trained on image pairs of various fonts. We then evaluate the model on a different set of fonts that are unseen by the network. The evaluation is performed with an accuracy of 92.27%. Moreover, we analyzed the relationship between character classes and font identification accuracy.

Original languageEnglish
Title of host publicationDocument Analysis Systems - 14th IAPR International Workshop, DAS 2020, Proceedings
EditorsXiang Bai, Dimosthenis Karatzas, Daniel Lopresti
PublisherSpringer
Pages497-511
Number of pages15
ISBN (Print)9783030570576
DOIs
Publication statusPublished - 2020
Event14th IAPR International Workshop on Document Analysis Systems, DAS 2020 - Wuhan, China
Duration: Jul 26 2020Jul 29 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12116 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th IAPR International Workshop on Document Analysis Systems, DAS 2020
CountryChina
CityWuhan
Period7/26/207/29/20

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'Character-independent font identification'. Together they form a unique fingerprint.

Cite this