Databases of mathematical documents

Masakazu Suzuki, Christopher Malon, Seiichi Uchida

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

This paper describes the specifications for three ground-truthed mathematical character and symbol image databases, called InftyCDB-1, InftyCDB-2, and InftyCDB-3. In the former two databases, the ground-truth of each character is composed of type, font, quality (touching/broken) and link (relative position), etc. InftyCDB-1 includes all the characters and symbols of 30 articles on mathematics, and is organized so that it can be used as word image database or as mathematical formula image database. InftyCDB-2, which is a continuation of InftyCDB-1, includes 37 articles including French and German articles and is organized like InftyCDB-1. InftyCDB-3 is a single character database for training and evaluating single-character recognition engines.

Original languageEnglish
Pages (from-to)7-14
Number of pages8
JournalResearch Reports on Information Science and Electrical Engineering of Kyushu University
Volume12
Issue number1
Publication statusPublished - Mar 1 2007

Fingerprint

Character recognition
Engines
Specifications

All Science Journal Classification (ASJC) codes

  • Computer Science(all)
  • Electrical and Electronic Engineering

Cite this

Databases of mathematical documents. / Suzuki, Masakazu; Malon, Christopher; Uchida, Seiichi.

In: Research Reports on Information Science and Electrical Engineering of Kyushu University, Vol. 12, No. 1, 01.03.2007, p. 7-14.

Research output: Contribution to journalArticle

@article{dd6d85c791994d6ebf2d1d405649a379,
title = "Databases of mathematical documents",
abstract = "This paper describes the specifications for three ground-truthed mathematical character and symbol image databases, called InftyCDB-1, InftyCDB-2, and InftyCDB-3. In the former two databases, the ground-truth of each character is composed of type, font, quality (touching/broken) and link (relative position), etc. InftyCDB-1 includes all the characters and symbols of 30 articles on mathematics, and is organized so that it can be used as word image database or as mathematical formula image database. InftyCDB-2, which is a continuation of InftyCDB-1, includes 37 articles including French and German articles and is organized like InftyCDB-1. InftyCDB-3 is a single character database for training and evaluating single-character recognition engines.",
author = "Masakazu Suzuki and Christopher Malon and Seiichi Uchida",
year = "2007",
month = "3",
day = "1",
language = "English",
volume = "12",
pages = "7--14",
journal = "Research Reports on Information Science and Electrical Engineering of Kyushu University",
issn = "1342-3819",
publisher = "Kyushu University, Faculty of Science",
number = "1",

}

TY - JOUR

T1 - Databases of mathematical documents

AU - Suzuki, Masakazu

AU - Malon, Christopher

AU - Uchida, Seiichi

PY - 2007/3/1

Y1 - 2007/3/1

N2 - This paper describes the specifications for three ground-truthed mathematical character and symbol image databases, called InftyCDB-1, InftyCDB-2, and InftyCDB-3. In the former two databases, the ground-truth of each character is composed of type, font, quality (touching/broken) and link (relative position), etc. InftyCDB-1 includes all the characters and symbols of 30 articles on mathematics, and is organized so that it can be used as word image database or as mathematical formula image database. InftyCDB-2, which is a continuation of InftyCDB-1, includes 37 articles including French and German articles and is organized like InftyCDB-1. InftyCDB-3 is a single character database for training and evaluating single-character recognition engines.

AB - This paper describes the specifications for three ground-truthed mathematical character and symbol image databases, called InftyCDB-1, InftyCDB-2, and InftyCDB-3. In the former two databases, the ground-truth of each character is composed of type, font, quality (touching/broken) and link (relative position), etc. InftyCDB-1 includes all the characters and symbols of 30 articles on mathematics, and is organized so that it can be used as word image database or as mathematical formula image database. InftyCDB-2, which is a continuation of InftyCDB-1, includes 37 articles including French and German articles and is organized like InftyCDB-1. InftyCDB-3 is a single character database for training and evaluating single-character recognition engines.

UR - http://www.scopus.com/inward/record.url?scp=34548097756&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34548097756&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:34548097756

VL - 12

SP - 7

EP - 14

JO - Research Reports on Information Science and Electrical Engineering of Kyushu University

JF - Research Reports on Information Science and Electrical Engineering of Kyushu University

SN - 1342-3819

IS - 1

ER -