Structural Analysis of Instruction Utterances Using Linguistic and Visual Information

Tomohide Shibata, Masato Tachiki, Daisuke Kawahara, Masashi Okamoto, Sadao Kurohashi, Toyoaki Nishida

Research output: Contribution to journal › Article

3 Citations (Scopus)

Abstract

In realizing a video retrieval system, the crucial point is how to provide an effective method of accessing video content. This paper focuses on Japanese cooking instruction utterances and describes a method of analyzing their structure, which leads to a video summary. We detect the hierarchical structure of video content using linguistic and visual information. We found that integrating visual information improves the detection of task units compared with using linguistic information alone.

Original language: English
Pages (from-to): 393-400
Number of pages: 8
Journal: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 3213
Publication status: Published - Dec 1 2004

Fingerprint

Structural Analysis
Linguistics
Cooking
Video Retrieval
Hierarchical Structure
Unit
Vision

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science (all)

Cite this

Structural Analysis of Instruction Utterances Using Linguistic and Visual Information. / Shibata, Tomohide; Tachiki, Masato; Kawahara, Daisuke; Okamoto, Masashi; Kurohashi, Sadao; Nishida, Toyoaki.

In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 3213, 01.12.2004, p. 393-400.

@article{2c41eedb70dc42cfa879471ef201f9c7,
title = "Structural Analysis of Instruction Utterances Using Linguistic and Visual Information",
abstract = "In realizing a video retrieval system, the crucial point is how to provide an effective method of accessing video content. This paper focuses on Japanese cooking instruction utterances and describes a method of analyzing their structure, which leads to a video summary. We detect the hierarchical structure of video content using linguistic and visual information. We found that integrating visual information improves the detection of task units compared with using linguistic information alone.",
author = "Tomohide Shibata and Masato Tachiki and Daisuke Kawahara and Masashi Okamoto and Sadao Kurohashi and Toyoaki Nishida",
year = "2004",
month = "12",
day = "1",
language = "English",
volume = "3213",
pages = "393--400",
journal = "Lecture Notes in Computer Science",
issn = "0302-9743",
publisher = "Springer Verlag",

}

TY - JOUR

T1 - Structural Analysis of Instruction Utterances Using Linguistic and Visual Information

AU - Shibata, Tomohide

AU - Tachiki, Masato

AU - Kawahara, Daisuke

AU - Okamoto, Masashi

AU - Kurohashi, Sadao

AU - Nishida, Toyoaki

PY - 2004/12/1

Y1 - 2004/12/1

N2 - In realizing a video retrieval system, the crucial point is how to provide an effective method of accessing video content. This paper focuses on Japanese cooking instruction utterances and describes a method of analyzing their structure, which leads to a video summary. We detect the hierarchical structure of video content using linguistic and visual information. We found that integrating visual information improves the detection of task units compared with using linguistic information alone.

UR - http://www.scopus.com/inward/record.url?scp=35048896983&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=35048896983&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:35048896983

VL - 3213

SP - 393

EP - 400

JO - Lecture Notes in Computer Science

JF - Lecture Notes in Computer Science

SN - 0302-9743

ER -