Survey of conversational behavior: Towards the design of a balanced corpus of everyday Japanese conversation

Hanae Koisot, Tomoyuki Tsuchiya, Ryoko Watanabet, Daisuke Yokomori, Masao Aizawa, Yasuharu Den

研究成果: Chapter in Book/Report/Conference proceedingConference contribution

5 被引用数 (Scopus)

抄録

In 2016, we set about building a large-scale corpus of everyday Japanese conversation-a collection of conversations embedded in naturally occurring activities in daily life. We will collect more than 200 hours of recordings over six years, publishing the corpus in 2022. To construct such a huge corpus, we have conducted a pilot project, one of whose purposes is to establish a corpus design for collecting various kinds of everyday conversations in a balanced manner. For this purpose, we conducted a survey of everyday conversational behavior, with about 250 adults, in order to reveal how diverse our everyday conversational behavior is and to build an empirical foundation for corpus design. The questionnaire included when, where, how long, with whom, and in what kind of activity informants were engaged in conversations. We found that ordinary conversations show the following tendencies: i) they mainly consist of chats, business talks, and consultations; ii) in general, the number of participants is small and the duration of the conversation is short; iii) many conversations are conducted in private places such as homes, as well as in public places such as offices and schools; and iv) some questionnaire items are related to each other. This paper describes an overview of this survey study, and then discusses how to design a large-scale corpus of everyday Japanese conversation on this basis.

本文言語英語
ホスト出版物のタイトルProceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016
編集者Nicoletta Calzolari, Khalid Choukri, Helene Mazo, Asuncion Moreno, Thierry Declerck, Sara Goggi, Marko Grobelnik, Jan Odijk, Stelios Piperidis, Bente Maegaard, Joseph Mariani
出版社European Language Resources Association (ELRA)
ページ4434-4439
ページ数6
ISBN(電子版)9782951740891
出版ステータス出版済み - 2016
イベント10th International Conference on Language Resources and Evaluation, LREC 2016 - Portoroz, スロベニア
継続期間: 5 23 20165 28 2016

出版物シリーズ

名前Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016

その他

その他10th International Conference on Language Resources and Evaluation, LREC 2016
国/地域スロベニア
CityPortoroz
Period5/23/165/28/16

All Science Journal Classification (ASJC) codes

  • 言語学および言語
  • 図書館情報学
  • 言語および言語学
  • 教育

フィンガープリント

「Survey of conversational behavior: Towards the design of a balanced corpus of everyday Japanese conversation」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル