SEARCH

Search Details

Itoh Toshihiko

Faculty of Information Science and Technology Media and Network Technologies Information Media Science and TechnologyAssociate Professor
Education and Research Center for Mathematical and Data ScienceAssociate Professor

Researcher basic information

■ Degree
  • 博士(工学), 豊橋技術科学大学
■ URL
researchmap URLホームページURL■ Various IDs
J-Global ID■ Research Keywords and Fields
Research Keyword
  • 対話制御
  • 発話意図
  • 音声対話
  • 音声対話システム
  • 対話リズム
  • ユーザ満足度
  • 音声言語理解
  • 発話タイミング
  • 韻律
  • 身体性
  • 動画
  • 話者交替
  • 共同補完
  • ペン入力
  • アニメーション生成
  • 音声インターフェース
  • 学習支援システム
  • 生命音声認識
  • 教材知識ベース
  • MULTEXT
  • 文献検索
  • 日本語教育
  • 顔表情
  • 韻律コーパス
  • 携帯情報端末
  • 文脈処理
  • 基本周波数
  • フォーム入力
  • 姓名音声認識
  • 音声言語情報処理
  • Speech Language Processing
Research Field
  • Informatics, Intelligent robotics
  • Informatics, Perceptual information processing
  • Humanities & Social Sciences, Educational technology
  • Informatics, Intelligent informatics
■ Educational Organization

Career

■ Career
Career
  • 2007 - 2010
    北海道大学 大学院・情報科学研究科, 准教授
  • 1999 - 2002
    Shizuoka University
  • 1999 - 2002
    Shizuoka University, Research Assistant
Educational Background
  • 1999, Toyohashi University of Technology, 工学研究科, 電子情報, Japan
  • 1999, Toyohashi University of Technology, Graduate School, Division of Engineering
  • 1996, Toyohashi University of Technology, Faculty of Engineering, 情報, Japan
  • 1996, Toyohashi University of Technology, Faculty of Engineering

Research activity information

■ Awards
  • Dec. 2018, 電子情報通信学会, 平成30年度ヒューマンコミュニケーション賞
    PCノートテイカーによる誤入力文章の自動修正システム
    平井 康義;伊藤 敏彦
■ Papers
  • A Mirror-effect-based Mutual Tutorial System for Learning Operations on Different Interfaces of the Same Software Service
    Keiko Katsuragawa; Takeshi Oono; Minoru Tomikashi; Satoru Kogure; Toshihiko Itoh; Tatsuhiro Konishi; Yukihiro Itoh
    IPSJ Journal, 50, 1, 181, 192, Information Processing Society of Japan (IPSJ), 15 Jan. 2009
    Japanese, Recently, cell phones, PCs and car navigation systems are increasingly used for taking advantage of a single software service. Although such a system typically offers different interfaces according to the users' environments, not every user is familiar to the operation of all the available devices; hence, the users face difficulties in switching from one device to another. One of the biggest problems of a service accessible from multiple devices is that the users must learn different operations on different interfaces. Needless to say, this is a heavy burden on the users and it is desirable...
  • Prediction of driving actions from driving signals
    Toshihiko Itoh; Shinya Yamada; Kazumasa Yamamoto; Kenji Araki
    In-Vehicle Corpus and Signal Processing for Driver Behavior, 197, 210, Springer Science and Business Media, LLC, 2009
    English, International conference proceedings
  • Subjective Experiments on Influence of Response Timing in Spoken Dialogues
    Toshihiko Itoh; Norihide Kitaoka; Ryota Nishimura
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 1803, +, 2009, [Peer-reviewed]
    English, International conference proceedings
  • Spoken language understanding method using confidence measure and dialogue history
    Noriki Fujiwara; Toshihiko Itoh; Kenji Araki; Atsuhiko Kai; Tatsuhiro Konishi; Yukihiro Itoh
    Systems and Computers in Japan, 38, 9, 21, 31, Aug. 2007
    English, Scientific journal
  • 音声認識・言語理解性能や状況の違いによるレタスク指向音声対話の言語的・音響的特徴の比較
    伊藤敏彦; 山田真也; 荒木健治
    日本音響学会誌, 63, 5, 251, 261, The Acoustical Society of Japan (ASJ), 01 May 2007
    Japanese, 人間同士又は人間と機械との音声対話において,対話相手の音声認識・言語理解性能,対話状況や対話相手の違いによって生じる被験者発話の言語的・音響的特徴の差に関して実音声対話データの分析結果から明らかにする。機械との対話を扱うため,比較的単純な状況設定としてカーナビゲーションシステムにおける目的地検索・設定タスクを想定し,その音声インタフェースという具体的な状況設定において被験者発話に現れる言語的・音響的な特徴の差を比較した。想定した状況は,音声認識・言語理解率が100%と約80%の場合,対話相手が人間,応答能力が制限された人間,又は機械の場合,そして運転中又は停車中の場合である。これらの対話状況の違いにより発話にどのような違いがあるか,被験者24名による実対話音声の収録データに基づいて分析を行った。運転操作中の状況設定に関しては,擬似的な運転操作環境を設定した。その結果,運転操作の有無による言語的な特徴の差異はほとんどないが,音響的な特微の違いが一部見られたほか,対話相手側の応答に関する能力が制限されると被験者発話において幾つかの言語的・音響的な特徴が現れることが明らかになった。
  • Analysis of changes in dialogue rhythm due to dialogue acts in task-oriented dialogues
    Noriki Fujiwara; Toshihiko Itoh; Kenji Araki
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 4629, 564, 573, 2007, [Peer-reviewed]
    English, International conference proceedings
  • Integrated Japanese dependency analysis using a dialog context
    Yuki Ikegaya; Yasuhiro Noguchi; Satoru Kogure; Toshihiko Itoh; Tatsuhiro Konishi; Makoto Kondo; Hideki Asoh; Akira Takagi; Yukihiro Itoh
    Transactions of the Japanese Society for Artificial Intelligence, 22, 3, 291, 310, Japanese Society for Artificial Intelligence, 2007
    Japanese, Scientific journal
  • Spoken Language Understanding Method Using Confidence Measure and Dialogue History
    FUJIWARA Noriki; ITOH Toshihiko; ARAKI Kenji; KAI Atsuhiko; KONISHI Tatsuhiro; ITOH Yukihiro
    The IEICE transactions on information and systems (Japanese edetion), 89, 7, 1493, 1503, The Institute of Electronics, Information and Communication Engineers, 01 Jul. 2006
    Japanese, 実環境での音声対話システムの使用において,誤認識を回避することは難しい.誤認識が起きると,システムはユーザの期待する応答とかけ離れた応答を行い,対話がスムーズに進まなくなることも多い.そこで本研究では,音声認識器が誤認識した場合でも,認識信頼度と対話履歴を用いることで正しくユーザの意図を推定することができる音声言語理解手法を提案する.これは,音声認識器が誤認識した場合でも多くの場合,複数候補(N-best)中に正解が含まれていること,システムが誤認識した場合にはユーザは大体訂正反応を示すこと,タスク指向対話には強い一貫性がありユーザは基本的に意味的・文脈的に関係した内容以外を発話しないことを利用する.また,提案手法ではあらかじめすべての認識可能単語を理解候補として保持し,言語理解部の対話戦略において音声認識結果中の単語との意味的関連性などを考慮している.これにより音声認識結果のN-best中に正解の一部が含まれていない場合でも,複数のユーザ発話の認識結果に基づくことで正しい意図を推定することが可能となっている.評価データにおいて,提案手法における対話単位での理解率は72.2%(21,430/29,670対話),単語単位での理解率は87.1%(77,544/89,010単語)であり,従来手法の最新認識結果の上位候補を優先するシステムの57.9% (17,178/29,670対話...
  • Is Voice Quality Enough? - Study on How the Situation and User's Awareness Influence the Utterance Features
    Shinya Yamada; Toshihiko Itoh; Kenji Araki
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 481, 484, 2006
    English, International conference proceedings
  • Action Prediction Method Using Recursive Different and Common Parts Extraction Method with N-gram
    XU Jin'an; ITOH Toshihiko; ARAKI Kenji
    Human interface. The Transaction of Human Interface Society, 7, 1, 55, 67, ヒュ-マンインタフェ-ス学会, 25 Feb. 2005
    Japanese
  • 「ソーシャルインタフェース」N‐gramを用いた再帰差異共通部分抽出法によるユーザの行動予測手法
    XU J; 伊藤敏彦; 荒木健治
    ヒューマンインタフェース学会論文誌, 7, 1, 55, 67, Feb. 2005
    Japanese
  • Linguistic and acoustic features depending on different situations - The experiments considering speech recognition rate
    Shinya Yamada; Toshihiko Itoh; Kenji Araki
    9th European Conference on Speech Communication and Technology, 3393, 3396, 2005
    English, International conference proceedings
  • A point-pass-based action prediction method
    JA Xu; T Itoh; K Araki; K Tochinai
    IEEE INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2004 (ISCIT 2004), PROCEEDINGS, VOLS 1 AND 2, 103, 108, 2004
    English, International conference proceedings
  • Evaluation of action prediction method using inductive learning with N-gram
    JA Xu; T Itoh; K Araki; K Tochinai
    2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 1605, 1609, 2004
    English, International conference proceedings
  • Construction and Evaluation of a Sub System DPS-PC Which Supports Users in Making and Editing a Drive Plan (Navigation)(Mobile Systems and Intelligent Transport Systems (ITS) under Ubiquitous Environment)
    KATSURAGAWA KEIKO; YANAGI TAKURA; OONO KEN; WATANABE MASAKI; ITOH TOSHIHIKO; KONISHI TATSUHIRO; ITOH YUKIHIRO
    IPSJ Journal, 44, 12, 2990, 3001, Information Processing Society of Japan (IPSJ), 15 Dec. 2003
    Japanese, In this paper, we propose a drive planning system that supports users in making a plan for a trip. We introduce a sub-system named DPS-PC which runs on stand-alone PC. We think if we can register our trip plan to an ITS system previously, the ITS services it provides for us will be more rich. DPS-PC has the function to help users decide several factors of a trip: multiple destinations and waypoints, arrival and departure times, the number of days that the trip will take and the route. The drive is planned interactively by a dialog with the system through a natural language interface. We dis...
  • Comparison of Linguistic and Acoustic Features Caused by Different Dialogue Situations in a Landmark-input Task
    ITO Toshihiko; KAI Atsuhiko; IWAMOTO Yoshiyuki; MIZUTANI Makoto; YUASA Hiroki; KONISHI Tatsuhiro; ITOH Yukihiro
    Transactions of Information Processing Society of Japan, 43, 7, 2118, 2129, Information Processing Society of Japan (IPSJ), 15 Jul. 2002
    Japanese, This paper presents the characteristic differences of acoustic and linguistic features observed for different spoken dialogue situations in human-human and human-machine interactions. We compare the acoustic and linguistic features of the user's dialogue speech both for a spoken dialogue system and an actual human-operator service in several landmark-setting tasks for a car navigation system. It is known that speech-based interaction has the potential to distract drivers and degrade safety. On the other hand, it is not clear whether a different dialogue situation causes some acoustic or linguistic differences on their utterances in a speech interface system. We collected a set of spoken dialogue data by 10 subject speakers under several dialogue situations. For the car-driving condition, we prepared a virtual driving simulation system. We analyzed the characteristic differences of user utterances caused by different dialogue situations or the system understanding errors. As a result, we observed that the existence of a car-driving task affects some prosodic features and the difference of humanmachine and human-human dialogue conditions also affects the other acoustic and linguistic features, while no significant differences are observed for the other acoustic and linguistic features whether they performed a car-driving task or not.
  • A Dialogue Interface of a Drive Planning System(Recent Advancements of Spoken Language Interfaces and Dialogue Systems)
    Itoh Yukihiro; Konishi Tatsuhiro; Itoh Toshihiko; Katsuragawa Keiko
    Journal of Japanese Society for Artificial Intelligence, 17, 3, 285, 290, The Japanese Society for Artificial Intelligence, 01 May 2002
    Japanese
  • Juncture segmentation of Japanese prosodic unit based on the spectrographic features
    Kitazawa Shigeyoshi; Itoh Toshihiko; Kitamura Tatsuya
    7th International Conference on Spoken Language Processing, ICSLP 2002, 1201, 1204, International Speech Communication Association, 2002
    English, International conference proceedings
  • Semantic interpreter and a cooperative response generator for a robust spoken dialogue system
    Seiichi Nakagawa; Satoru Kogure; Toshihiko Itoh
    International Journal of Pattern Recognition and Artificial Intelligence, 14, 5, 553, 569, World Scientific Publishing Co. Pte. Ltd, 2000
    English, Scientific journal
  • A Spoken Dialogue System with Cooperative Response and Evaluation for the System(on Next Generation Human Interface and Interaction)
    ITOH TOSHIHIKO; KOGURE SATORU; NAKAGAWA SEIICHI
    IPSJ Journal, 39, 5, 1248, 1257, Information Processing Society of Japan (IPSJ), 15 May 1998
    Japanese, We have developed a robust dialogue system which aids users in information retrieval through spontaneous speech. Dialog system through natural language must be designed so that it can cooperatively response to users. Based on this consideration, we developed a cooperative response generator in the dialogue system. The response generator is composed of dialog manager, problem solver, knowledge databases, and response sentence generator. The response generator receives a semantic representation (that is, semantic network) which the interpreter builds for the user's utterance and generates as ...
  • A Sightseeing Guidance Spoken Dialogue System with Multi-Modal Interface
    Nakagawa Seiichi; Denda Akihiro; Itoh Toshihiko
    Journal of Japanese Society for Artificial Intelligence, 13, 2, 241, 251, The Japanese Society for Artificial Intelligence, 01 Mar. 1998
    Japanese, Recent improvements of speech recognition and natural language processing enable dialogue systems to deal with spontaneous speech. With the aim of supporting these systems, multi-modal man-machine interface has been introduced to the system widely. We have been aiming at realization of a robust dialogue system using spontaneous speech as main input modality. Although our conventional system was developed with a robust natural language interpreter, since its user interface was built only on speech, the system did not always give enough usability. However, in this case, response sentences bec...
■ Other Activities and Achievements
  • Exploring the Use of Word Replacement-Based Manzai for Humorous Responses in Open-Domain Dialogue
    片岸祥帆; 伊藤敏彦, 情報処理学会研究報告(Web), 2025, NL-263, 2025
  • 算術タスクを用いた文脈内学習による外挿能力の分析
    進藤稜真; 竹下昌志; ジェプカ ラファウ; 伊藤敏彦, 言語処理学会年次大会発表論文集(Web), 31st, 2025
  • Verification of character impersonation response generation using game facial expression patterns
    連慎治; 伊藤敏彦, 人工知能学会全国大会論文集(Web), 39th, 2025
  • Vector Generation Reflecting Real Person’s Chat-Style Characteristics and Verification of Identification Accuracy
    倉知祥太朗; 伊藤敏彦, 人工知能学会全国大会論文集(Web), 39th, 2025
  • Research on Natural Topic Transition, Detection of Information Acquisition Timing in Mental Health Support Systems Using Casual Conversations
    YOO Dongkeun; 伊藤敏彦, 電子情報通信学会技術研究報告(Web), 123, 416(NLC2023 23-26), 2024
  • 雑談中の発話と文脈から話者情報を抽出するLLMの能力に関する検証
    連慎治; 竹下昌志; 伊藤敏彦, 言語処理学会年次大会発表論文集(Web), 30th, 2024
  • Extracting Conceptual Relation Graphs for Solving Story Tasks
    岡田憩; RZEPKA Rafal; 伊藤敏彦, 電子情報通信学会技術研究報告(Web), 124, 173(NLC2024 1-18), 2024
  • A Hierarchical Story Outlines Generation Method with Revision Functions
    會田尚平; 伊藤敏彦, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2024, 2024
  • Detecting Persona Information in Chat using Subject Recovery with Machine Translation
    連慎治; 伊藤敏彦; 荒木健治, 電子情報通信学会技術研究報告(Web), 122, 287(NLC2022 9-18), 2022
  • Prediction of User Personas from Utterances for Chatting
    連慎治; 伊藤敏彦; 荒木健治, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2022, 2022
  • Automatic Correction System For Erroneous Input Sentences By PC Notetaker
    平井康義; 伊藤敏彦, 電子情報通信学会技術研究報告, 117, 502(WIT2017 66-91), 2018
  • PCノートテイキングのためのアシスタントシステムの検討
    平井康義; 伊藤敏彦, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2017, 2017
  • 機械学習を用いたリアルタイム有声休止判定システム
    小川翼; 伊藤敏彦, 人工知能学会言語・音声理解と対話処理研究会資料, 68th, 2013
  • 機械学習を用いたリアルタイム発話継続,発話終了予測システム
    伊藤敏彦; 小川翼, 人工知能学会言語・音声理解と対話処理研究会資料, 68th, 2013
  • A-17-20 Investigation of Driving-Behavior Modeling for Recognizing Driving Situation
    Ema Junki; Wang Longbiao; Kai Atsuhiko; Itoh Toshihiko, Proceedings of the Society Conference of IEICE, 2010, 0, 166, 166, 31 Aug. 2010
    The Institute of Electronics, Information and Communication Engineers, Japanese
  • 車の運転状況の認識のための運転行動モデルの検討
    江間旬記; WANG Longbiao; 甲斐充彦; 伊藤敏彦, 電子情報通信学会大会講演論文集, 2010, 166, 166, 31 Aug. 2010
    The Institute of Electronics, Information and Communication Engineers, Japanese
  • Influence of Response Timing in Speech Dialogues
    ITOH TOSHIHIKO; KITAOKA NORIHIDE; NISHIMURA RYOTA, 電子情報通信学会技術研究報告, 108, 283(NLC2008 19-23), 7, 12, 03 Nov. 2008
    In order to examine the validity of the findings on the dialogue rhythm which we previously pointed out, we made some dialogue samples with various rhythm/utterance timing and evaluated them subjectively from the view points of naturalness of whole the dialogues, unnaturalness of synthesized speech, and intelligibility of the dialogues. We used short task-oriented four-turns dialogues using speech synthesizer in Experiment No. 1, and approx. one-minute chat-like dialogues in No. 2 using natural human utterances and synthesized voices. The results of these experiments supported our previous analysis that the utterance timing is important for natural dialogue and the timing of each utterance mainly depends on the contents of the utterance., The Institute of Electronics, Information and Communication Engineers, Japanese
  • Subjective Experiments on Influence of Response Timing in Speech Dialogues
    ITOH TOSHIHIKO; KITAOKA NORIHIDE; NISHIMURA RYOTA, 情報処理学会研究報告, 2008, 68(SLP-72), 99, 104, 11 Jul. 2008
    Japanese
  • Predicting Speech Recognition Performance Degradation Using Utterance Overlapping Information in Spoken Dialogue Systems
    NAKANO MIKIO; FUNAKOSHI KOTARO; ITOH TOSHIHIKO; ARAKI KENJI; HASEGAWA YUJI; TSUJINO HIROSHI, 人工知能学会全国大会論文集(CD-ROM), 22nd, 1H1-04, 4, 2008
    人工知能学会, Japanese
  • Influence of mutual dialogue acts on dialogue rhythm in task-oriented dialogues
    藤原 敬記; 伊藤 敏彦; 荒木 健治, 言語・音声理解と対話処理研究会, 50, 0, 45, 50, 23 Jul. 2007
    人工知能学会, Japanese
  • Influence of Dialogue Acts on Dialogue Rhythm in Task-oriented Dialogues
    FUJIWARA Noriki; ITOH Toshihiko; ARAKI Kenji, IPSJ SIG Notes, 2007, 47, 37, 42, 24 May 2007
    We consider that factors such as prosody of systems' utterances and dialogue rhythm axe important to attain a natural human-machine dialogue. However, it has been not revealed the relations between dialogue rhythm and speaker's various states in task-oriented dialogue. In this study, we collected task-oriented dialogues and analyzed the relations between "dialogue structures, kinds of dialogu acts (contents of utterances), Aizuchi (backchannel/acknowledgment), Repeat and interjection" and "dialogue rhythm (response timing, F0, and speech rate)". From the results, we understood that dialogue rhythm is affected by dialogue structures and dialogue acts significantly, moreover, utterances of Aizuchi and Repeat conform to restrictions to keep dialogue rhythm., Information Processing Society of Japan (IPSJ), Japanese
  • A study of integration method of word confidence measure from multiple recognition results and linguistic understanding
    ONJI YUTA; KOGURE SATORU; ITOH TOSHIHIKO; KAI ATSUHIKO; KONISHI TATSUHIRO; ITO YUKIHIRO, 人工知能学会言語・音声理解と対話処理研究会資料, 49th, 57, 62, 02 Mar. 2007
    人工知能学会, Japanese
  • A method of dynamically generating word occurrence probabilities according to the contextual information and the system response
    Iwasaki Yoshinori; Kogure Satoru; Itoh Toshihiko; Kai Atsuhiko; Konishi Tatsuhiro; Itoh Yukihiro, IPSJ SIG Notes, 2007, 11, 67, 72, 09 Feb. 2007
    Recently, the technology of speech recognition and natural language processing, and the performance of computer calculation ability has been highly improved, so we can utilize speech interface to handle information service in car. Cars' spoken dialogue systems like existent navigation system, however, often misrecognized user utterances. In this paper, the system predicts the frequency uttered word using the contextual information and the system response, and raise word occurrence probabilities of those words. As a result, we make the correct answer word appear in the recognition result eas..., Information Processing Society of Japan (IPSJ), Japanese
  • A method of dynamically generating word occurrence probabilities according to the contextual information and the system response
    IWASAKI YOSHINORI; KOGURE SATORU; ITOH TOSHIHIKO; KAI ATSUHIKO; KONISHI TATSUHIRO; ITO YUKIHIRO, 情報処理学会研究報告, 2007, 11(HI-122 SLP-65), 67, 72, 09 Feb. 2007
    Recently, the technology of speech recognition and natural language processing, and the performance of computer calculation ability has been highly improved, so we can utilize speech interface to handle information service in car. Cars' spoken dialogue systems like existent navigation system, however, often misrecognized user utterances. In this paper, the system predicts the frequency uttered word using the contextual information and the system response, and raise word occurrence probabilities of those words. As a result, we make the correct answer word appear in the recognition result easily. As a result of the evaluation experiment, the word recognition rate rose from 83.5% to 85.1% according to use the proposal method. We show the effectiveness of the method., Information Processing Society of Japan (IPSJ), Japanese
  • E_009 Making prototype development environments of dialogue control with high modularity
    Shigeta Yoshihiro; Ikegaya Yuki; Noguchi Yasuhiro; Kogure Satoru; Itoh Toshihiko; Konishi Tatsuhiro; Kondo Makoto; Itoh Yukihiro, 情報科学技術フォーラム一般講演論文集, 5, 2, 157, 158, 21 Aug. 2006
    Forum on Information Technology, Japanese
  • Identification of Utterances Made to a System by Using In-Car Speech Acoustic Features
    YAMADA Shinya; ITOH Toshihiko; ARAKI Kenji, IPSJ SIG Notes, 2006, 40, 7, 12, 11 May 2006
    This paper presents usefulness of identifying user's utterances made to a spoken dialogue system using machine learning which uses acoustic features of user's utterances recorded in various situations. We have already performed dialogue experiments with two speakers (human-human or human-machine patterns) in several situations and we newly performed the experiments with three speakers (human-human-machine). The dialogue task simulates voice control of a car navigation system, where we made users perform goal settings or look the goal up in destination database. We prepared a spoken dialogue..., Information Processing Society of Japan (IPSJ), Japanese
  • Spoken Dialog System considered Rhythm and Synchronized Tendency of Conversation
    SHOJI Keisuke; TAKAHASHI Mika; IBARA Seiya; ITOH Toshihiko; ARAKI Kenji, IPSJ SIG Notes, 2006, 40, 43, 48, 11 May 2006
    The best rhythm of the conversation between humans is developed during their conversation. It can be expected that users conscious of rhythm will perform smoother conversation. In this paper, as one of the methods is to copy human communication abilities as much as possible, we develop spoken dialog system which puts the importance to the rhythm of dialog aiming at improvement of user satisfaction by encouraging an user to utter naturally. Elements of rhythm of dialog that we paid our attention is speaking rate and timing of an utterance and backchanneling from a system. To realize such nat..., Information Processing Society of Japan (IPSJ), Japanese
  • Identification of Utterances Made to a System by Using In-Car Speech Acoustic Features
    YAMADA SHIN'YA; ITOH TOSHIHIKO; ARAKI KENJI, 情報処理学会研究報告, 2006, 40(SLP-61), 7, 12, 11 May 2006
    This paper presents usefulness of identifying user's utterances made to a spoken dialogue system using machine learning which uses acoustic features of user's utterances recorded in various situations. We have already performed dialogue experiments with two speakers (human-human or human-machine patterns) in several situations and we newly performed the experiments with three speakers (human-human-machine). The dialogue task simulates voice control of a car navigation system, where we made users perform goal settings or look the goal up in destination database. We prepared a spoken dialogue system for all experiments and prepared a human operator for the experiment with two speakers. We used the dialogue data achieved from the experiments and identified user's utterances made to the spoken dialogue system. Additionally, by comparison with utterances which were collected from different situations, we researched the influence of various conditions on performance of identifying utterances., Information Processing Society of Japan (IPSJ), Japanese
  • Spoken Dialog System considered Rhythm and Synchronized Tendency of Conversation
    SHOJI KEISUKE; TAKAHASHI MIKA; IBARA SEIYA; ITOH TOSHIHIKO; ARAKI KENJI, 情報処理学会研究報告, 2006, 40(SLP-61), 43, 48, 11 May 2006
    The best rhythm of the conversation between humans is developed during their conversation. It can be expected that users conscious of rhythm will perform smoother conversation. In this paper, as one of the methods is to copy human communication abilities as much as possible, we develop spoken dialog system which puts the importance to the rhythm of dialog aiming at improvement of user satisfaction by encouraging an user to utter naturally. Elements of rhythm of dialog that we paid our attention is speaking rate and timing of an utterance and backchanneling from a system. To realize such natural rhythm, we newly designed three modules-Understanding Component to predict user's task intention in the middle of his utterance while performing language understanding by a pause unit; Response Generator which generates the response considering rhythm and uses a user model; Rhythm Generator to perform a speaker change judgment including backchanneling judgment and rythm synchronization in real time. These components are to construct a task oriented spoken dialog system., Information Processing Society of Japan (IPSJ), Japanese
  • 単語信頼度と検索結果を利用した協調的応答戦略
    高木 浩吉; 小暮 悟; 伊藤 敏彦, 言語・音声理解と対話処理研究会, 46, 0, 33, 38, 03 Mar. 2006
    人工知能学会, Japanese
  • Cooperative response strategy using word plausibility score and retrieval result
    TAKAGI HIROYOSHI; KOGURE SATORU; ITOH TOSHIHIKO; KAI ATSUHIKO; KONISHI TATSUHIRO; ITO YUKIHIRO, 人工知能学会言語・音声理解と対話処理研究会資料, 46th, 33, 38, 03 Mar. 2006
    人工知能学会, Japanese
  • 音声言語理解のための助詞・付属語の信頼度利用に関する調査
    藤原敬記; 伊藤敏彦; 荒木健治, 日本音響学会研究発表会講演論文集(CD-ROM), 2006, 2006
  • Analysis of Linguistic and Acoustic Features Depending on Different Situations and Discussions from various viewpoints : The Experiments Considering Voice Quality
    YAMADA Shinya; ITOH Toshihiko; ARAKI Kenji, IPSJ SIG Notes, 2005, 127, 67, 72, 21 Dec. 2005
    This paper presents our analyses of human-human and human-machine interactions and the characteristic differences of linguistic and acoustic features observed in different spoken dialogue situations and with different dialogue partners. The linguistic and acoustic features of the user's speech to a spoken dialogue system and a human operator are compared in several goal setting and destination database searching tasks for a car navigation system. It is said that it is not clear enough whether different dialogue situations, different dialogue partners and different speech recognition rate ca..., Information Processing Society of Japan (IPSJ), Japanese
  • Speech Recognition using CFG in combination with Word Spotting for Robust Language Understanding
    SUZUKI Sadayuki; KOGURE Satoru; ITOH Toshihiko; KAI Atsuhiko; KONISHI Tatsuhiro; ITOH Yukihiro, IPSJ SIG Notes, 2005, 127, 115, 120, 21 Dec. 2005
    In this paper, we propose the technique for improving the N-best candidates accuracy in a spontaneous utterance by destination setting task with car navigation, combining the N-best candidates using a grammatical constraint at sentence and the word lattice using word spotting, to improve performance in the framework of speech understanding from the N-best candidates in early research. The system calculates the reliability of the word of each utterance by using the word lattice. We use the reliability to raise the word likelihood and to exchange the word of the N-best candidate by grammatica..., Information Processing Society of Japan (IPSJ), Japanese
  • Speech Recognition using CFG in combination with Word Spotting for Robust Language Understanding
    SUZUKI Sadayuki; KOGURE Satoru; ITOH Toshihiko; KAI Atsuhiko; KONISHI Tatsuhiro; ITOH Yukihiro, IEICE technical report. Speech, 105, 496, 25, 30, 15 Dec. 2005
    In this paper, we propose the technique for improving the N-best candidates accuracy in a spontaneous utterance by destination setting task with car navigation, combining the N-best candidates using a grammatical constraint at sentence and the word lattice using word spotting, to improve performance in the framework of speech understanding from the N-best candidates in early research. The system calculates the reliability of the word of each utterance by using the word lattice. We use the reliability to raise the word likelihood and to exchange the word of the N-best candidate by grammatica..., The Institute of Electronics, Information and Communication Engineers, Japanese
  • Speech Recognition using CFG in combination with Word Spotting for Robust Language Understanding
    SUZUKI SADAYUKI; KOGURE SATORU; ITOH TOSHIHIKO; KAI ATSUHIKO; KONISHI TATSUHIRO; ITO YUKIHIRO, 電子情報通信学会技術研究報告, 105, 496(SP2005 105-138), 25, 30, 15 Dec. 2005
    Japanese
  • Analysis of Linguistic and Acoustic Features Depending on Different Situations and Discussions from various viewpoints : The Experiments Considering Voice Quality
    YAMADA Shinya; ITOH Toshihiko; ARAKI Kenji, IEICE technical report. Speech, 105, 495, 67, 72, 14 Dec. 2005
    This paper presents our analyses of human-human and human-machine interactions and the characteristic differences of linguistic and acoustic features observed in different spoken dialogue situations and with different dialogue partners. The linguistic and acoustic features of the user's speech to a spoken dialogue system and a human operator are compared in several goal setting and destination database searching tasks for a car navigation system. It is said that it is not clear enough whether different dialogue situations, different dialogue partners and different speech recognition rate ca..., The Institute of Electronics, Information and Communication Engineers, Japanese
  • Analysis of Linguistic and Acoustic Features Depending on Different Situations and Discussions from various viewpoints-The Experiments Considering Voice Quality-
    YAMADA SHIN'YA; ITOH TOSHIHIKO; ARAKI KENJI, 電子情報通信学会技術研究報告, 105, 495(SP2005 90-104), 67, 72, 14 Dec. 2005
    Japanese
  • 音声認識率や状況の違いによる音声対話の言語的・音響的特徴の比較(合同セッション「対話」)
    伊藤 敏彦; 山田 真也; 荒木 健治, 情報処理学会研究報告. 自然言語処理研究会報告, 2005, 50, 101, 106, 26 May 2005
    人間同士または人間と機械との音声対話において, タスク遂行役の音声認識率、対話状況や対話相手の違いによって生じる言語・音響的な特徴の差異に関して実音声対話データの分析結果から明らかにする.機械との対話を扱うため, 比較的単純な状況設定としてカーナビゲーションシステムにおける目的地検索・設定タスクを想定し, その音声インタフェースという具体的な状況設定においてユーザ発話に現れる言語・音響的な特徴の差異を比較した.想定した状況は, 音声認識率が100%と約80%の場合, 対話相手が人間, 応答能力が制限された人間, 又は機械の場合, 運転中又は停車中の場合である.これらの対話状況の違いにより発話形態にどのような違いがあるか, 被験者24名による実対話音声の収録データに基づいて分析を行なった.運転操作中の状況設定に関しては, 擬似的な運転操作環境を設定した.さらに, 対話状況の違いと併せて, 対話相手が誤認識・誤理解した場合の次発話の言語・音響的な分析も行った.その結果, 運転操作の有無による言語的な特徴の差異はほとんどないが, 音響的な特徴の違いが一部見られたほか, 応答が自然音声か合成音声かで幾つかの言語・音響的な特徴の差異が明らかになった., 一般社団法人情報処理学会, Japanese
  • Linguistic and Acoustic Features Depending on Different Situations and Speech Recognition Rate
    ITOH Toshihiko; YAMADA Shinya; ARAKI Kenji, IPSJ SIG Notes, 2005, 50, 101, 106, 26 May 2005
    This paper presents the characteristic differences of linguistic and acoustic features observed in different spoken dialogue situations and with different dialogue partners: human-human vs. human-machine interactions. We compare the linguistic and acoustic features of the user's speech to a spoken dialogue system and to a human operator in several goal setting and destination database searching tasks for a car navigation system. It has been pointed out that speech-based interaction has the potential to distract the driver's attention and degrade safety. On the other hand, it is not clear en..., Information Processing Society of Japan (IPSJ), Japanese
  • Linguistic and Acoustic Features Depending on Different Situations and Speech Recognition Rate
    ITOH TOSHIHIKO; YAMADA SHIN'YA; ARAKI KENJI, 情報処理学会研究報告, 2005, 50(NL-167 SLP-56), 101, 106, 26 May 2005
    This paper presents the characteristic differences of linguistic and acoustic features observed in different spoken dialogue situations and with different dialogue partners: human-human vs. human-machine interactions. We compare the linguistic and acoustic features of the user's speech to a spoken dialogue system and to a human operator in several goal setting and destination database searching tasks for a car navigation system. It has been pointed out that speech-based interaction has the potential to distract the driver's attention and degrade safety. On the other hand, it is not clear enough whether different dialogue situations and different dialogue partners cause any differences of linguistic or acoustic features on one's utterances in a speech interface system. Additionally, research about influence of speech recognition rate is not enough either. We collected a set of spoken dialogues by 24 subject speakers for each experiment under several dialogue situations. For a car driving situation, we prepared a virtual driving simulation system. We also prepared two patterns where we have two dialogue partners with different speech recognition rate (100% and about 80%). We analyzed the characteristic differences of user utterances caused by different dialogue situations and with different dialogue partners in two above mentioned patterns., Information Processing Society of Japan (IPSJ), Japanese
  • Speech intent presumption method using confidence score of speech recognition and dialogue history for robust meaning understanding
    Mizuno Satoshi; Takagi Hiroyoshi; Kogure Satoru; Kai Atsuhiko; Itoh Toshihiko; Konishi Tatsuhiro; Itoh Yukihiro, IPSJ SIG Notes, 2005, 12, 77, 82, 04 Feb. 2005
    The spoken dialogue interface and the task oriented dialogue system has come to be used by improving the speech recognition, the language understanding technologies, and the computer performance. We need a more robust language understanding for the system to come to be used more generally. Our paper deals with speech intent presumption method using the confidence score of speech recognition and dialogue history for robust meaning understanding. This language understanding results are generated by using the speech recognition results (n-best) and the identification results. Thus, the accurac..., Information Processing Society of Japan (IPSJ), Japanese
  • Speech intent presumption method using confidence score of speech recognition and dialogue history for robust meaning understanding
    MIZUNO SATOSHI; TAKAGI HIROYOSHI; KOGURE SATOSHI; KAI ATSUHIKO; ITOH TOSHIHIKO; KONISHI TATSUHIRO; ITO YUKIHIRO, 情報処理学会研究報告, 2005, 12(SLP-55), 77, 82, 04 Feb. 2005
    The spoken dialogue interface and the task oriented dialogue system has come to be used by improving the speech recognition, the language understanding technologies, and the computer performance. We need a more robust language understanding for the system to come to be used more generally. Our paper deals with speech intent presumption method using the confidence score of speech recognition and dialogue history for robust meaning understanding. This language understanding results are generated by using the speech recognition results (n-best) and the identification results. Thus, the accuracy of the category identification influences the language understanding accuracy. Then, we used the presumption of user's speech intention in order to improve the language understanding accuracy. As the result of evaluation experiment, we show that the language understanding performance used our proposed method is higher than the language understanding method which simply gives priority to the first hypothesis of an n-best., Information Processing Society of Japan (IPSJ), Japanese
  • 日本語対話訓練システムにおける学習者へ誤りを指摘する機構の設計
    薬袋直貴; 白鳥雄史; 伊藤敏彦; 小西達裕; 近藤真; 伊東幸宏, 教育システム情報学会全国大会講演論文集, 29th, 37, 38, 20 Aug. 2004
    Japanese
  • Rethinking Plans and Scripts Realization in the Age of Web-mining
    Rzepka Rafal; Itoh Toshihiko; Araki Kenji, IPSJ SIG Notes, 2004, 73, 11, 18, 15 Jul. 2004
    In this paper we introduce some ideas for reusing cognitive science concepts which realizing before was impossible due to the technical limits. We concentrate on the Schankian scripts which could help to build plans as the basic method for achieving goals. In contradistinction to the authors of classic cognitivistic ideas, we can currently use powerful computers and terabytes of data which could help to make their concepts usable in not restricted domains for any kind of application using commonsense knowledge. Many useful projects were abandoned because of difficulties due to the manual in..., Information Processing Society of Japan (IPSJ), English
  • 言語情報を利用した韻律ラベリング手法の評価
    桐山伸也; 北沢茂良; 伊藤敏彦, 日本音響学会研究発表会講演論文集, 2004, 237, 238, 17 Mar. 2004
    Japanese
  • 日本語韻律コーパスのためのJ‐ToBIラベリング
    北沢茂良; 桐山伸也; 伊藤敏彦, 日本音響学会研究発表会講演論文集, 2004, 349, 350, 17 Mar. 2004
    Japanese
  • 対話訓練システムのための言語処理・文脈処理に関する研究
    伊東幸宏; 小西達裕; 近藤真; 伊藤敏彦, 静岡大学情報学研究, 9, 119, 123, 10 Mar. 2004
    Japanese
  • Design of the Modular Interaction Control Rules and Research on the Means of Application to the Concrete Interaction Domain.
    SUZUKI YUKIKO; IKEGAYA YUKI; NOGUCHI YASUHIRO; ITOH TOSHIHIKO; KONISHI TATSUHIRO; ITO YUKIHIRO; TAKAGI AKIRA, 人工知能学会言語・音声理解と対話処理研究会資料, 40th, 73, 78, 05 Mar. 2004
    人工知能学会, Japanese
  • Investigation of statistical techniques for intention understanding in spoken dialog
    Shiraki Masayuki; Itoh Toshihiko; Kai Atsuhiko; Nakatani; Hiromasa, IPSJ SIG Notes, 2004, 15, 69, 74, 06 Feb. 2004
    Recently, research on spoken dialog systems has been active with progress of the speech recognition technology. However, it is difficult to extract user intention correctly from natural utterance. Most of these difficulties are due to the errors of speech recognition results, and a variety of linguistic phenomena included in natural utterance. We propose statistical methods to extract user intention from natural utterance. By learning examples, a set of rules which are robust to various linguistic phenomena can be automatically acquired. In this paper, N-gram model, vector space model, and ..., Information Processing Society of Japan (IPSJ), Japanese
  • Investigation of statistical techniques for intention understanding in spoken dialog
    SHIRAKI MASAYUKI; ITOH TOSHIHIKO; KAI ATSUHIKO; NAKATANI HIROMASA, 情報処理学会研究報告, 2004, 15(SLP-50), 69, 74, 06 Feb. 2004
    Recently, research on spoken dialog systems has been active with progress of the speech recognition technology. However, it is difficult to extract user intention correctly from natural utterance. Most of these difficulties are due to the errors of speech recognition results, and a variety of linguistic phenomena included in natural utterance. We propose statistical methods to extract user intention from natural utterance. By learning examples, a set of rules which are robust to various linguistic phenomena can be automatically acquired. In this paper, N-gram model, vector space model, and Support Vector Machine (SVM) are used for understanding user intention. We perform the experiments of intention understanding and evaluate the performances of those methods., Information Processing Society of Japan (IPSJ), Japanese
  • Integrated Japanease Dependency Analysis Using a Dialogue Context
    IKEGAYA YUKI; NOGUCHI YASUHIRO; SUZUKI YUKIKO; ITOH TOSHIHIKO; KONISHI TATSUHIRO; KONDO MAKOTO; TAKAGI AKIRA; NAKASHIMA HIDEYUKI; ITO YUKIHIRO, 人工知能学会全国大会論文集(CD-ROM), 18th, 3E2-10, 2004
    Japanese
  • Construction and Evaluation of Spoken Dialogue Type Car Interface Using a Situation and the Context
    Yuasa Hiroki; Mizuno Satoshi; Itoh Toshihiko; Kai Atsuhiko; Konishi Tatsuhiro; Itoh Yukihiro, IPSJ SIG Notes, 2003, 124, 199, 204, 18 Dec. 2003
    This paper deals with the construction of a spoken dialogue system which interprets an input by using the situation/context. The system has restricting input styles to "Operate an object" "An attribute is a value" in order to achieve higher recognition rate. The system further accepts more than one input in an utterance. We have conducted an evaluation experiment by 20 subjects. The experiment involves operating an air-conditioner and a stereo in a car. By analyzing the collected dialogues, the validity of the language interpretation using the situation/context has been confirmed. In additi..., Information Processing Society of Japan (IPSJ), Japanese
  • The effect of the style unification in a spoken language interface with car navigation system
    MORITA Hiroyasu; HAYASHI Michihiro; ITOH Toshihiko; KAI Atsuhiko; KONISHI Tatsuhiro; ITOH Yukihiro; KATSURAGAWA Keiko; OONO Takeshi, IPSJ SIG Notes, 2003, 124, 205, 210, 18 Dec. 2003
    When a human interface accepts voice input, the vocabulary at sentence styles to be used are different from those for another device accepting voice input. The increase of such devices forces users to learn different input methods. In this paper, we propose a spoken language interface using a consistent input method which can be applied to every voice input device. We examine problems of voice input in car navigation systems and describe the tequnique for unification of sentence styles. We have implemented a system for destination search and conducted two experiments for evaluating the system., Information Processing Society of Japan (IPSJ), Japanese
  • Construction and Evaluation of Spoken Dialogue Type Car Interface Using Situation and the Context.
    YUASA HIROKI; MIZUNO SATOSHI; ITOH TOSHIHIKO; KAI ATSUHIKO; KONISHI TATSUHIRO; ITO YUKIHIRO, 電子情報通信学会技術研究報告, 103, 517(NLC2003 50-90), 199, 204, 18 Dec. 2003
    Japanese
  • The effect of the style unification in a spoken language interface with car navigation system.
    MORITA HIROYASU; HAYASHI MICHIHIRO; ITOH TOSHIHIKO; KAI ATSUHIKO; KONISHI TATSUHIRO; ITO YUKIHIRO; KATSURAGAWA KEIKO; ONO TAKESHI, 電子情報通信学会技術研究報告, 103, 517(NLC2003 50-90), 205, 210, 18 Dec. 2003
    Japanese
  • Construction and Evaluation of Spoken Dialogue Type Car Interface Using a Situation and the Context
    Yuasa Hiroki; Mizuno Satoshi; Itoh Toshihiko; Kai Atsuhiko; Konishi Tatsuhiro; Itoh Yukihiro, IEICE technical report. Natural language understanding and models of communication, 103, 517, 199, 204, 11 Dec. 2003
    This paper deals with the construction of a spoken dialogue system which interprets an input by using the situation/context. The system has restricting input styles to "Operate an object" "An attribute is a value" in order to achieve higher recognition rate. The system further accepts more than one input in an utterance. We have conducted an evaluation experiment by 20 subjects. The experiment involves operating an air-conditioner and a stereo in a car. By analyzing the collected dialogues, the validity of the language interpretation using the situation/context has been confirmed. In additi..., The Institute of Electronics, Information and Communication Engineers, Japanese
  • The effect of the style unification in a spoken language interface with car navigation system
    MORITA Hiroyasu; HAYASHI Michihiro; ITOH TOshihiko; KAI Atsuhiko; KONISHI Tatsuhiro; ITOH Yukihiro; KATSURAGAWA Keiko; OONO Takeshi, IEICE technical report. Natural language understanding and models of communication, 103, 517, 205, 210, 11 Dec. 2003
    When a human interface accepts voice input, the vocabulary at sentence styles to be used are different from those for another device accepting voice input. The increase of such devices forces users to learn different input methods. In this paper, we propose a spoken language interface using a consistent input method which can be applied to every voice input device. We examine problems of voice input in car navigation systems and describe the tequnique for unification of sentence styles. We have implemented a system for destination search and conducted two experiments for evaluating the system., The Institute of Electronics, Information and Communication Engineers, Japanese
  • Construction and Evaluation of Spoken Dialogue Type Car Interface Using a Situation and the Context
    Yuasa Hiroki; Mizuno Satoshi; Itoh Toshihiko; Kai Atsuhiko; Konishi Tatsuhiro; Itoh Yukihiro, IEICE technical report. Speech, 103, 519, 199, 204, 11 Dec. 2003
    This paper deals with the construction of a spoken dialogue system which interprets an input by using the situation/context. The system has restricting input styles to "Operate an object" "An attribute is a value" in order to achieve higher recognition rate. The system further accepts more than one input in an utterance. We have conducted an evaluation experiment by 20 subjects. The experiment involves operating an air-conditioner and a stereo in a car. By analyzing the collected dialogues, the validity of the language interpretation using the situation/context has been confirmed. In additi..., The Institute of Electronics, Information and Communication Engineers, Japanese
  • The effect of the style unification in a spoken language interface with car navigation system
    MORITA Hiroyasu; HAYASHI Michihiro; ITOH TOshihiko; KAI Atsuhiko; KONISHI Tatsuhiro; ITOH Yukihiro; KATSURAGAWA Keiko; OONO Takeshi, IEICE technical report. Speech, 103, 519, 205, 210, 11 Dec. 2003
    When a human interface accepts voice input, the vocabulary at sentence styles to be used are different from those for another device accepting voice input. The increase of such devices forces users to learn different input methods. In this paper, we propose a spoken language interface using a consistent input method which can be applied to every voice input device. We examine problems of voice input in car navigation systems and describe the tequnique for unification of sentence styles. We have implemented a system for destination search and conducted two experiments for evaluating the system., The Institute of Electronics, Information and Communication Engineers, Japanese
  • Development of Prosodic Labeling Methods Utilizing Linguistic Information
    KIRIYAMA SHIN'YA; MITSUTA YOSHIFUMI; HOSOKAWA YUTA; ITOH TOSHIHIKO; KITAZAWA SHIGEYOSHI, 電子情報通信学会技術研究報告, 103, 332(SP2003 94-102), 35, 40, 30 Sep. 2003
    We have developed the methods to generate prosodic labels automatically, utilizing the linguistic information. Large-scale prosodic databases are strongly desired for years, however, the construction of databases depend on hand labeling, because of diversity of prosody. Our purpose is development of "a prosodic labeling support system." We aim at not automating the whole labeling process, but making the hand labeling work more efficient by providing the labelers with the appropriate support information. The methods of auto-generating initial phoneme and prosodic labels utilizing linguistic information are proposed and evaluated. The experimental results showed that more than 70% of J-ToBI labels were correctly generated, and proved the efficiency of the proposed methods. The results also enabled us to study how to generate support information based on tendency of timing errors of the phoneme labels for each phoneme, or possibility of plural candidates of accentual phrase boundaries for J-ToBI labels., The Institute of Electronics, Information and Communication Engineers, Japanese
  • Development of Prosodic Labeling Methods Utilizing Linguistic Informartion
    KIRIYAMA Shinya; MITSUTA Yoshifumi; HOSOKAWA Yuta; ITOH Toshihiko; KITAZAWA Shigeyoshi, Technical report of IEICE. DSP, 103, 330, 35, 40, 23 Sep. 2003
    We have developed the methods to generate prosodic labels automatically, utilizing the linguistic information. Large-scale prosodic databases are strongly desired for years, however, the construction of databases depend on hand labeling, because of diversity of prosody. Our purpose is development of "a prosodic labeling support system." We aim at not automating the whole labeling process, but making the hand labeling work more efficient by providing the labelers with the appropriate support information. The methods of auto-generating initial phoneme and prosodic labels utilizing linguistic ..., The Institute of Electronics, Information and Communication Engineers, Japanese
  • Development of Prosodic Labeling Methods Utilizing Linguistic Information
    KIRIYAMA Shinya; MITSUTA Yoshifumi; HOSOKAWA Yuta; ITOH Toshihiko; KITAZAWA Shigeyoshi, IEICE technical report. Speech, 103, 332, 35, 40, 23 Sep. 2003
    We have developed the methods to generate prosodic labels automatically, utilizing the linguistic information. Large-scale prosodic databases are strongly desired for years, however, the construction of databases depend on hand labeling, because of diversity of prosody. Our purpose is development of "a prosodic labeling support system." We aim at not automating the whole labeling process, but making the hand labeling work more efficient by providing the labelers with the appropriate support information. The methods of auto-generating initial phoneme and prosodic labels utilizing linguistic ..., The Institute of Electronics, Information and Communication Engineers, Japanese
  • 日本語MULTEXTコーパスにおける言語情報を用いたBreak Indexラベリング
    三ツ田佳史; 桐山伸也; 北沢茂良; 伊藤敏彦, 日本音響学会研究発表会講演論文集, 2003, 363, 364, 17 Sep. 2003
    Japanese
  • 対話音声における文脈の相違が強調文節の判断時間に与える影響
    伊藤佳世; 桐山伸也; 北沢茂良; 伊藤敏彦; 北村達也, 日本音響学会研究発表会講演論文集, 2003, 361, 362, 17 Sep. 2003
    Japanese
  • 初等プログラミング教育における組み込みコード品質検証ツールの試験的活用
    はつ川友宏; 伊藤敏彦; 坂根裕; 新谷誠; 小西達裕; 伊東幸宏, 教育システム情報学会全国大会講演論文集, 28th, 141, 142, 30 Aug. 2003
    Japanese
  • 日本語対話訓練システムにおけるシチュエーション判定部の構築
    白鳥雄史; 伊藤敏彦; 小西達裕; 近藤真; 伊東幸宏, 教育システム情報学会全国大会講演論文集, 28th, 33, 34, 30 Aug. 2003
    Japanese
  • A Method for Mapping Sentence Meanings to a Dialogue Context
    NOGUCHI YASUHIRO; IKEGAYA YUKI; SUZUKI YUKIKO; ITOH TOSHIHIKO; KONISHI TATSUHIRO; KONDO MAKOTO; TAKAGI AKIRA; NAKASHIMA HIDEYUKI; ITO YUKIHIRO, 人工知能学会全国大会論文集, 17th, Pt.1, 1C1.05,1-4, 23 Jun. 2003
    Japanese
  • Construction of a Dialog System with a Method of Mapping Sentence Meanings to a Dialogue Context
    IKEGAYA YUKI; NOGUCHI YASUHIRO; SUZUKI YUKIKO; ITOH TOSHIHIKO; KONISHI TATSUHIRO; KONDO MAKOTO; TAKAGI AKIRA; NAKASHIMA HIDEYUKI; ITO YUKIHIRO, 人工知能学会全国大会論文集, 17th, Pt.2, 3B1.05,1-4, 4, 23 Jun. 2003
    人工知能学会, Japanese
  • An analysis of dependency structuer and an experiment of automatic BI labeling in Japanese MULTEXT prosodic corpus
    MITSUTA Yoshifumi; KIRIYAMA Shinya; KITAZAWA Shigeyoshi; ITOH TOSHIHIKO, 日本音響学会研究発表会講演論文集, 2003, 1, 379, 380, 18 Mar. 2003
    Japanese
  • J-ToBI labeling on the Japanese-MULTEXT prosodic corpus
    KIRIYAMA Shinya; ITOH Toshihiko; KITAZAWA Shigeyoshi, 日本音響学会研究発表会講演論文集, 2003, 1, 381, 382, 18 Mar. 2003
    Japanese
  • Phoneme composition effect compensated measurement of speech rate
    MOCHIZUKI Kazuya; KIRIYAMA Shinya; ITOH Toshihiko; KITAZAWA Shigeyoshi, 日本音響学会研究発表会講演論文集, 2003, 1, 383, 384, 18 Mar. 2003
    Japanese
  • 日本語MULTEXT韻律コーパスにおけるJ‐ToBIラベリング
    桐山伸也; 伊藤敏彦; 北沢茂良, 日本音響学会研究発表会講演論文集, 2003, 1, 381, 382, 18 Mar. 2003
    Japanese
  • Maximum-Likelihood Spoken Language Understanding Using CSR Confidence Measure and Dialogue History
    MIZUTANI Makoto; ITOH Toshihiko; KAI Atsuhiko; KONISHI Tatsuhiro; ITOH Yukihiro, IPSJ SIG Notes, 2003, 14, 113, 118, 07 Feb. 2003
    Although the ear-navigation system attracts attention as one of the spoken dialogue interfaces, a dialogue will not progress smoothly by miss recognition under the influence of a natural speech and a run noise, and a user will feel displeasure. Thus, this research aims, at the construction of a dialogue system which can obtain a smooth dialogue and the high degree of user satisfaction by performing language understanding and response generation using the confidence measure (CM) based on continuous speech recognizer (CSR) and the dialogue history. This paper show s the spoken language unders..., Information Processing Society of Japan (IPSJ), Japanese
  • PROJECT REPORTS : A Study on Natural Language Processing and Context Processing for a Dialogue Training System
    Itoh Yukihiro; Konishi Tatsuhiro; Kondo Makoto; Itoh Toshihiko, Studies in information, Shizuoka University, 9, 0, 119, 123, 2003
    Shizuoka University, Japanese
  • 移植性の高い音声対話システムにおける対話戦略設計ツールの評価 (テーマ:一般)
    小暮 悟; 伊藤 敏彦; 中川 聖一, 言語・音声理解と対話処理研究会, 36, 0, 71, 76, 07 Nov. 2002
    人工知能学会, Japanese
  • K-50 Development of Dialogue Strategy Design Tool for Highly Portable Spoken Dialogue Systems
    Kogure Satoru; Itoh Toshihiko; Nakagawa Seiichi, 情報科学技術フォーラム一般講演論文集, 2002, 3, 467, 468, 13 Sep. 2002
    Forum on Information Technology, Japanese
  • Influence of the difficulty of a concurrent task on linguistic competence
    Iwamoto Yoshiyuki; Itoh Toshihiko; Kai Atsuhiko; Konishi Tatsuhiro; Itoh Yukihiro, IPSJ SIG Notes, 2002, 50, 61, 67, 24 May 2002
    We investigated the characteristic change of utterances under the different dialogue situations; the situation of using a voice interface of machine versus talking with a human, and the situation of talking with driving versus without driving. The result of a statistical analysis revealed that a driving task does not affect the linguistic features of utterances and the result differed from our assumption. Since this result may be due to a relatively low coguitive load in driving task, we conducted a dialogue experiment under the situation of a concurrent driving task with different difficul..., Information Processing Society of Japan (IPSJ), Japanese
  • 同時処理タスクの難易度の変化における言語能力への影響
    岩本 善行; 伊藤 敏彦; 甲斐 充彦; 小西 達裕; 伊東 幸宏, 情報処理学会研究報告. 自然言語処理研究会報告, 2002, 44, 125, 131, 23 May 2002
    音声入力インタフェースを使用する状況において、対話相手が人間又は機械、運転中又は停車中といった対話状況の違いにより発話にどのような特徴の変化があるのかを調べる為に、対話を収集し分析を行った。その書き起こしや言語的・音響的特徴の統計的な分析結果では、運転の有無は発話の言語的特徴に影響を与えないというものであり、我々の仮説とは異なっていた。しかしながら、運転タスクの難易度が低すぎたことによる影響の可能性がある為、運転操作に必要な認知的負荷を変化させた場合の発話の言語的・音響的特徴に関する分析を行った。その結果、発話の言語的特徴においては、ほとんど運転タスクの影響を受けず、音響的特徴に若干の影響を与える事が明らかになった。, 一般社団法人情報処理学会, Japanese
  • Influence f context and word order in the identification of focal prominence in Japanese dialogue
    KITAMURA Tatsuya; ITOH Kayo; ITOH Toshihiko; KITAZAWA Shigeyoshi, IEICE technical report. Speech, 102, 35, 61, 66, 19 Apr. 2002
    This paper studies the influence of prosodic features, context, and word order on the identification of focused clauses in Japanese dialogue, using a psychoacoustic experiment. In the experiment, question and answer speech was used as stimuli. The questions were to create two different contexts in the stimuli, and the answers had focal prominence at different clauses and had different word orders. The experimental results indicate that (1) prosodic characteristics are more significant for focus identification, (2) context has some effect on identification, and (3) it is probable that the wo..., The Institute of Electronics, Information and Communication Engineers, Japanese
  • Influence of context and word order in the identification of focal prominence in Japanese dialogue
    KITAMURA Tatsuya; ITOH Kayo; ITOH Toshihiko; KITAZAWA Shigeyoshi, Technical report of IEICE. EA, 102, 33, 61, 66, 19 Apr. 2002
    This paper studies the influence of prosodic features, context, and word order on the identification of focused clauses in Japanese dialogue, using a psychoacoustic experiment. In the experiment, question and answer speech was used as stimuli. The questions were to create two different contexts in the stimuli, and the answers had focal prominence at different clauses and had different word orders. The experimental results indicate that (1) prosodic characteristics are more significant for focus identification, (2) context has some effect on identification, and (3) it is probable that the wo..., The Institute of Electronics, Information and Communication Engineers, Japanese
  • An analysis of morae counts between accent-kernels observed in the Japanese MULTEXT prosodic corpus
    MOCHIZUKI Kazuya; KITAZAWA Shigeyoshi; KITAMURA Tatsuya; ITOH Toshihiko, 日本音響学会研究発表会講演論文集, 2002, 1, 369, 370, 18 Mar. 2002
    Japanese
  • The Technique for Interpreting a Ungrammatical Sentences in a Study of Japanese Education System for Nonnative Speakers.
    成瀬聡; 鈴木正浩; 伊藤敏彦; 小西達裕; 近藤真; 伊東幸宏, 人工知能学会知的教育システム研究会資料, 34, 0, 99, 104, 02 Mar. 2002
    人工知能学会, Japanese
  • A Prototype Tool of Designing Dialogue Script for Highly Portable Spoken Dialogue Systems
    KOGURE Satoru; ITOH Toshihiko; NAKAGAWA Seiichi, 情報処理学会研究報告. HI, ヒューマンインタフェース研究会報告, 2002, 10, 139, 144, 01 Feb. 2002
    Recently the technology for speech recognition and language processing for spoken dialogue systems has been improved, and speech recognition systems and dialogue systems have been developed to be practical use. In order to become more practical, not only those fundamental techniques but also the techniques of portability and expansibility should be developed. We already presented the portability of spoken dialogue systems. In our past research, we demonstrated the portability of the speech recognition module and the interpreter. In this paper, we focused on the portability of the dialogue m..., Information Processing Society of Japan (IPSJ), Japanese
  • A Prototype Tool of Designing Dialogue Script for Highly Portable Spoken Dialogue Systems
    KOGURE Satoru; ITOH Toshihiko; NAKAGAWA Seiichi, IPSJ SIG Notes, 2002, 10, 139, 144, 01 Feb. 2002
    Recently the technology for speech recognition and language processing for spoken dialogue systems has been improved, and speech recognition systems and dialogue systems have been developed to be practical use. In order to become more practical, not only those fundamental techniques but also the techniques of portability and expansibility should be developed. We already presented the portability of spoken dialogue systems. In our past research, we demonstrated the portability of the speech recognition module and the interpreter. In this paper, we focused on the portability of the dialogue m..., Information Processing Society of Japan (IPSJ), Japanese
  • Prosodic phrase labeling based on prosodic features for developing prosodic database
    KITAMURA Tatsuya; ITOH Toshihiko; MOCHIZUKI Kazuya; KITAZAWA Shigeyoshi, IEICE technical report. Speech, 101, 603, 23, 30, 17 Jan. 2002
    A very detailed segmentation of prosodic phrase has carried out in order to construct a Japanese prosodic database. The database, referred to here as "Japanese Multext", contains read style speech and spontaneous style speech by three male speakers and three female speakers in Tokyo dialect. The "prosodic phrase", we introduced as a unit of the segmentation, was defined and regarded as a unit of language speech perception. For the exact segmentation, the wide-band spectrum, the narrow-band spectrum, fine speech wave and fundamental frequency shapes, and transition of amplitude of the higher..., The Institute of Electronics, Information and Communication Engineers, Japanese
  • 日本語対話訓練システムにおける非文解釈手法
    鈴木正浩; 伊藤敏彦; 小西達裕; 近藤真; 伊東幸宏, 教育システム情報学会全国大会講演論文集, 27th, 2002
  • Biuding of an Hotel Reservation Dialogue System Based on Locating Sentence Meanings in a Given Context.
    池ケ谷有希; 野口靖浩; 鈴木夕紀子; 伊藤敏彦; 小西達裕; 近藤真; 高木朗; 中島秀之; 伊東幸宏, 人工知能学会言語・音声理解と対話処理研究会資料, 36th, 2002
  • Analysis and detection of spoken corrections in spoken dialog between human and car navigation system
    KAI Atsuhiko; ISHIMARU Akiko; ITOH Toshihiko; KONISHI Tatsuhiko; ITOH Yukihiro, 日本音響学会研究発表会講演論文集, 2001, 2, 63, 64, 01 Oct. 2001
    Japanese
  • Analysis of Utterances according to Dialogue Situations for Car Navigation System
    ITOH Toshihiko; IWAMOTO Yoshiyuki; MIZUTANI Makoto; YUASA Hiroki; KAI Atsuhiko; KONISHI Tatsuhiro; ITOH Yukihiro, 日本音響学会研究発表会講演論文集, 2001, 2, 65, 66, 01 Oct. 2001
    Japanese
  • An Analysis of Accent and Rhythm on Japanese MULTEXT
    KITAZAWA Shigeyoshi; KITAMURA Tatsuya; MOCHIZUKI Kazuya; ITOH TOSHIHIKO, 日本音響学会研究発表会講演論文集, 2001, 2, 227, 228, 01 Oct. 2001
    Japanese
  • CONSTRUCTING A DRIVE PLANNING SYSTEM WITH A NATURAL LANGUAGE INTERFACE
    Katsuragawa Keiko; Niwa Michihiro; Yanagi Takura; Watanabe Masaki; Itoh Toshihiko; Konishi Tatsuhiro; Itoh Yukihiro, 情報処理学会研究報告. MBL, [モバイルコンピューティングとワイヤレス通信], 2001, 83, 229, 236, 06 Sep. 2001
    In this paper, we propose a drive planning system that supports users in making a plan for a trip. This system has the function to help users decide several factors of a trip: multiple destinations and waypoints, arrival and departure times, the number of days that the trip will take and the route. It also proposes taking a rest on a long distance trip in order to ensure safe driving. The drive is planned interactively by a dialog with the system through a natural language interface. We propose a method to construct such a drive planning system, describe the implementation of a prototype dialog system and present the result of evaluating its usefulness., Information Processing Society of Japan (IPSJ), Japanese
  • CONSTRUCTING A DRIVE PLANNING SYSTEM WITH ANATURAL LANGUAGE INTERFACE
    Katsuragawa Keiko; Niwa Michihiro; Yanagi Takura; Watanabe Masaki; Itoh Toshihiko; Konishi Tatsuhiro; Itoh Yukihiro, 情報処理学会研究報告. ITS, [高度交通システム], 2001, 83, 229, 236, 06 Sep. 2001
    In this paper, we propose a drive planning system that supports users in making a plan for a trip. This system has the function to help users decide several factors of a trip: multiple destinations and waypoints, arrival and departure times, the number of days that the trip will take and the route. It also proposes taking a rest on a long distance trip in order to ensure safe driving. The drive is planned interactively by a dialog with the system through a natural language interface. We propose a method to construct such a drive planning system, describe the implementation of a prototype di..., Information Processing Society of Japan (IPSJ), Japanese
  • Constructing a Natural Language Interface of Drive Planning System
    Niwa Michihiro; Akiyama Taizou; Yanagi Takura; Watanabe Masaki; Itoh Toshihiko; Konishi Tatsuhiro; Itoh Yukihiro, IPSJ SIG Notes, 2000, 101, 55, 60, 27 Oct. 2000
    In this paper, we describe a natural language interface for Drive Planning System which supports drivers to make a plan for a trip. The system enables us to make a plan for a trip interactively by using natural language. We propose following methods:a parsing technique for restricted sentence patterns in a specific domain, a method for semantic analysis integrated into the parsing process, and a method for contextual analysis identifying references of pronouns and omitted words. We implemented a prototype dialogue system for planning trip by car and evaluated the system., Information Processing Society of Japan (IPSJ), Japanese
  • Analysis of filled pauses and their use in a dialogue system
    Itoh Toshihiko; Minematsu Nobuaki; Nakagawa Seiichi, The Journal of the Acoustical Society of Japan, 55, 5, 333, 342, 01 May 1999
    本研究では, 独話や対話に存在する間投詞に着目し「発話中の間投詞は聞き手に対してどのような働きを持つのか」「協調的なシステムの応答文生成において間投詞は有効・必要なのか」という観点から, 聴取実験による検討を行った。その結果, 間投詞に関する幾つかの知見を得ることができた。これらの知見に基づき, 対話システムにおいて「より自然なシステム応答」及び「情報検索・応答文生成によって不可避的に生じる無音が引き起こす不自然さの軽減」を目的として, システム応答音声中に間投詞を挿入することを考案した。そして, WOZ (Wizard of OZ)による音声対話システムを用いて, 間投詞が付与されたシステム応答に対する評価実験を行った。実験結果より間投詞が, 音声対話システムにおける応答文生成時間の確保や, 発話権の維持, 及びシステムが動作中であることを示すサインとして有用であることが分かり, 間投詞挿入による効果が確認された。, The Acoustical Society of Japan (ASJ), Japanese
  • Proposal of a Standard Utterance-Unit Tagging Scheme
    Araki Masahiro; Itoh Toshihiko; Kumagai Tomoko; Ishizaki Masato, Journal of Japanese Society for Artificial Intelligence, 14, 2, 251, 260, 01 Mar. 1999
    In this paper, we propose a standard utterane-unit tagging scheme, which has been developed by the discourse tagging working group under SIG-SLUD, JSAI. Utterance-unit tagging mainly addresses the type of illocutionary force and the role of The interaction unit. We have made a first version of the tagging scheme by surveying existing tagging schemes developed by several research groups. We have evaluated it on an experimental basis and thereby revised it to the new version that we propose as a standard scheme. The reliability of this scheme is demonstrated by another tagging experiment., The Japanese Society for Artificial Intelligence, Japanese
  • The study of portability for a spoken dialogue systems : a sightseeing guidance spoken dialogue system and a database retrieval system
    KOGURE Satoru; ITOH Toshihiko; NAKAGAWA Seiichi, IPSJ SIG Notes, 99, 14, 13, 18, 05 Feb. 1999
    Recently the study of robustness and usability for speech recognition and language processing has been established, and speech recognition systems and dialogue systems have been developed to be practical use. But if these systems will be come practical, it is important that not only those fundamental techniques but also the techniques of portability and expansibility should be developed. Based on this consideration, we examined our system in portability by transfering the domain of the system from the Mt. Fuji sightseeing guidance to the Mikawa sightseeing guidance. Also we designed a domai..., Information Processing Society of Japan (IPSJ), Japanese
  • Comparison of semantic interpeter on spoken dialogue using CFG/bigram based speech recognizer
    Kogure Satoru; Itoh Toshihiko; Hirose Yoshifumi; Kai Atsuhiko; Nakagawa Seiichi, 全国大会講演論文集, 57, 2, 239, 240, 05 Oct. 1998
    Information Processing Society of Japan (IPSJ), Japanese
  • Application of the generation of filled pauses to a dialgoue system and analysis of a usr's behavior
    ITOH Toshihiko; NAKAGAWA Seiichi, IPSJ SIG Notes, 98, 68, 61, 66, 24 Jul. 1998
    We investigated filled pauses found in lecture speech and dialgoue speech from the following viewpoints;1)the role of the filled pauses in listener's understanding, 2)the necessity or effectiveness of generating filled pauses to make its responses more cooperative. And a series of listening tests were carried out. As a result, we obtained several findings on the above issues. Based on the findings, in this paper, we propose that a speech dialogue system should insert filled pauses in response senteces to increase the naturalness of the responses and to exclude unnatural silent segments(corr..., Information Processing Society of Japan (IPSJ), Japanese
  • Analysis of filled pauses in dialogue speech and application of the generation to a dialogue system
    ITOH Toshihiko; MINEMATSU Nobuaki; NAKAGAWA Seiichi, 人工知能学会全国大会論文集 = Proceedings of the Annual Conference of JSAI, 12, 0, 499, 502, 16 Jun. 1998
    Japanese
  • Consideration of sightseeing guidance spoken dialogue system with multi-modal interface through subjects'evaluation experiments
    Denda Akihiro; Itoh Toshihiko; Nakagawa Seiichi, 全国大会講演論文集, 56, 2, 86, 87, 17 Mar. 1998
    Information Processing Society of Japan (IPSJ), Japanese
  • The Current Status of Standardisation of Discourse Coding Schemes.
    市川あきら; 荒木雅弘; 石崎雅人; 板橋秀一; 伊藤敏彦; 柏岡秀紀; くれ松明; 小磯花絵; 吉村隆, 人工知能学会言語・音声理解と対話処理研究会資料, 21st, 1998
  • A Spoken Dialogue System with Cooperative Response and Evaluation for the System
    55, 5, 333, 342, 1998
  • Implementation of Anthropomorphic Interface on Multi-Modal Sightseeing Guidance Dialogue System and Evaluation of the System
    DENDA Akihiro; ITOH Toshihiko; KOGURE Satoru; NAKAGAWA Seiichi, IPSJ SIG Notes, 97, 101, 39, 46, 24 Oct. 1997
    In our laboratory, we have developed the multi-modal interface with speech input/output, graphical output and touch input for our spoken dialogue system; "Mt. Fuji Sightseeing Guidance System by Spoken Japanese". Furthermore, we implemented an agent interface with real face image/animation and real speech/synthesized speech to the system and carried out evaluation experiments which consist of task completions and questionnaires to evaluate the interface and whole system. The results indicate that users prefer "mechanical/artificial" and "consistent" agent. And they indicate the usefulness o..., Information Processing Society of Japan (IPSJ), Japanese
  • Role of interjections in listening to dialogue speech
    Itoh Toshihiko; Minematsu Nobuaki; Nakasawa Seiichi, 全国大会講演論文集, 55, 2, 27, 28, 24 Sep. 1997
    本研究では協調的な問題解決の対話音声中に存在する間投詞に着目し「発話中の間投詞は聞き手に対してどのような働きを持つか。」, 「協調的な応答文生成において間投詞は有効又必要なのか。」という観点から, 知覚実験による検討を行なった。実験は対話音声より, 間投詞部分を1) 抜き出して切り詰めた音声試料, 2) 同一時間長の無音置換を施した音声試料, 3) 異なる箇所で発声された同一種類の間投詞, 4) 異なる種類の間投詞と置換した音声試料, 5) 2)の無音区間の長さを様々に変化させた音声試料, 6) 間投詞の直前に位置する無音区間を様々に変化させた音声試料, を各々用意し被験者に提示した。1)〜4)までの音声試料に対しては自然である(違和感を全く感じない)との反応を示した。5), 6)に対しては「長い無音区間が不自然に感じる」との反応が幾らかあった。以下, 本実験の目的・計画・結果・考察について述べる。なお, 本稿で言う無音置換とはバックグランドノイズとの置換を意味する。, Information Processing Society of Japan (IPSJ), Japanese
  • Client-server-based continuous speech recognition system on PC
    Itoh Toshihiko; Kai Atsuhiko; Yamamoto Kazumasa; Nakasawa Seiichi, 全国大会講演論文集, 55, 2, 33, 34, 24 Sep. 1997
    近年, バーソナルコンピューター(PC)の性能が向上し, 音声・動画といった計算パワーが必要なマルチメディア関係のアブリケーションも多く見られるようになってきた。そのため, アプリケーションの入力インターフェイスとしてもキーボード・マウスだけでなく, これまでは計算量の問題から実現が難しかったソフトウェアによる音声認識も使用可能となってきた。パソコン上でソフトウェアのみで動作する音声認識システムはいくつか提案されている。我々はワークステーショシ上で開発された音声認識システムをベースに, PC上で動作する音声認識システムを開発した。この音声認識システムは音声入力・分析クライアントと音声認識サーバから構成されておりネットワークを介した文, 句などの複数単語の系列(連続音声)の音声認識か可能である。, Information Processing Society of Japan (IPSJ), Japanese
  • Continuous speech recognition software for spontaneous speech running on networked PC and WS
    KAI Atsuhiko; ITOH Toshihiko; YAMAMOTO Kazumasa; NAKAGAWA Seiichi, 日本音響学会研究発表会講演論文集, 1997, 2, 175, 176, 01 Sep. 1997
    Japanese
  • Evaluation of cooperative responces in spoken dialog system
    ITOH TOSHIHIKO; Nakagawa Seiichi, 全国大会講演論文集, 54, 2, 235, 236, 12 Mar. 1997
    自然言語による対話システムにおいては、システムがユーザと協調的に対話を進めていくことは重要である。データベース検索における協調的応答生成に関しては質問の答以外に付加的な情報を与えたり、失敗した質問に対する理由や代案を提示するものが多い。例えば、ユーザの質問文に検索に必要な情報が含まれていなかったり、検索結果の数が多い場合などはユーザへの質問を行なったり、ユーザの望む検索結果が得られなかった場合、それに代わる代案を提供する。このようなユーザへの協調的応答によってユーザにかかる負担や不安を軽減することを我々は試みている。本稿では、我々が協調的応答生成に関して改良した音声対話シスチムについて、「システムの使い勝手の良さ」、「協調的応答」に着目して行なった評価実験について述べる。, Information Processing Society of Japan (IPSJ), Japanese
  • Evaluation Experiment of A Sightseeing Guidance Spoken Dialogue System with Multi-Modal Interface
    DENDA Akihiro; ITOH Toshihiko; KOGURE Satoru; NAKAGAWA Seiichi, IPSJ SIG Notes, 97, 16, 47, 52, 07 Feb. 1997
    Recent improvements of speech recognition and natural language processing enable dialogue systems to deal with spontaneous speech. With the aim of supporting these systems, multi-modal man-machine interface has been introduced to the system widely. In addition to increasing the total performance of the systems, the multi-modal interface is expected to make the dialogues between a user and the system more natural and abundant in content. In our laboratory, we have developed the multi-modal interface with speech input/output, graphical output and touch input for our spoken dialogue system, "M..., Information Processing Society of Japan (IPSJ), Japanese
  • Progress Report of The Discourse Tagging Working Group.
    荒木雅弘; 青柳達也; 石崎雅人; 伊藤敏彦; 柏岡秀紀; 熊谷智子; 小磯花絵; 田本真詞; 吉村隆, 人工知能学会言語・音声理解と対話処理研究会資料, 18th, 1997
  • Extraction of focus and cooperative responces in spoken dialog system
    ITOH TOSHIHIKO; Nakagawa Seiichi, 全国大会講演論文集, 53, 2, 353, 354, 04 Sep. 1996
    自然言語による対話システムにおいては、システムがユーザと協調的に対話を進めていくことは重要である。発話内容を決定する方法としては、談話の結束性に注目し、修飾構造、談話の焦点などの情報を利用し発話内容を決定するアプローチや、談話をある目的のためのプランとして考え、システムがユーザの質問意図として談話ゴールを推論し、そのゴールの達成に必要な内容を協調的発話として生成するアプローチがある。データベース検索における協調的応答生成に関しては質問の答以外に付加的な情報を与えたり、失敗した質問に対する理由や代案を提示するものが多い。本稿では我々が開発した富士山観光案内音声対話システムとその評価実験で挙げられた応答生成システムの問題点を改良するために構築した、協調的な応答機能をもった応答生成システムについて述べる。, Information Processing Society of Japan (IPSJ), Japanese
  • マルチモーダルインタフェースを備えた観光案内対話システム
    傳田 明弘; 伊藤 敏彦; 中川 聖一, 情報処理学会研究報告. SLP, 音声言語情報処理, 96, 74, 53, 54, 26 Jul. 1996
    In this paper, we propose a drive planning system that supports users in making a plan for a trip. This system has the function to help users decide several factors of a trip: multiple destinations and waypoints, arrival and departure times, the number of days that the trip will take and the route. It also proposes taking a rest on a long distance trip in order to ensure safe driving. The drive is planned interactively by a dialog with the system through a natural language interface. We propose a method to construct such a drive planning system, describe the implementation of a prototype di..., 一般社団法人情報処理学会, Japanese
  • A Robust Spoken Dialogue System Based on Understanding Mechanism of Human Being
    YAMAMOTO MIKIO; ITOH TOSHIHIKO; HIDANO MASARU; NAKAGAWA SEIICHI, IPSJ Journal, 37, 4, 471, 482, 15 Apr. 1996
    In a current speech recognition technology, an interpreter that receives the recognized sentences must be developed so as to cope not only with spontaneous sentences but also with illegal sentences with recognition errors to improve a spoken dialogue system property. Therefore, we carried out experiments to investigate how humans modify or correct the recognized sentences which might include errors. Although 43% of the sentences were the results of misrecognition, the results showed that the subjects who were familiar with the system could correctly interpret 87% of all the sentences. And s..., Information Processing Society of Japan (IPSJ), Japanese
  • Cooperative Response in Spoken Dialogue System
    ITOH Toshihiko; NAKAGAWA Seiichi, 情報処理学会研究報告. HI, ヒューマンインタフェース研究会報告, 96, 21, 105, 110, 29 Feb. 1996
    We have developed a robust dialogue system which aid users in information retrieval through spontaneous speech. Dialog system through natural language must be designed so that it can cooperatively response to users. Based on this consideration, we deloped a cooperative response generator in the dialogue system. The response generator is composed of dialog manager, problem solver, knowledge databases, and response sentence generator. The response generator receives a semantic representation (that is, semantic network) which the interpreter builds for the user's utterance and generates as coo..., Information Processing Society of Japan (IPSJ), Japanese
  • Consideration on Development of a Robust Spoken Dialogue System
    Itoh Toshihiko; Hidano Masaru; Yamamoto Mikio; Nakagawa Seiichi, IPSJ SIG Notes, 95, 73, 139, 144, 20 Jul. 1995
    A spoken dialogue system that can understand spontaneous speech needs to handle extensive range of speech compared to the read speech that have been studied so far. The spoken language has looser restriction of the grammar than the written language and has ambiguous phenomena such as interjections, ellipses, inversions, repairs, unknown words and so on. It must be noted the fact that a recognizer may output the sentence that human being never speaks. Therefore, the interpreter must cope not only with spontaneous sentences but also with illegal sentences having recognition errors. We explain..., Information Processing Society of Japan (IPSJ), Japanese
  • Robust natural language dialog system for spontaneous speech
    Hidano Masaru; ITOH TOSHIHIKO; Yamamoto Mikio; Nakagawa Seiichi, 全国大会講演論文集, 50, 2, 467, 468, 15 Mar. 1995
    音声対話システムにおいて自然な発話における間投詞、助詞落ち、言い直し、倒置などを含む文の理解、あるいは誤認識文からの発話文の復元は対話システム品質を向上させるために必要不可欠である。本稿では人間がいかにして文の復元を行なっているかを被験者実験を通して調べ、それを参考にして復元ストラテジーを考案し、ロバストな意味理解システムを構築した。, Information Processing Society of Japan (IPSJ), Japanese
  • A Robust Spoken Dialogue System Basoz on Understanding Mechanism of Human Being
    36, 4, 471, 481, 1995
  • Effects of a prior explanation on the speaker's utterance and recovery strategies of humans from misrecognition
    ITOH TOSHIHIKO; Otani Koji; Hidano Masaru; Yamamoto Mikio; Nakagawa Seiichi, IPSJ SIG Notes, 94, 109, 49, 56, 15 Dec. 1994
    It is difficult to recognize and understand spontaneous speech, because spontaneous speech has many phenomena of ambiguty such as omissions, inversions, repairs and so on. Since there is a trade-off between the looseness of linguistic constraints and recognition precision, the recognizer cannot perfectly recognize the completely free speech of the user on the current art of speech recognition. Therefore some problems arise. First problem is that there are gaps between sentences a dialog sysytem can accept and sentences the user wants to say. Second problem is that the semantic analyzer has ..., Information Processing Society of Japan (IPSJ), Japanese
  • Effects of a prior explanation on the speaker's utterance and recovery strategies of humans from misrecognition
    Itoh Toshihiko; Ohtani Kohji; Hidano Masaru; Yamamoto Mikio; Nakagawa Seiichi, IEICE technical report. Speech, 94, 398, 49, 56, 15 Dec. 1994
    It is difficult to recognize and understand spontaneous speech, because spontaneous speech has many phenomena of ambiguty such as omissions,inversions,repairs and so on.Since there is a trade-off between the looseness of linguistic constraints and recognition precision,the recognizer cannot perfectly recognize the completely free speech of the user on the current art of speech recognition. Therefore some problems arise.First problem is that there are gaps between sentences a dialog sysytem can accept and sentences the user wants to say.Second problem is that the semantic analyzer has to und..., The Institute of Electronics, Information and Communication Engineers, Japanese
  • Spontaneous Speech Understanding and Dialog System
    Yamamoto Mikio; Hidano Masaru; Itoh Toshihiko; Kai Atsuhiko; Nakagawa Seiichi, IPSJ SIG Notes, 94, 57, 91, 98, 07 Jul. 1994
    This paper describes the spoken dialog system for spontaneous speech. It is difficult to recognize and understand spontaneous speech, because spontaneous speech has many phenomena of ambiguity such as ommitions, inversions, repairs and so on. Since there is a trade-off between looseness of linguistic constraints and recognition precision, the recognition rate of speech recognizer is limited. Therefore, the interpretation part must cope with not only spontaneous sentences but illegal sentences with recognition errors. We developed the robust interpretation method and applied it to the dialog..., Information Processing Society of Japan (IPSJ), Japanese
■ Syllabus
  • 自然言語処理学特論, 2024年, 修士課程, 情報科学院
  • 自然言語処理学特論, 2024年, 博士後期課程, 情報科学研究科
  • 自然言語処理学特論, 2024年, 博士後期課程, 情報科学院
  • コンピュータ工学, 2024年, 学士課程, 工学部
  • 言語メディア理解論, 2024年, 学士課程, 工学部
  • 音声メディア応用論, 2024年, 学士課程, 工学部
■ Affiliated academic society
  • 情報処理学会
  • 人工知能学会
  • 日本音響学会
■ Research Themes
  • Human-like Spoken Dialog System
    Grants-in-Aid for Scientific Research(若手研究(B))
    2008 - 2010
    Toshihiko ITOH
    In this research we study how dialog rhythm influences user's comfort and reliability and propose a new framework for building spoken interfaces based on this framework. Although we confirmed user's increased satisfaction and smoothness of conversation, we have not reached the level of naturalness of human to human dialog.To achieve this we have improved our model for generating rhythmical dialogs, re-implemented it into the system and increased processing speed.In result, we achieved better human-likeness and reliability comparing to the previous system, but we could not reach evaluation s...
    Ministry of Education, Culture, Sports, Science and Technology, 若手研究(B), 北海道大学, Principal investigator, Competitive research funding, 20700150
  • 対話のリズムと身体性に着目した対話システムの開発
    科学研究費補助金(若手研究(B))
    2005 - 2007
    伊藤 敏彦
    本研究は音声インターフェイスにおいて、対話のリズムと身体性が、ユーザの快適性や安全性にどれほどの影響を与えるか明らかにし、これらの要素を音声インターフェイスに導入するための新たな枠組みを提案することである。昨年までこの目的のために対話リズムを考慮した音声対話システムの基本システムを構築した。これは人間同士の対話データから発話タイミングを機械学習し、ユーザの音響的特徴と言語的特徴から音声対話システムの発話タイミングを決定する方法で実現した。しかし、予備的な評価実験からユーザ満足度や発話のしやすさなどの向上は確認できたが、人間同士の対話に近い感覚を与えるまでには至らなかった。この原因を調査するために人間同士の対話データを収集し、発話タイミングや韻律的特徴を発話意図(発話内容)の違いにより分類・比較した結果、対話における話し手の発話タイミングは対話相手の発話特徴のみで決定できるわけではなく、話し手の発話意図(発話内容)や発話の重要度、感情などに大きく影響を受けることが示唆された。つまり、音声対話システムがリズミカルに発話するだけでは人間は機械に対して人間らしさ(安心感)を感じるわけではなく、発話意図(発話内容)や発話の重要度、感情なども考慮した適切なタイミングで発話することが人間らしさ(安心感)を感じさせるために重要である。また、聞き手も話し手の発話タイミングの変化やずれなどから発...
    文部科学省, 若手研究(B), 北海道大学, Principal investigator, Competitive research funding, 17700169
  • タイミングに着目した協調的音声インタラクション分析とハンズフリー対話システム構築
    科学研究費補助金(特定領域研究)
    2006 - 2006
    北岡 教英; 中川 聖一; 井藤 敏彦
    人間と機械が対話を行うことを考えるとき,機械が人間同士の会話と同様にあいつちなどさまざまな応答を自然に返すことができれば,より円滑な対話を行うことが期待できる.本研究では,特に雑談のような対話に着目し,自然な雑談対話をする上で最も重要である応答タイミングと韻律的同調性の生成手法を提案した。さらにそれを用いて、種々の雑談的対話現象を生成できる対話システムの枠組みを提案し、それに基づく対話システムを試作した.まず、ユーザーシステム間の対話において、システムは時々刻々ユーザ発話の特徴から決定ルールを用いて相槌や話者交替の判断やそのタイミングを生成し、リアルタイムに応答する手法を実現した。これにより、オーバラップした相槌や話者交代、さらに相手の発話内容を予測してオーバラップして発話する「共同補完」などの、自然な対話で生起するさまざま雑談現象に対応できる手法となることを示した。タイミング生成や、発話内容の選択には、最後のユーザ発話の表層的言語情報及び韻律情報(ピッチやパワーの変化パターン)を情報源として用いた。さらに、対話はスムーズで盛り上がった場合には対話者間の韻律、特に声の高さが同期して変動していることを、実際の人間同士の対話の分析により確かめた。そして、それをシステムで実現するために、ユーザの韻律に追従する韻律制御モデルを提案して、その挙動が人間の動作に似たものであることを示した...
    文部科学省, 特定領域研究, 豊橋技術科学大学->名古屋大学, Competitive research funding, 18049040
  • 高等学校化学を対象とするITS/Microworld統合型知的教育システムの構築
    科学研究費補助金(特定領域研究)
    2003 - 2004
    伊東 幸宏; 小西 達裕; 伊藤 敏彦
    (1)知識表現の再設計実用規模の知識ベース構築にあたり、単に規模によるコスト増大にとどまらない問題が生じた。一般に、問題解決の場面や学習の進行につれ、同一対象についての知識でも一貫しない表現を持つことがある。例えば高校化学では化学現象を再現する際、分子・原子間の対応関係レベル(反応式レベル)で考えれば良い場合と、反応に直接関わらない物質も含め、実空間における化学反応レベルで考えるべき場合がある。このように場面毎に知識の使い分けを必要とする場合、知識表現や推論機構を完全に一定のアーキテクチャのもとで設計することは難しい。この問題に対処するために、本研究では(a)ひとつの概念に複数の属性値を与えたり、ひとつの概念を表す知識を複数持つことを許容する知識表現手法(b)問題に応じて、適切な知識を選択する問題解決エンジンを設計実装した。(2)システムの再構築昨年度まで、システム開発にはUNIX環境におけるTCL/TK言語を用いていた。しかし現場教師との交流などを通じて、教育現場への可搬性、高校における教育用計算機環境の現状との整合性、システム運用の容易さ、処理速度の面から、Webブラウザ上で稼動するJava環境による開発がより望ましいとの知見を得た。知識表現は基本的にはプログラミング言語に依存しないが、部分的に修正を要する部分もあり、見直しを行った。(3)オーサリングツール設計のための基...
    文部科学省, 特定領域研究, 静岡大学, Coinvestigator not use grants, Competitive research funding, 15020230
  • Construction of learning system for novice programming learners
    Grants-in-Aid for Scientific Research(基盤研究(B))
    2002 - 2004
    Yukihiro ITOH; 伊藤 敏彦; 竹内 勇剛; 小西 達裕; 小暮 悟
    1.For the system generating verbal and visual explanations of target programs(1)Expansion of our program understanding mechanismIn our previous work, we proposed a mechanism to understand a behavior of a program in the domain world of "greater and lessen". We have developed an extended method for another domain world "two dimensional space" that is used for some numerical analysis such as 'Newton method' or 'Simpson method'. We achieved it by using heuristic rules to specify correspondence between a variable and an attribute of an entity in the domain world.(2)Development of a method to gen...
    Ministry of Education, Culture, Sports, Science and Technology, 基盤研究(B), 静岡大学, Coinvestigator not use grants, Competitive research funding, 14380081
  • Development for speech interface for form -based in formation access services on Web
    Grants-in-Aid for Scientific Research(基盤研究(B))
    2001 - 2003
    Seiichi NAKAGAWA; 甲斐 充彦; 北岡 教英; 小林 聡; 中野 崇; 伊藤 敏彦
    While some speech interface systems have been developed for accessing Web resources, they are limited for accessing some specific contents and they don't provide a universal interface for arbitrary information retrieval services on the WWW. We propose an interactive speech user interface system, which could be applied to many form-based information retrieval services of the WVVW. In particular, our system was implemented based on a client-server, a Web proxy-centered architecture and employed an information extraction and language processing of HTML documents for providing a general-purpose...
    Ministry of Education, Culture, Sports, Science and Technology, 基盤研究(B), 豊橋技術科学大学, Coinvestigator not use grants, Competitive research funding, 13558033
  • 韻律コーパスとその作成自動化
    科学研究費補助金(特定領域研究(B), 特定領域研究)
    2000 - 2003
    北澤 茂良; 北村 達也; Campbell Nick; 板橋 秀一; 伊藤 敏彦; 市川 熹; 桐山 伸也; Nick Campbell
    1.新規の韻律コーパスの作成(静岡大学)韻律コーパスとして日本語のMULTEXT韻律データベースの40パッセジにJ-ToBI韻律タグ付けを完了し、同様の手法で、筑波大学と千葉大学と東京大孝と東工大グループの既存音声コーパスの各種案内読上げと模擬対話と対話音声、マルチモーダル対話音声、天気予報、模擬感情音声へのJ-ToBIタグ付けを行った。これらのラベリング作業について研究支援者を雇用して行った。言語情報を利用した韻律ラベリング手法の開発と、音素ラベリング支援のための音素自動セグメンテーションと、連接境界における音響的特徴の詳細について研究成果を発表した。2.既存の音声コーパスの韻律分析と韻律コーパスの作成(筑波大学)既存の音声コーパスとして、日本音響学会「研究用連続音声データベース」の各種案内読上げ文と模擬対話、重点領域研究「音声対話」の対話音声コーパス、の3種のコーパスに基本周波数分析と発話ラベルと付与した。200ms以上の無音区間で区切られた音声区間を発話単位として、発話単位長を読上げ音声と模擬対話音声で比較した。模擬対話では間投詞や割込みによって発話単位が短くなる。音声パワーと基本周波数の標準偏差は対話に比べて読上げは狭い範囲に集中していることが分かった。3.ジェスチャー・顔表情付の対話音声収録(千葉大学)音声対話における視線や頷きなどジェスチャーを記録・分析するため、...
    文部科学省, 特定領域研究(B), 特定領域研究, 静岡大学, Coinvestigator not use grants, Competitive research funding, 12132204
  • Human Interface that detects user's intentions and emotions from their gestures and facial expressions
    Grants-in-Aid for Scientific Research(基盤研究(C))
    1999 - 2001
    Hiromasa NAKATANI; 伊藤 敏彦; 佐治 斉
    Conventional interactions between humans and machines are performed mainly by keyboards or special pointing devices. In this project, we have investigated human interface that analyzes user's gestures and facial expressions and identifies their intentions and emotions. The main subjects of this projects are as follows :1. 3D facial motion measuring systemWe have developed a method for measuring three- dimensional moving facial shapes. The system uses two light sources and a slit pattern projector. Natural facial motions can be entered to the system with the sampling rate up to 2/15 sec.2. H...
    Ministry of Education, Culture, Sports, Science and Technology, 基盤研究(C), 静岡大学, Coinvestigator not use grants, Competitive research funding, 11832012
  • 対話訓練システムのための言語処理・文脈処理に関する研究
    科学研究費補助金(特定領域研究(A))
    2000 - 2000
    伊東 幸宏; 小西 達裕; 近藤 真; 中谷 広正; 伊藤 敏彦
    1)入力文の意味解釈能力の向上に関する検討1-1)対話訓練に効果的な協調的タスクを設定し、取り扱う必要がある概念・語彙・文体について事例分析を行った。1-2)同義表現の吸収、文意の文脈への位置付け、文意の統合(蓄積)を可能にする意味表現方法を開発した。この意味表現方法は以下のような特色を持つ。・表層の依存構造によらず、一定の表現形式で意味が表現可能・意味内容毎に、それを位置付ける場所が決まっている2)対話訓練を指向した対話制御に関する検討2.1)協調的タスクのためのプランニング手法を開発し、特に1-1)で設定したタスクについて知識の設計を行った。2-2)タスクに対する学習者の発話の有効性を踏まえてシステムが取るべき教育行動を、対話戦略として実装した。3)タスク設定に関する検討:学習目標を効果的に達成する上で適したタスクを自動設定する手法を開発することをめざし、特に今年度は、学習者に与えるタスクと、それにより学習される事項の関係を整理した。4)試作システムの構築:以上の成果を踏まえて、ホテル検索を中心とした対話をテストベッドにした日本語対話システムを構築した。現状では、対話範囲をホテル検索や観光名所案内等に限定した上で、「ホテルを探して下さい」・「名古屋テレビ塔はどこにありますか」等の、文法に則った正しい文の入力を受け付けることが可能である。この入力の中には「依頼」や「動詞のて...
    文部科学省, 特定領域研究(A), 静岡大学, Coinvestigator not use grants, Competitive research funding, 12040219
  • 協調的音声対話制御
    Competitive research funding
  • 統計的音声言語処理
    Competitive research funding
  • Cooperative Speech Dialogue Manag
    Competitive research funding
  • Stochastic speech Language Processing
    Competitive research funding
■ Industrial Property Rights