Dialogues in Context: An Objective User-Oriented Evaluation Approach for Virtual Human Dialogue | Natural Language Dialogue group

Title	Dialogues in Context: An Objective User-Oriented Evaluation Approach for Virtual Human Dialogue
Publication Type	Conference Paper
Year of Publication	2010
Authors	Robinson, S., A. Roque, and D. R. Traum
Conference Name	7th International Conference on Language Resources and Evaluation (LREC)
Date Published	May 19-21, 2010
Conference Location	Valletta, Malta
Abstract	As conversational agents are now being developed to encounter more complex dialogue situations, it is increasingly difficult to find satisfactory methods for evaluating these agents. Task-based measures are insufficient where there is no clearly defined task. While user-based evaluation methods may give a general sense of the quality of an agent's performance, they shed little light on the relative quality or success of specific features of dialogue that are necessary for system improvement. This paper examines current dialogue agent evaluation practices and motivates the need for a more detailed approach for defining and measuring the quality of dialogues between agent and user. We present a framework for evaluating the dialogue competence of artificial agents involved in complex and underspecified tasks when conversing with people. A multi-part coding scheme is proposed that provides a qualitative analysis of human utterances and rates the appropriateness of the agent's responses to these utterances. The scheme is outlined, and then used to evaluate Staff Duty Officer Moleno, a virtual guide in Second Life.
URL	http://people.ict.usc.edu/~traum/Papers/Robinson-LREC2010.pdf

Projects:

Virtual Intelligent Guides for Online Realms (VIGOR)

Virtual Characters:

Staff Duty Officer Moleno

Corpora:

Second Life Corpus

Research Efforts:

Dialogue Corpus Annotation

Evaluation of Dialogue Systems and Virtual Humans

Typology of Dialogue Domains
