Quantitative and Qualitative Evaluation of Darpa Communicator Spoken Dialogue Systems

15 years 8 months ago

Download acl.ldc.upenn.edu

This paper describes the application of the PARADISE evaluation framework to the corpus of 662 human-computer dialogues collected in the June 2000 Darpa Communicator data collection. We describe results based on the standard logfile metrics as well as results based on additional qualitative metrics derived using the DATE dialogue act tagging scheme. We show that performance models derived via using the standard metrics can account for 37% of the variance in user satisfaction, and that the addition of DATE metrics improved the models by an absolute 5%.

Marilyn A. Walker, Rebecca J. Passonneau, Julie E.

Real-time Traffic

ACL 2001 | ACL 2007 | Additional Qualitative Metrics | Standard Logfile Metrics | Standard Metrics |

claim paper

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2001
Where	ACL
Authors	Marilyn A. Walker, Rebecca J. Passonneau, Julie E. Boland

Comments (0)

Sciweavers

Quantitative and Qualitative Evaluation of Darpa Communicator Spoken Dialogue Systems

ACL 2001 | ACL 2007 | Additional Qualitative Metrics | Standard Logfile Metrics | Standard Metrics |

Explore & Download

Productivity Tools

Sciweavers