The D-TUNA Corpus: A Dutch Dataset for the Evaluation of Referring Expression Generation Algorithms



In this paper, we present the D-TUNA corpus, the first semantically annotated corpus of referring expressions in Dutch. Its primary function is to evaluate and improve the performance of referring expression generation (REG) algorithms. Such algorithms are computational models that automatically generate referring expressions by computing how a specific target can be identified to an addressee by distinguishing it from a set of distractor objects. We performed a large-scale production experiment in which participants were asked to describe furniture items and people, and annotated all descriptions with semantic information regarding the target and the distractor objects. Besides being useful for evaluating REG algorithms, the corpus addresses several other research goals. Firstly, the corpus contains both written and spoken referring expressions directed at an addressee, which enables systematic analyses of how modality (text or speech) influences the human production of referring expressions....
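To make the idea of identifying a target by distinguishing it from distractors concrete, here is a minimal Python sketch in the style of the Incremental Algorithm of Dale and Reiter (1995), a well-known REG algorithm. It is not taken from the paper: the attribute names (type, colour, size), their values, and the preference order are assumptions chosen for illustration, and the function name is hypothetical.

```python
from typing import Dict, List, Optional

Entity = Dict[str, str]  # attribute name -> value, e.g. {"type": "chair"}


def incremental_description(
    target: Entity,
    distractors: List[Entity],
    preference_order: List[str],
) -> Optional[Entity]:
    """Select attributes of the target until every distractor is ruled out.

    Simplified sketch: attributes are tried in a fixed preference order and
    kept only if they exclude at least one remaining distractor.
    Returns None when the target cannot be distinguished.
    """
    description: Entity = {}
    remaining = list(distractors)
    if not remaining:
        return description  # nothing to distinguish the target from

    for attribute in preference_order:
        if attribute not in target:
            continue
        value = target[attribute]
        # Distractors that share this value are NOT ruled out by it.
        survivors = [d for d in remaining if d.get(attribute) == value]
        if len(survivors) < len(remaining):
            description[attribute] = value
            remaining = survivors
        if not remaining:
            return description
    return None


# Made-up furniture scene (values are illustrative, not corpus data).
target = {"type": "chair", "colour": "red", "size": "large"}
distractors = [
    {"type": "chair", "colour": "blue", "size": "large"},
    {"type": "sofa", "colour": "red", "size": "small"},
    {"type": "chair", "colour": "red", "size": "small"},
]
result = incremental_description(target, distractors, ["type", "colour", "size"])
print(result)
```

In this made-up scene all three attributes are needed, so the sketch would select type, colour, and size. A corpus of human descriptions annotated with the same kind of attribute-value information makes it possible to compare such algorithm output against what speakers actually produce.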

Ruud Koolen, Emiel Krahmer


D-TUNA Corpus | Distractor Objects | Education | LREC 2010 | REG Algorithms |


Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Ruud Koolen, Emiel Krahmer
