Toward Active Learning in Data Selection: Automatic Discovery of Language Features During Elicitation

15 years 8 months ago

Download www.lrec-conf.org

Data Selection has emerged as a common issue in language technologies. We define Data Selection as the choosing of a subset of training data that is most effective for a given task. This paper describes deductive feature detection, one component of a data selection system for machine translation. Feature detection determines whether features such as tense, number, and person are expressed in a language. The database of the The World Atlas of Language Structures provides a gold standard against which to evaluate feature detection. The discovered features can be used as input to a Navigator, which uses active learning to determine which piece of language data is the most important to acquire next.

Jonathan Clark, Robert E. Frederking, Lori S. Levi

Real-time Traffic

Data Selection | Deductive Feature Detection | Education | Feature Detection | LREC 2008 |

claim paper

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	LREC
Authors	Jonathan Clark, Robert E. Frederking, Lori S. Levin

Comments (0)

Sciweavers

Toward Active Learning in Data Selection: Automatic Discovery of Language Features During Elicitation

Data Selection | Deductive Feature Detection | Education | Feature Detection | LREC 2008 |

Explore & Download

Productivity Tools

Sciweavers