Sciweavers

2735 search results - page 287 / 547
» Comparing notions of randomness
Sort
View
LREC
2008
155views Education» more  LREC 2008»
14 years 17 days ago
Exploiting Multiply Annotated Corpora in Biomedical Information Extraction Tasks
This paper discusses the problem of utilising multiply annotated data in training biomedical information extraction systems. Two corpora, annotated with entities and relations, an...
Barry Haddow, Beatrice Alex
LREC
2008
73views Education» more  LREC 2008»
14 years 17 days ago
Acquisition and Evaluation of a Dialog Corpus through WOz and Dialog Simulation Techniques
In this paper, we present a comparison between two corpora acquired by means of two different techniques. The first corpus was acquired by means of the Wizard of Oz technique. A d...
David Griol, Lluís F. Hurtado, Encarna Sega...
AAAI
2006
14 years 16 days ago
Estimating Search Tree Size
We propose two new online methods for estimating the size of a backtracking search tree. The first method is based on a weighted sample of the branches visited by chronological ba...
Philip Kilby, John K. Slaney, Sylvie Thiéba...
ACL
2006
14 years 16 days ago
Subword-Based Tagging for Confidence-Dependent Chinese Word Segmentation
We proposed a subword-based tagging for Chinese word segmentation to improve the existing character-based tagging. The subword-based tagging was implemented using the maximum entr...
Ruiqiang Zhang, Gen-ichiro Kikui, Eiichiro Sumita
EMNLP
2004
14 years 16 days ago
Active Learning and the Total Cost of Annotation
Active learning (AL) promises to reduce the cost of annotating labeled datasets for trainable human language technologies. Contrary to expectations, when creating labeled training...
Jason Baldridge, Miles Osborne