Sciweavers

46 search results - page 5 / 10
» Data extraction as text categorization: an experiment with t...
INTERSPEECH
2010
13 years 2 months ago
Semi-supervised extractive speech summarization via co-training algorithm
Supervised methods for extractive speech summarization require a large training set. Summary annotation is often expensive and time consuming. In this paper, we exploit semisuperv...
Shasha Xie, Hui Lin, Yang Liu
ACL
2011
12 years 11 months ago
Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations
Information extraction (IE) holds the promise of generating a large-scale knowledge base from the Web’s natural language text. Knowledge-based weak supervision, using structured...
Raphael Hoffmann, Congle Zhang, Xiao Ling, Luke S....
BMCBI
2005
13 years 7 months ago
Data-poor categorization and passage retrieval for Gene Ontology Annotation in Swiss-Prot
Background: In the context of the BioCreative competition, where training data were very sparse, we investigated two complementary tasks: 1) given a Swiss-Prot triplet, containing...
Frédéric Ehrler, Antoine Geissbü...
LREC
2010
13 years 9 months ago
Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9
CzEng 0.9 is the third release of a large parallel corpus of Czech and English. For the current release, CzEng was extended by a significant amount of texts from various types of so...
Ondřej Bojar, Adam Liška, Zdeněk Žabokrtský
COLING
2008
13 years 9 months ago
The Choice of Features for Classification of Verbs in Biomedical Texts
We conduct large-scale experiments to investigate optimal features for classification of verbs in biomedical texts. We introduce a range of feature sets and associated extraction ...
Anna Korhonen, Yuval Krymolowski, Nigel Collier