Sciweavers

542 search results - page 49 / 109
» Learning author-topic models from text corpora
EMNLP
2009
13 years 5 months ago
Generalized Expectation Criteria for Bootstrapping Extractors using Record-Text Alignment
Traditionally, machine learning approaches for information extraction require human-annotated data that can be costly and time-consuming to produce. However, in many cases, there ...
Kedar Bellare, Andrew McCallum
ICML
2010
IEEE
13 years 8 months ago
Distance dependent Chinese restaurant processes
We develop the distance dependent Chinese restaurant process (CRP), a flexible class of distributions over partitions that allows for nonexchangeability. This class can be used to...
David M. Blei, Peter Frazier
SIGIR
1999
ACM
14 years 8 hours ago
The Decomposition of Human-Written Summary Sentences
We define the problem of decomposing human-written summary sentences and propose a novel Hidden Markov Model solution to the problem. Human summarizers often rely on cutting and ...
Hongyan Jing, Kathleen McKeown
BMCBI
2010
13 years 7 months ago
Moara: a Java library for extracting and normalizing gene and protein mentions
Background: Gene/protein recognition and normalization are important preliminary steps for many biological text mining tasks, such as information retrieval, protein-protein intera...
Mariana L. Neves, José María Carazo,...
ESWS
2010
Springer
13 years 6 months ago
The Semantic Gap of Formalized Meaning
Recent work in ontology learning and text mining has mainly focused on engineering methods to solve practical problems. In this thesis, we investigate methods that can substantially...
Sebastian Hellmann