Sciweavers

560 search results - page 26 / 112
» Text Clustering with Feature Selection by Using Statistical ...
IAT
2009
IEEE
14 years 2 months ago
Multilingual Statistical News Summarisation: Preliminary Experiments with English
In this paper we present a generic approach for summarising multilingual news clusters such as the ones produced by the Europe Media Monitor (EMM) system. It is generic because ...
Mijail Alexandrov Kabadjov, Josef Steinberger, Bru...
JAIR
2010
13 years 6 months ago
Which Clustering Do You Want? Inducing Your Ideal Clustering with Minimal Feedback
While traditional research on text clustering has largely focused on grouping documents by topic, it is conceivable that a user may want to cluster documents along other dimension...
Sajib Dasgupta, Vincent Ng
ICDAR
2009
IEEE
13 years 5 months ago
Document Content Extraction Using Automatically Discovered Features
We report an automatic feature discovery method that achieves results comparable to a manually chosen, larger feature set on a document image content extraction problem: the locat...
Sui-Yu Wang, Henry S. Baird, Chang An
ICML
2006
IEEE
14 years 8 months ago
Feature subset selection bias for classification learning
Feature selection is often applied to high-dimensional data prior to classification learning. Using the same training dataset in both selection and learning can result in so-called ...
Surendra K. Singhi, Huan Liu
UAI
2008
13 years 9 months ago
Topic Models Conditioned on Arbitrary Features with Dirichlet-multinomial Regression
Although fully generative models have been successfully used to model the contents of text documents, they are often awkward to apply to combinations of text data and document met...
David M. Mimno, Andrew McCallum