Sciweavers

» Document clustering using word clusters via the information ...
SIGIR
2003
ACM
14 years 1 month ago
Text categorization by boosting automatically extracted concepts
Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...
Lijuan Cai, Thomas Hofmann
ACL
2007
13 years 10 months ago
Unsupervised Language Model Adaptation Incorporating Named Entity Information
Language model (LM) adaptation is important for both speech and language processing. It is often achieved by combining a generic LM with a topic-specific model that is more releva...
Feifan Liu, Yang Liu
ICML
2009
IEEE
14 years 9 months ago
Learning non-redundant codebooks for classifying complex objects
Codebook-based representations are widely employed in the classification of complex objects such as images and documents. Most previous codebook-based methods construct a single c...
Wei Zhang, Akshat Surve, Xiaoli Fern, Thomas G. Di...
DLIB
2002
13 years 8 months ago
Information Retrieval by Semantic Analysis and Visualization of the Concept Space of D-Lib Magazine
In this article we present a method for retrieving documents from a digital library through a visual interface based on automatically generated concepts. We used a vocabulary gene...
Junliang Zhang, Javed Mostafa, Himansu Tripathy
ECIR
2003
Springer
13 years 10 months ago
Taming Wild Phrases
In this paper the suitability of different document representations for automatic document classification is compared, investigating a whole range of representations be...
Cornelis H. A. Koster, Marc Seutter