Sciweavers

188 search results - page 5 / 38
» The hybrid representation model for web document classificat...
Sort
View
ECIR
2004
Springer
13 years 8 months ago
Complex Linguistic Features for Text Classification: A Comprehensive Study
Abstract. Previous researches on advanced representations for document retrieval have shown that statistical state-of-the-art models are not improved by a variety of different ling...
Alessandro Moschitti, Roberto Basili
ANLP
1994
103views more  ANLP 1994»
13 years 8 months ago
Exploiting Sophisticated Representations for Document Retrieval
The use of NLP techniques for document classification has not produced significant improvements in performance within the standard term weighting statistical assignment paradigm (...
Steven Finch
EMNLP
2010
13 years 5 months ago
Translingual Document Representations from Discriminative Projections
Representing documents by vectors that are independent of language enhances machine translation and multilingual text categorization. We use discriminative training to create a pr...
John Platt, Kristina Toutanova, Wen-tau Yih
WWW
2006
ACM
14 years 7 months ago
Using symbolic objects to cluster web documents
Web Clustering is useful for several activities in the WWW, from automatically building web directories to improve retrieval performance. Nevertheless, due to the huge size of the...
Esteban Meneses, Oldemar Rodríguez-Rojas
ECIR
2007
Springer
13 years 8 months ago
Searching Documents Based on Relevance and Type
This paper extends previous work on document retrieval and document type classification, addressing the problem of ‘typed search’. Specifically, given a query and a designated ...
Jun Xu, Yunbo Cao, Hang Li, Nick Craswell, Yalou H...