Sciweavers

85 search results - page 15 / 17
» Improving Text Classification by Web Corpora
Sort
View
AUSDM
2008
Springer
243views Data Mining» more  AUSDM 2008»
13 years 9 months ago
Structure-Based Document Model with Discrete Wavelet Transforms and Its Application to Document Classification
Term signal is an existing text representation that depicts a term as a vector of frequencies of occurrences in a number of user-defined partitions of a document. Although term si...
Supphachai Thaicharoen, Tom Altman, Krzysztof J. C...
PKDD
2004
Springer
205views Data Mining» more  PKDD 2004»
14 years 24 days ago
Breaking Through the Syntax Barrier: Searching with Entities and Relations
The next wave in search technology will be driven by the identification, extraction, and exploitation of real-world entities represented in unstructured textual sources. Search sy...
Soumen Chakrabarti
HIS
2003
13 years 8 months ago
Evolving Better Stoplists for Document Clustering and Web Intelligence
: Text classification, document clustering and similar document analysis tasks are currently the subject of significant global research, since such areas underpin web intelligence,...
Mark P. Sinka, David Corne
ICML
2006
IEEE
14 years 8 months ago
Pachinko allocation: DAG-structured mixture models of topic correlations
Latent Dirichlet allocation (LDA) and other related topic models are increasingly popular tools for summarization and manifold discovery in discrete data. However, LDA does not ca...
Wei Li, Andrew McCallum
KDD
2007
ACM
184views Data Mining» more  KDD 2007»
14 years 7 months ago
Dynamic hybrid clustering of bioinformatics by incorporating text mining and citation analysis
To unravel the concept structure and dynamics of the bioinformatics field, we analyze a set of 7401 publications from the Web of Science and MEDLINE databases, publication years 1...
Bart De Moor, Frizo A. L. Janssens, Wolfgang Gl&au...