Sciweavers

596 search results - page 27 / 120
» Text Mining at the Term Level
AMW
2009
15 years 4 months ago
T3: On Mapping Text To Time Series
We investigate if the mapping between text and time series data is feasible such that relevant data mining problems in text can find their counterparts in time series (and vice ver...
Tao Yang, Dongwon Lee
KDD
2006
ACM
16 years 4 months ago
Extracting key-substring-group features for text classification
In many text classification applications, it is appealing to take every document as a string of characters rather than a bag of words. Previous research studies in this area mostl...
Dell Zhang, Wee Sun Lee
SIGIR
2010
ACM
14 years 10 months ago
Efficient partial-duplicate detection based on sequence matching
With the ever-increasing growth of the Internet, numerous copies of documents have become a serious problem for search engines, opinion mining, and many other web applications. Since parti...
Qi Zhang, Yue Zhang, Haomin Yu, Xuanjing Huang
KDD
2007
ACM
16 years 4 months ago
Generalized component analysis for text with heterogeneous attributes
We present a class of richly structured, undirected hidden variable models suitable for simultaneously modeling text along with other attributes encoded in different modalities. O...
Xuerui Wang, Chris Pal, Andrew McCallum
WWW
2003
ACM
16 years 4 months ago
Mining newsgroups using networks arising from social behavior
Recent advances in information retrieval over hyperlinked corpora have convincingly demonstrated that links carry less noisy information than text. We investigate the feasibility of...
Rakesh Agrawal, Sridhar Rajagopalan, Ramakrishnan ...