Sciweavers

KDD
2012
ACM
Learning from crowds in the presence of schools of thought
Crowdsourcing has recently become popular among machine learning researchers and social scientists as an effective way to collect large-scale experimental data from distributed w...
Yuandong Tian, Jun Zhu
CIKM
2005
Springer
A novel refinement approach for text categorization
In this paper we present a novel strategy, DragPushing, for improving the performance of text classifiers. The strategy is generic and takes advantage of training errors to succes...
Songbo Tan, Xueqi Cheng, Moustafa Ghanem, Bin Wang...
SIAMMAX
2010
Fast Algorithms for the Generalized Foley-Sammon Discriminant Analysis
Linear Discriminant Analysis (LDA) is one of the most popular approaches for feature extraction and dimension reduction to overcome the curse of the dimensionality of the high-dime...
Lei-Hong Zhang, Li-Zhi Liao, Michael K. Ng
EMNLP
2008
Automatic Inference of the Temporal Location of Situations in Chinese Text
Chinese is a language that does not have morphological tense markers that provide explicit grammaticalization of the temporal location of situations (events or states). However, i...
Nianwen Xue
KDD
2005
ACM
Email data cleaning
Addressed in this paper is the issue of 'email data cleaning' for text mining. Many text mining applications need to take emails as input. Email data is usually noisy and thus i...
Jie Tang, Hang Li, Yunbo Cao, ZhaoHui Tang