Sciweavers

732 search results - page 133 / 147
» Mining Postal Addresses
Sort
View
KAIS
2010
144views more  KAIS 2010»
13 years 6 months ago
Boosting support vector machines for imbalanced data sets
Real world data mining applications must address the issue of learning from imbalanced data sets. The problem occurs when the number of instances in one class greatly outnumbers t...
Benjamin X. Wang, Nathalie Japkowicz
PVLDB
2010
135views more  PVLDB 2010»
13 years 6 months ago
P2PDocTagger: Content management through automated P2P collaborative tagging
As the amount of user generated content grows, personal information management has become a challenging problem. Several information management approaches, such as desktop search,...
Hock Hee Ang, Vivekanand Gopalkrishnan, Wee Keong ...
TKDE
2010
137views more  TKDE 2010»
13 years 6 months ago
A Survey on Transfer Learning
—A major assumption in many machine learning and data mining algorithms is that the training and future data must be in the same feature space and have the same distribution. How...
Sinno Jialin Pan, Qiang Yang
COLING
2010
13 years 2 months ago
The Bag-of-Opinions Method for Review Rating Prediction from Sparse Text Patterns
The problem addressed in this paper is to predict a user's numeric rating in a product review from the text of the review. Unigram and n-gram representations of text are comm...
Lizhen Qu, Georgiana Ifrim, Gerhard Weikum
CIKM
2011
Springer
12 years 7 months ago
Scalable entity matching computation with materialization
Entity matching (EM) is the task of identifying records that refer to the same real-world entity from different data sources. While EM is widely used in data integration and data...
Sanghoon Lee, Jongwuk Lee, Seung-won Hwang