Sciweavers

PVLDB
2010
13 years 2 months ago
HaLoop: Efficient Iterative Data Processing on Large Clusters
The growing demand for large-scale data mining and data analysis applications has led both industry and academia to design new types of highly scalable data-intensive computing pl...
Yingyi Bu, Bill Howe, Magdalena Balazinska, Michae...
JMLR
2010
13 years 2 months ago
Attribute Selection Based on FRiS-Compactness
To classify a new object in data mining, one commonly estimates its similarity to the given classes. The Function of Rival Similarity (FRiS) is intended to calculate quantitative mea...
Nikolay G. Zagoruiko, Irina V. Borisova, Vladimir ...
JMLR
2010
13 years 2 months ago
Feature Selection, Association Rules Network and Theory Building
As the size and dimensionality of data sets increase, the task of feature selection has become increasingly important. In this paper we demonstrate how association rules can be us...
Sanjay Chawla
WWW
2011
ACM
13 years 2 months ago
Detecting group review spam
It is well-known that many online reviews are not written by genuine users of products, but by spammers who write fake reviews to promote or demote some target products. Although ...
Arjun Mukherjee, Bing Liu, Junhui Wang, Natalie S....
WWW
2011
ACM
13 years 2 months ago
Domain-independent entity extraction from web search query logs
Query logs of a Web search engine have been increasingly used as a vital source for data mining. This paper presents a study on large-scale domain-independent entity extraction fro...
Alpa Jain, Marco Pennacchiotti
WIDM
2011
ACM
13 years 2 months ago
Filtered-top-k association discovery
Association mining has been one of the most intensively researched areas of data mining. However, direct uptake of the resulting technologies has been relatively low. This paper e...
Geoffrey I. Webb
ASC
2011
13 years 2 months ago
Fuzzy sets in machine learning and data mining
Machine learning, data mining, and several related research areas are concerned with methods for the automated induction of models and the extraction of interesting patterns from ...
Eyke Hüllermeier
AISS
2010
13 years 2 months ago
An Investigation into Influence Factor of Student Programming Grade Using Association Rule Mining
Computer programming is one of the most essential skills that every graduate has to acquire. However, there are reports that graduates are unable to write programs well. Researches in...
Mohamad Farhan Mohamad Mohsin, Mohd Helmy Abd Waha...
ADMA
2010
Springer
13 years 2 months ago
Exploiting Concept Clumping for Efficient Incremental E-Mail Categorization
We introduce a novel approach to incremental e-mail categorization based on identifying and exploiting "clumps" of messages that are classified similarly. Clumping reflec...
Alfred Krzywicki, Wayne Wobcke
ADMA
2010
Springer
13 years 4 months ago
On Probabilistic Models for Uncertain Sequential Pattern Mining
Muhammad Muzammal, Rajeev Raman