Sciweavers

1186 search results - page 148 / 238
» Improving Random Walk Performance
Sort
View
EUC
2006
Springer
14 years 2 months ago
Distributed Proximity-Aware Peer Clustering in BitTorrent-Like Peer-to-Peer Networks
In this paper, we propose a hierarchical architecture for grouping peers into clusters in a large-scale BitTorrent-like underlying overlay network in such a way that clusters are e...
Bin Xiao, Jiadi Yu, Zili Shao, Minglu Li
SIGIR
2010
ACM
14 years 2 months ago
The impact of collection size on relevance and diversity
It has been observed that precision increases with collection size. One explanation could be that the redundancy of information increases, making it easier to find multiple docum...
Marijn Koolen, Jaap Kamps
FLAIRS
2007
14 years 1 months ago
A Distance-Based Over-Sampling Method for Learning from Imbalanced Data Sets
Many real-world domains present the problem of imbalanced data sets, where examples of one classes significantly outnumber examples of other classes. This makes learning difficu...
Jorge de la Calleja, Olac Fuentes
PAKDD
2010
ACM
134views Data Mining» more  PAKDD 2010»
14 years 1 months ago
Generating Diverse Ensembles to Counter the Problem of Class Imbalance
Abstract. One of the more challenging problems faced by the data mining community is that of imbalanced datasets. In imbalanced datasets one class (sometimes severely) outnumbers t...
T. Ryan Hoens, Nitesh V. Chawla
ACL
2008
14 years 19 days ago
Word Clustering and Word Selection Based Feature Reduction for MaxEnt Based Hindi NER
Statistical machine learning methods are employed to train a Named Entity Recognizer from annotated data. Methods like Maximum Entropy and Conditional Random Fields make use of fe...
Sujan Kumar Saha, Pabitra Mitra, Sudeshna Sarkar