Sciweavers

624 search results - page 99 / 125
» The Tradeoffs of Large Scale Learning
Sort
View
EDBT
1994
ACM
117views Database» more  EDBT 1994»
13 years 12 months ago
Content Routing for Distributed Information Servers
We describe a system that provides query based associative access to the contents of distributed information servers. In typical distributed information systems there are so many o...
Mark A. Sheldon, Andrzej Duda, Ron Weiss, James O'...
ICDM
2010
IEEE
189views Data Mining» more  ICDM 2010»
13 years 5 months ago
S4: Distributed Stream Computing Platform
Abstract--S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop applications for processing continu...
Leonardo Neumeyer, Bruce Robbins, Anish Nair, Anan...
COLING
2010
13 years 2 months ago
Resolving Surface Forms to Wikipedia Topics
Ambiguity of entity mentions and concept references is a challenge to mining text beyond surface-level keywords. We describe an effective method of disambiguating surface forms an...
Yiping Zhou, Lan Nie, Omid Rouhani-Kalleh, Flavian...
WWW
2010
ACM
14 years 2 months ago
Large-scale bot detection for search engines
In this paper, we propose a semi-supervised learning approach for classifying program (bot) generated web search traffic from that of genuine human users. The work is motivated by...
Hongwen Kang, Kuansan Wang, David Soukal, Fritz Be...
WWW
2009
ACM
14 years 8 months ago
An axiomatic approach for result diversification
Understanding user intent is key to designing an effective ranking system in a search engine. In the absence of any explicit knowledge of user intent, search engines want to diver...
Sreenivas Gollapudi, Aneesh Sharma