Sciweavers

233 search results - page 43 / 47
» Clustering documents in a web directory
Sort
View
WSDM
2010
ACM
204views Data Mining» more  WSDM 2010»
14 years 2 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...
CCGRID
2005
IEEE
14 years 1 months ago
Bootstrapping to a semantic grid
The Scientific Annotation Middleware (SAM) is a set of components and services that enable researchers, applications, problem solving environments (PSE) and software agents to cre...
Jens Schwidder, Tara D. Talbott, James D. Myers
WCE
2007
13 years 8 months ago
Qualitative and Quantitative Criteria for the Concept Evaluation Task
act—Ontological concept evaluation is a difficult task. Till now, it is done either by domain expert or a knowledge base (thesaurus, ontology, etc.). In this research, we propose...
Lobna Karoui, Supelec France, Nabil El-Kadhi
JMLR
2010
107views more  JMLR 2010»
13 years 2 months ago
Modeling Knowledge Worker Activity
This paper describes an approach to constructing a probabilistic process model representing knowledge worker activity out of a log of primitive events, such as e-mails, web page v...
Tadej Stajner, Dunja Mladenic
ICMCS
2007
IEEE
149views Multimedia» more  ICMCS 2007»
14 years 1 months ago
SICO: A System for Detection of Near-Duplicate Images During Search
Duplicate and near-duplicate digital image matching is beneficial for image search in terms of collection management, digital content protection, and search efficiency. In this ...
Jun Jie Foo, Ranjan Sinha, Justin Zobel