Sciweavers

139 search results - page 18 / 28
» An Approach to Identify Duplicated Web Pages
EUSFLAT
2003
13 years 9 months ago
Proximity fuzzy clustering for web context analysis
This study extends the web classification approach through a proximity-based fuzzy clustering sensitive to the influence of the page. The proximity-based fuzzy clustering works in ...
Vincenzo Loia, Witold Pedrycz, Sabrina Senatore
CIKM
2011
Springer
12 years 7 months ago
Focusing on novelty: a crawling strategy to build diverse language models
Word prediction performed by language models plays an important role in many tasks, e.g. word sense disambiguation, speech recognition, handwriting recognition, query spelling an...
Luciano Barbosa, Srinivas Bangalore
ECIR
2008
Springer
13 years 9 months ago
Exploiting Locality of Wikipedia Links in Entity Ranking
Information retrieval from web and XML document collections is ever more focused on returning entities instead of web pages or XML elements. There are many research field...
Jovan Pehcevski, Anne-Marie Vercoustre, James A. T...
SAC
2009
ACM
14 years 11 days ago
Towards "WYDIWYS" for MIMI using concept analysis
This paper presents a novel software engineering approach for developing a dynamic web interface that meets the quality criterion of “WYDIWYS” - What You Do Is What You See. T...
Jie Dai, Remo Mueller, Jacek Szymanski, Guo-Qiang ...
SIGIR
1998
ACM
13 years 12 months ago
Improved Algorithms for Topic Distillation in a Hyperlinked Environment
This paper addresses the problem of topic distillation on the World Wide Web, namely, given a typical user query, to find quality documents related to the query topic. Connectivity...
Krishna Bharat, Monika Rauch Henzinger