Sciweavers

894 search results - page 107 / 179
» Analysis of Web Search Engine Clicked Documents
Sort
View
HPDC
2010
IEEE
13 years 9 months ago
ParaText: scalable text modeling and analysis
Automated analysis of unstructured text documents (e.g., web pages, newswire articles, research publications, business reports) is a key capability for solving important problems ...
Daniel M. Dunlavy, Timothy M. Shead, Eric T. Stant...
SIGMOD
2010
ACM
212views Database» more  SIGMOD 2010»
13 years 6 months ago
Understanding deep web search interfaces: a survey
This paper presents a survey on the major approaches to search interface understanding. The Deep Web consists of data that exist on the Web but are inaccessible via text search en...
Ritu Khare, Yuan An, Il-Yeol Song
VLDB
2000
ACM
133views Database» more  VLDB 2000»
13 years 11 months ago
Memex: A Browsing Assistant for Collaborative Archiving and Mining of Surf Trails
Keyword indices, topic directories, and link-based rankings are used to search and structure the rapidly growing Web today. Surprisingly little use is made of years of browsing ex...
Soumen Chakrabarti, Sandeep Srivastava, Mallela Su...
ECAI
2006
Springer
13 years 11 months ago
Disambiguating Personal Names on the Web Using Automatically Extracted Key Phrases
Abstract. When you search for information regarding a particular person on the web, a search engine returns many pages. Some of these pages may be for people with the same name. Ho...
Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuk...
LAWEB
2003
IEEE
14 years 1 months ago
On the Evolution of Clusters of Near-Duplicate Web Pages
This paper expands on a 1997 study of the amount and distribution of near-duplicate pages on the World Wide Web. We downloaded a set of 150 million web pages on a weekly basis ove...
Dennis Fetterly, Mark Manasse, Marc Najork