Sciweavers

146 search results - page 21 / 30
» WebBase: a repository of Web pages
Sort
View
WWW
2007
ACM
14 years 9 months ago
Towards extracting flickr tag semantics
We address the problem of extracting semantics of tags ? short, unstructured text-labels assigned to resources on the Web ? based on each tag's metadata patterns. In particul...
Tye Rattenbury, Nathan Good, Mor Naaman
INTERACT
1997
13 years 10 months ago
GroupScape: Integrating Synchronous Groupware and the World Wide Web
Synchronous groupware applications support people collaborating in real time over a distance. The world wide web supports asynchronous collaboration by allowing people to share dis...
T. C. Nicholas Graham
WWW
2004
ACM
14 years 9 months ago
Automatic web news extraction using tree edit distance
The Web poses itself as the largest data repository ever available in the history of humankind. Major efforts have been made in order to provide efficient access to relevant infor...
Davi de Castro Reis, Paulo Braz Golgher, Altigran ...
ICSE
2012
IEEE-ACM
11 years 11 months ago
Detecting similar software applications
—Although popular text search engines allow users to retrieve similar web pages, source code search engines do not have this feature. Detecting similar applications is a notoriou...
Collin McMillan, Mark Grechanik, Denys Poshyvanyk
CIKM
2009
Springer
14 years 3 months ago
Graph-based seed selection for web-scale crawlers
One of the most important steps in web crawling is determining the starting points, or seed selection. This paper identifies and explores the problem of seed selection in webscal...
Shuyi Zheng, Pavel Dmitriev, C. Lee Giles