Sciweavers

2553 search results - page 377 / 511
» How-To Web Pages
Sort
View
SIGIR
2009
ACM
14 years 4 months ago
Building enriched document representations using aggregated anchor text
It is well known that anchor text plays a critical role in a variety of search tasks performed over hypertextual domains, including enterprise search, wiki search, and web search....
Donald Metzler, Jasmine Novak, Hang Cui, Srihari R...
IPPS
2008
IEEE
14 years 4 months ago
Multi-threaded data mining of EDGAR CIKs (Central Index Keys) from ticker symbols
This paper describes how use the Java Swing HTMLEditorKit to perform multi-threaded web data mining on the EDGAR system (Electronic DataGathering, Analysis, and Retrieval system)....
Dougal A. Lyon
AIRWEB
2007
Springer
14 years 4 months ago
Transductive Link Spam Detection
Web spam can significantly deteriorate the quality of search engines. Early web spamming techniques mainly manipulate page content. Since linkage information is widely used in we...
Dengyong Zhou, Chris Burges, Tao Tao
HICSS
2005
IEEE
144views Biometrics» more  HICSS 2005»
14 years 3 months ago
Learning with Weblogs: An Empirical Investigation
The study investigates the impact of weblog use on individual learning in a university environment. Weblogs are a relatively new knowledge sharing technology, which enables people...
Helen S. Du, Christian Wagner
VLDB
2004
ACM
113views Database» more  VLDB 2004»
14 years 3 months ago
Accurate and Efficient Crawling for Relevant Websites
Focused web crawlers have recently emerged as an alternative to the well-established web search engines. While the well-known focused crawlers retrieve relevant webpages, there ar...
Martin Ester, Hans-Peter Kriegel, Matthias Schuber...