Sciweavers

498 search results - page 9 / 100
» Robust web content extraction
Sort
View
ECAI
2008
Springer
13 years 8 months ago
Automating Accreditation of Medical Web Content
123456 The increasing amount of freely available healthrelated web content generates, on one hand, excellent conditions for self-education of patients as well as physicians, but on...
Vangelis Karkaletsis, Pythagoras Karampiperis, Kon...
WWW
2005
ACM
14 years 7 months ago
Thresher: automating the unwrapping of semantic content from the World Wide Web
We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...
Andrew Hogue, David R. Karger
DSN
2009
IEEE
14 years 1 months ago
Dynamic content web applications: Crash, failover, and recovery analysis
This work assesses how crashes and recoveries affect the performance of a replicated dynamic content web application. RobustStore is the result of retrofitting TPC-W’s on-line ...
Luiz Eduardo Buzato, Gustavo M. D. Vieira, Willy Z...
WEBI
2007
Springer
14 years 25 days ago
Question Answering over Implicitly Structured Web Content
Implicitly structured content on the Web such as HTML tables and lists can be extremely valuable for web search, question answering, and information retrieval, as the implicit str...
Eugene Agichtein, Chris Burges, Eric Brill
WWW
2007
ACM
14 years 7 months ago
Robust web page segmentation for mobile terminal using content-distances and page layout information
The demand of browsing information from general Web pages using a mobile phone is increasing. However, since the majority of Web pages on the Internet are optimized for browsing f...
Gen Hattori, Keiichiro Hoashi, Kazunori Matsumoto,...