Sciweavers

502 search results - page 9 / 101
» Extracting Partial Structures from HTML Documents
Sort
View
CICLING
2009
Springer
14 years 9 months ago
Business Specific Online Information Extraction from German Websites
This paper presents a system that uses the domain name of a German business website to locate its information pages (e.g. company profile, contact page, imprint) and then identifi...
Yeong Su Lee, Michaela Geierhos
WEBI
2007
Springer
14 years 2 months ago
Question Answering over Implicitly Structured Web Content
Implicitly structured content on the Web such as HTML tables and lists can be extremely valuable for web search, question answering, and information retrieval, as the implicit str...
Eugene Agichtein, Chris Burges, Eric Brill
IPM
2007
149views more  IPM 2007»
13 years 8 months ago
Web page title extraction and its application
This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields...
Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shu...
ICWE
2010
Springer
13 years 7 months ago
Partial Information Extraction Approach to Lightweight Integration on the Web
Abstract. We present partial information extraction approach to lightweight integration on the Web. Our approach allows us to extract dynamic contents created by scripts as well as...
Junxia Guo, Prach Chaisatien, Hao Han, Tomoya Noro...
ISMIS
2003
Springer
14 years 1 months ago
MetaNews: An Information Agent for Gathering News Articles on the Web
This paper presents MetaNews, an information gathering agent for news articles on the Web. MetaNews reads HTML documents from online news sites and extracts article information fro...
Dae-Ki Kang, Joongmin Choi