Sciweavers

502 search results - page 10 / 101
» Extracting Partial Structures from HTML Documents
DRR
2003
13 years 9 months ago
Automated labeling of bibliographic data extracted from biomedical online journals
A prototype system has been designed to automate the extraction of bibliographic data (e.g., article title, authors, affiliation and others) from online biomedical journals to p...
Jongwoo Kim, Daniel X. Le, George R. Thoma
APCCM
2009
13 years 9 months ago
Extracting and Modeling the Semantic Information Content of Web Documents to Support Semantic Document Retrieval
Existing HTML mark-up is used only to indicate the structure and lay-out of documents, but not the document semantics. As a result web documents are difficult to be semantically p...
Shahrul Azman Noah, Lailatulqadri Zakaria, Arifah ...
WWW
2005
ACM
14 years 9 months ago
Interactive web-wrapper construction for extracting relational information from web documents
In this paper, we propose a new user interface to interactively specify Web wrappers to extract relational information from Web documents. In this study, we focused on improving u...
Tsuyoshi Sugibuchi, Yuzuru Tanaka
FLAIRS
2001
13 years 9 months ago
Syntactic Folding and its Application to the Information Extraction from Web Pages
The paper deals with investigations concerning potential structures of documents that will be subject to automated information extraction. The focus is on folding principles and the...
Jörg Herrmann
TAL
2010
Springer
13 years 6 months ago
Portable Extraction of Partially Structured Facts from the Web
A novel fact extraction task is defined to fill a gap between current information retrieval and information extraction technologies. It is shown that it is possible to extract usef...
Andrew Salway, Liadh Kelly, Inguna Skadina, Gareth...