Sciweavers

2677 search results - page 135 / 536
» Extracting Structured Data from Web Pages
Sort
View
122
Voted
MKM
2009
Springer
15 years 9 months ago
From Tessellations to Table Interpretation
The extraction of the relations of nested table headers to content cells is automated with a view to constructing narrow domain ontologies of semistructured web data. A taxonomy of...
Ramana C. Jandhyala, Mukkai S. Krishnamoorthy, Geo...
JCDL
2006
ACM
237views Education» more  JCDL 2006»
15 years 8 months ago
Automatic extraction of table metadata from digital documents
Tables are used to present, list, summarize, and structure important data in documents. In scholarly articles, they are often used to present the relationships among data and high...
Ying Liu, Prasenjit Mitra, C. Lee Giles, Kun Bai
BMCBI
2010
103views more  BMCBI 2010»
15 years 2 months ago
Predicting the protein-protein interactions using primary structures with predicted protein surface
Background: Many biological functions involve various protein-protein interactions (PPIs). Elucidating such interactions is crucial for understanding general principles of cellula...
Darby Tien-Hao Chang, Yu-Tang Syu, Po-Chang Lin
111
Voted
JCDL
2006
ACM
167views Education» more  JCDL 2006»
15 years 8 months ago
Combining DOM tree and geometric layout analysis for online medical journal article segmentation
We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...
Jie Zou, Daniel X. Le, George R. Thoma
PKDD
2004
Springer
91views Data Mining» more  PKDD 2004»
15 years 7 months ago
Summarization of Dynamic Content in Web Collections
This paper describes a new research proposal of multi-document summarization of dynamic content in web pages. Much information is lost in the Web due to the temporal character of w...
Adam Jatowt, Mitsuru Ishizuka