Sciweavers

1437 search results - page 174 / 288
» Content Extraction Signatures
Sort
View
WSDM
2010
ACM
215views Data Mining» more  WSDM 2010»
14 years 7 months ago
Boilerplate Detection using Shallow Text Features
In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...
Christian Kohlschütter, Peter Fankhauser, Wol...
ICDM
2009
IEEE
148views Data Mining» more  ICDM 2009»
14 years 4 months ago
Hierarchical Bayesian Models for Collaborative Tagging Systems
—Collaborative tagging systems with user generated content have become a fundamental element of websites such as Delicious, Flickr or CiteULike. By sharing common knowledge, mass...
Markus Bundschus, Shipeng Yu, Volker Tresp, Achim ...
MKM
2009
Springer
14 years 4 months ago
From Tessellations to Table Interpretation
The extraction of the relations of nested table headers to content cells is automated with a view to constructing narrow domain ontologies of semistructured web data. A taxonomy of...
Ramana C. Jandhyala, Mukkai S. Krishnamoorthy, Geo...
PRIB
2009
Springer
100views Bioinformatics» more  PRIB 2009»
14 years 4 months ago
Evidence-Based Clustering of Reads and Taxonomic Analysis of Metagenomic Data
Abstract. The rapidly emerging field of metagenomics seeks to examine the genomic content of communities of organisms to understand their roles and interactions in an ecosystem. I...
Gianluigi Folino, Fabio Gori, Mike S. M. Jetten, E...
ICPR
2008
IEEE
14 years 4 months ago
Interactive feature visualization for image retrieval
Most systems for content based image retrieval (CBIR) employ low level image features as a similarity measure. The problem of CBIR systems is that they are a “black box” to th...
Johannes Imo, Sebastian Klenk, Gunther Heidemann