Search Sciweavers | Sciweavers

543 search results - page 14 / 109

» Exploiting content redundancy for web information extraction

126

click to vote

CIKM
2008
Springer

194views Information Technology» more CIKM 2008»

Coreex: content extraction from online news articles

15 years 6 months ago

Download ilpubs.stanford.edu

We developed and tested a heuristic technique for extracting the main article from news site Web pages. We construct the DOM tree of the page and score every node based on the amo...

Jyotika Prasad, Andreas Paepcke

claim paper

Read More »

159

click to vote

LREC
2008

110views Education» more LREC 2008»

Unsupervised and Domain Independent Ontology Learning: Combining Heterogeneous Sources of Evidence

15 years 5 months ago

Download www.lrec-conf.org

Acquiring knowledge from the Web to build domain ontologies has become a common practice in the Ontological Engineering field. The vast amount of freely available information allo...

David Manzano-Macho, Asunción Gómez-...

claim paper

Read More »

159

click to vote

WWW
2007
ACM

144views Internet Technology» more WWW 2007»

Towards domain-independent information extraction from web tables

16 years 5 months ago

Download www2007.org

Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. A mul...

Bernhard Krüpl, Bernhard Pollak, Marcus Herzo...

claim paper

Read More »

144

click to vote

WWW
2010
ACM

257views Internet Technology» more WWW 2010»

CETR: content extraction via tag ratios

15 years 11 months ago

Download www.cs.illinois.edu

We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...

Tim Weninger, William H. Hsu, Jiawei Han

claim paper

Read More »

147

click to vote

UCS
2007
Springer

240views Applied Computing» more UCS 2007»

DroPicks - A Tool for Collaborative Content Sharing Exploiting Everyday Artefacts

15 years 10 months ago

Download www.mediateam.oulu.fi

Emergence of social web services like YouTube[1], Flickr[2] etc. is constantly transforming the way we share our lifestyles with family, friends and colleagues. The significance of...

Simo Hosio, Fahim Kawsar, Jukka Riekki, Tatsuo Nak...

claim paper

Read More »

« Prev « First page 14 / 109 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers