Search Sciweavers | Sciweavers

543 search results - page 8 / 109

» Exploiting content redundancy for web information extraction

128

click to vote

WWW
2009
ACM

142views Internet Technology» more WWW 2009»

Estimating web site readability using content extraction

16 years 4 months ago

Download www2009.eprints.org

Nowadays, information is primarily searched on the WWW. From a user perspective, the readability is an important criterion for measuring the accessibility and thereby the quality ...

Thomas Gottron, Ludger Martin

claim paper

Read More »

128

click to vote

WSDM
2010
ACM

265views Data Mining» more WSDM 2010»

Data-oriented Content Query System: Searching for Data into Text on the Web

16 years 1 months ago

Download www.ews.uiuc.edu

As the Web provides rich data embedded in the immense contents inside pages, we witness many ad-hoc efforts for exploiting fine granularity information across Web text, such as We...

Kevin Chen-Chuan Chang, Mianwei Zhou, Tao Cheng

claim paper

Read More »

134

click to vote

WWW
2005
ACM

150views Internet Technology» more WWW 2005»

Extracting context to improve accuracy for HTML content extraction

16 years 4 months ago

Download www1.cs.columbia.edu

Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...

Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo

claim paper

Read More »

133

click to vote

ICDM
2008
IEEE

143views Data Mining» more ICDM 2008»

Exploiting Data Semantics to Discover, Extract, and Model Web Sources

15 years 10 months ago

Download www.isi.edu

We describe DEIMOS, a system that automatically discovers and models new sources of information. The system exploits four core technologies developed by our group that makes an en...

José Luis Ambite, Craig A. Knoblock, Kristi...

claim paper

Read More »

124

click to vote

WWW
2003
ACM

149views Internet Technology» more WWW 2003»

Annotating Web pages for the needs of Web Information Extraction Applications

16 years 4 months ago

Download cgi.di.uoa.gr

This paper outlines our approach to the creation of annotated corpora for the purposes of Web Information Extraction, and presents the Web Annotation tool. This tool enables the a...

Georgios Sigletos, Dimitra Farmakiotou, Konstantin...

claim paper

Read More »

« Prev « First page 8 / 109 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers