Search Sciweavers | Sciweavers

85 search results - page 4 / 17

» ECON: An Approach to Extract Content from Web News Page

click to vote

SIGIR
2005
ACM

156views Information Technology» more SIGIR 2005»

Title extraction from bodies of HTML documents and its application to web page retrieval

14 years 1 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...

claim paper

Read More »

click to vote

SIGIR
2000
ACM

160views Information Technology» more SIGIR 2000»

OCELOT: a system for summarizing Web pages

13 years 12 months ago

Download www.cs.cmu.edu

Abstract We introduce OCELOT, a prototype system for automatically generating the “gist” of a web page by summarizing it. Although most text summarization research to date has ...

Adam L. Berger, Vibhu O. Mittal

claim paper

Read More »

click to vote

SOFSEM
2007
Springer

156views Theoretical Computer Science» more SOFSEM 2007»

Creating Permanent Test Collections of Web Pages for Information Extraction Research

14 years 1 months ago

Download www.dbai.tuwien.ac.at

In the research area of automatic web information extraction, there is a need for permanent and annotated web page collections enabling objective performance evaluation of differen...

Bernhard Pollak, Wolfgang Gatterbauer

claim paper

Read More »

click to vote

DEXA
2006
Springer

197views Database» more DEXA 2006»

Cleaning Web Pages for Effective Web Content Mining

13 years 9 months ago

Download sol.cs.uwindsor.ca

Classifying and mining noise-free web pages will improve on accuracy of search results as well as search speed, and may benefit webpage organization applications (e.g., keyword-bas...

Jing Li, Christie I. Ezeife

claim paper

Read More »

click to vote

DOCENG
2009
ACM

139views Document Analysis» more DOCENG 2009»

Web document text and images extraction using DOM analysis and natural language processing

14 years 2 months ago

Download www.hpl.hp.com

: © Web Document Text and Images Extraction using DOM Analysis and Natural Language Processing Parag Mulendra Joshi, Sam Liu HP Laboratories HPL-2009-187 Web page text extraction,...

Parag Mulendra Joshi, Sam Liu

claim paper

Read More »

« Prev « First page 4 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers