Search Sciweavers | Sciweavers

116 search results - page 3 / 24

» A machine learning approach to web page filtering using cont...

click to vote

LREC
2008

160views Education» more LREC 2008»

Automatic Extraction of Textual Elements from News Web Pages

13 years 9 months ago

Download www.lrec-conf.org

In this paper we present an algorithm for automatic extraction of textual elements, namely titles and full text, associated with news stories in news web pages. We propose a super...

Hossam Ibrahim, Kareem Darwish, Abdel-Rahim Madany

claim paper

Read More »

click to vote

WWW
2005
ACM

144views Internet Technology» more WWW 2005»

Finding the boundaries of information resources on the web

14 years 1 months ago

Download www2005.org

In recent years, many algorithms for the Web have been developed that work with information units distinct from individual web pages. These include segments of web pages or aggreg...

Pavel Dmitriev, Carl Lagoze, Boris Suchkov

claim paper

Read More »

click to vote

WWW
2010
ACM

300views Internet Technology» more WWW 2010»

Automatic extraction of clickable structured web contents for name entity queries

14 years 2 months ago

Download research.microsoft.com

Today the major web search engines answer queries by showing ten result snippets, which need to be inspected by users for identifying relevant results. In this paper we investigat...

Xiaoxin Yin, Wenzhao Tan, Xiao Li, Yi-Chin Tu

claim paper

Read More »

click to vote

SOCIALCOM
2010

175views Security Privacy» more SOCIALCOM 2010»

Using Text Analysis to Understand the Structure and Dynamics of the World Wide Web as a Multi-Relational Graph

13 years 5 months ago

Download www.cis.temple.edu

A representation of the World Wide Web as a directed graph, with vertices representing web pages and edges representing hypertext links, underpins the algorithms used by web search...

Harish Sethu, Alexander Yates

claim paper

Read More »

click to vote

SIGIR
2005
ACM

156views Information Technology» more SIGIR 2005»

Title extraction from bodies of HTML documents and its application to web page retrieval

14 years 1 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...

claim paper

Read More »

« Prev « First page 3 / 24 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers