Search Sciweavers | Sciweavers

684 search results - page 32 / 137

» Extracting semantic structure of web documents using content...

112

click to vote

WWW
2005
ACM

173views Internet Technology» more WWW 2005»

Automatically learning document taxonomies for hierarchical classification

16 years 4 months ago

Download www.ideal.ece.utexas.edu

While several hierarchical classification methods have been applied to web content, such techniques invariably rely on a pre-defined taxonomy of documents. We propose a new techni...

Kunal Punera, Suju Rajan, Joydeep Ghosh

claim paper

Read More »

155

click to vote

EDBTW
2010
Springer

139views Software Engineering» more EDBTW 2010»

Using visual pages analysis for optimizing web archiving

15 years 2 months ago

Download www-poleia.lip6.fr

Due to the growing importance of the World Wide Web, archiving it has become crucial for preserving useful source of information. To maintain a web archive up-to-date, crawlers ha...

Myriam Ben Saad, Stéphane Gançarski

claim paper

Read More »

121

click to vote

WEBI
2005
Springer

127views Internet Technology» more WEBI 2005»

Automated Metadata and Instance Extraction from News Web Sites

15 years 9 months ago

Download www.public.asu.edu

In this paper, we present automated techniques for extracting metadata instance information by organizing and mining a set of news Web sites. We develop algorithms that detect and...

Srinivas Vadrevu, Saravanakumar Nagarajan, Fatih G...

claim paper

Read More »

130

click to vote

DOCENG
2009
ACM

130views Document Analysis» more DOCENG 2009»

From rhetorical structures to document structure: shallow pragmatic analysis for document engineering

15 years 10 months ago

Download www.miv.t.u-tokyo.ac.jp

In this paper, we extend previous work on the automatic structuring of medical documents using content analysis. Our long-term objective is to take advantage of specific rhetoric ...

Gersende Georg, Hugo Hernault, Marc Cavazza, Helmu...

claim paper

Read More »

130

click to vote

RIAO
2007

92views Information Technology» more RIAO 2007»

Using a Content-and-Structure Oriented Method for Relevance Feedback in XML Retrieval

15 years 5 months ago

Download riao.free.fr

As opposed to traditional Information Retrieval (IR) which views whole documents as atomic units of retrieval, XML IR processes XML elements as possible units of retrieval. Many o...

Lobna Hlaoua, Mohand Boughanem, Karen Pinel-Sauvag...

claim paper

Read More »

« Prev « First page 32 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers