Search Sciweavers | Sciweavers

330 search results - page 5 / 66

» Unexpected results in automatic list extraction on the web

WWW
2010
ACM

300 views, Internet Technology

Automatic extraction of clickable structured web contents for name entity queries

14 years 2 months ago

Download research.microsoft.com

Today the major web search engines answer queries by showing ten result snippets, which need to be inspected by users for identifying relevant results. In this paper we investigat...

Xiaoxin Yin, Wenzhao Tan, Xiao Li, Yi-Chin Tu

LREC
2010

201 views, Education

Cultural Heritage: Knowledge Extraction from Web Documents

13 years 9 months ago

Download www.lrec-conf.org

This article presents the use of NLP techniques (text mining, text analysis) to develop specific tools for creating linguistic resources related to the cultural heritage d...

Eva Sassolini, Alessandra Cinini

WWW
2009
ACM

189 views, Internet Technology

Extracting data records from the web using tag path clustering

14 years 8 days ago

Download www2009.org

Fully automatic methods that extract lists of objects from the Web have been studied extensively. Record extraction, the first step of this object extraction process, identifies...

Gengxin Miao, Jun'ichi Tatemura, Wang-Pin Hsiung, ...

VLDB
2001
ACM

144 views, Database

RoadRunner: Towards Automatic Data Extraction from Large Web Sites

14 years 1 days ago

Download www.vldb.org

The paper investigates techniques for extracting data from HTML sites through the use of automatically generated wrappers. To automate the wrapper generation and the data extracti...

Valter Crescenzi, Giansalvatore Mecca, Paolo Meria...

WWW
2004
ACM

101 views, Internet Technology

Web-scale information extraction in KnowItAll: (preliminary results)

14 years 8 months ago

Download turing.cs.washington.edu

Manually querying search engines in order to accumulate a large body of factual information is a tedious, error-prone process of piecemeal search. Search engines retrieve and rank...

Oren Etzioni, Michael J. Cafarella, Doug Downey, S...
