Search Sciweavers | Sciweavers

910 search results - page 27 / 182

» Testbed for information extraction from deep web

140

click to vote

WWW
2007
ACM

224views Internet Technology» more WWW 2007»

EPCI: extracting potentially copyright infringement texts from the web

16 years 5 months ago

Download www2007.org

In this paper, we propose a new system extracting potentially copyright infringement texts from the Web, called EPCI. EPCI extracts them in the following way: (1) generating a set...

Takashi Tashiro, Takanori Ueda, Taisuke Hori, Yu H...

claim paper

Read More »

119

click to vote

WWW
2007
ACM

150views Internet Technology» more WWW 2007»

Adaptive record extraction from web pages

16 years 5 months ago

Download www2007.org

We describe an adaptive method for extracting records from web pages. Our algorithm combines a weighted tree matching metric with clustering for obtaining data extraction patterns...

Justin Park, Denilson Barbosa

claim paper

Read More »

138

click to vote

WISE
2005
Springer

130views Internet Technology» more WISE 2005»

Constructing Interface Schemas for Search Interfaces of Web Databases

15 years 10 months ago

Download www.cs.binghamton.edu

Many databases have become Web-accessible through form-based search interfaces (i.e., search forms) that allow users to specify complex and precise queries to access the underlying...

Hai He, Weiyi Meng, Clement T. Yu, Zonghuan Wu

claim paper

Read More »

146

click to vote

PKDD
2007
Springer

143views Data Mining» more PKDD 2007»

Using the Web to Reduce Data Sparseness in Pattern-Based Information Extraction

15 years 10 months ago

Download www.aifb.uni-karlsruhe.de

Textual patterns have been used effectively to extract information from large text collections. However they rely heavily on textual redundancy in the sense that facts have to be m...

Sebastian Blohm, Philipp Cimiano

claim paper

Read More »

115

click to vote

AI
2005
Springer

189views Artificial Intelligence» more AI 2005»

Unsupervised named-entity extraction from the Web: An experimental study

15 years 4 months ago

Download turing.cs.washington.edu

The KNOWITALL system aims to automate the tedious process of extracting large collections of facts (e.g., names of scientists or politicians) from the Web in an unsupervised, doma...

Oren Etzioni, Michael J. Cafarella, Doug Downey, A...

claim paper

Read More »

« Prev « First page 27 / 182 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers