Search Sciweavers | Sciweavers

910 search results - page 22 / 182

» Testbed for information extraction from deep web

119

click to vote

CIKM
2003
Springer

129views Information Technology» more CIKM 2003»

Extracting unstructured data from template generated web documents

15 years 9 months ago

Download www.ir.iit.edu

We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...

Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...

claim paper

Read More »

126

click to vote

WWW
2006
ACM

142views Internet Technology» more WWW 2006»

Extracting news-related queries from web query log

16 years 5 months ago

Download download.yandex.ru

In this poster, we present a method for extracting queries related to real-life events, or news-related queries, from large web query logs. The method employs query frequencies an...

Michael Maslov, Alexander Golovko, Ilya Segalovich...

claim paper

Read More »

155

click to vote

NIPS
2007

190views Information Technology» more NIPS 2007»

Sparse Feature Learning for Deep Belief Networks

15 years 6 months ago

Download www.cs.nyu.edu

Unsupervised learning algorithms aim to discover the structure hidden in the data, and to learn representations that are more suitable as input to a supervised machine than the ra...

Marc'Aurelio Ranzato, Y-Lan Boureau, Yann LeCun

claim paper

Read More »

145

Voted

WWW
2005
ACM

153views Internet Technology» more WWW 2005»

METEOR: metadata and instance extraction from object referral lists on the web

16 years 5 months ago

Download www2005.org

The Web has established itself as the largest public data repository ever available. Even though the vast majority of information on the Web is formatted to be easily readable by ...

Hasan Davulcu, Srinivas Vadrevu, Saravanakumar Nag...

claim paper

Read More »

117

click to vote

ACL
2010

115views Computational Linguistics» more ACL 2010»

Extracting Sequences from the Web

15 years 2 months ago

Download turing.cs.washington.edu

Classical Information Extraction (IE) systems fill slots in domain-specific frames. This paper reports on SEQ, a novel open IE system that leverages a domainindependent frame to e...

Anthony Fader, Stephen Soderland, Oren Etzioni

claim paper

Read More »

« Prev « First page 22 / 182 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers