Sciweavers

Search results for "Testbed for information extraction from deep web": 910 results, page 22 of 182
CIKM
2003
Springer
14 years 1 month ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
WWW
2006
ACM
14 years 8 months ago
Extracting news-related queries from web query log
In this poster, we present a method for extracting queries related to real-life events, or news-related queries, from large web query logs. The method employs query frequencies an...
Michael Maslov, Alexander Golovko, Ilya Segalovich...
NIPS
2007
13 years 9 months ago
Sparse Feature Learning for Deep Belief Networks
Unsupervised learning algorithms aim to discover the structure hidden in the data, and to learn representations that are more suitable as input to a supervised machine than the ra...
Marc'Aurelio Ranzato, Y-Lan Boureau, Yann LeCun
WWW
2005
ACM
14 years 8 months ago
METEOR: metadata and instance extraction from object referral lists on the web
The Web has established itself as the largest public data repository ever available. Even though the vast majority of information on the Web is formatted to be easily readable by ...
Hasan Davulcu, Srinivas Vadrevu, Saravanakumar Nag...
ACL
2010
13 years 6 months ago
Extracting Sequences from the Web
Classical Information Extraction (IE) systems fill slots in domain-specific frames. This paper reports on SEQ, a novel open IE system that leverages a domain-independent frame to e...
Anthony Fader, Stephen Soderland, Oren Etzioni