Sciweavers

1947 search results - page 25 / 390
» On the Automatic Extraction of Data from the Hidden Web
PAAMS
2010
Springer
13 years 6 months ago
Variable Length-Based Genetic Representation to Automatically Evolve Wrappers
The Web has been the star service on the Internet, however the outsized information available and its decentralized nature has originated an intrinsic difficulty to locate, extract...
David F. Barrero, Antonio González-Pardo, M...
PVLDB
2010
13 years 3 months ago
A Probabilistic Approach for Automatically Filling Form-Based Web Interfaces
In this paper we present a proposal for the implementation and evaluation of a novel method for automatically using data-rich text for filling form-based input interfaces. Our sol...
Guilherme A. Toda, Eli Cortez, Altigran Soares da ...
WEBDB
2005
Springer
14 years 2 months ago
Searching for Hidden-Web Databases
Recently, there has been increased interest in the retrieval and integration of hidden Web data with a view to leverage high-quality information available in online databases. Alt...
Luciano Barbosa, Juliana Freire
NAACL
2010
13 years 6 months ago
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment
The quality of a statistical machine translation (SMT) system is heavily dependent upon the amount of parallel sentences used in training. In recent years, there have been several...
Jason R. Smith, Chris Quirk, Kristina Toutanova
IRI
2007
IEEE
14 years 2 months ago
Acronym-Expansion Recognition and Ranking on the Web
The paper presents a study on large-scale automatic extraction of acronyms and associated expansions from Web data and from the user interactions with this data through Web search...
Alpa Jain, Silviu Cucerzan, Saliha Azzam