Sciweavers

8479 search results - page 56 / 1696
» Data Extraction from Web Data Sources
Sort
View
ACL
2006
13 years 11 months ago
Extracting Parallel Sub-Sentential Fragments from Non-Parallel Corpora
We present a novel method for extracting parallel sub-sentential fragments from comparable, non-parallel bilingual corpora. By analyzing potentially similar sentence pairs using a...
Dragos Stefan Munteanu, Daniel Marcu
SIGMOD
2000
ACM
236views Database» more  SIGMOD 2000»
14 years 2 months ago
XTRACT: A System for Extracting Document Type Descriptors from XML Documents
XML is rapidly emerging as the new standard for data representation and exchange on the Web. An XML document can be accompanied by a Document Type Descriptor (DTD) which plays the...
Minos N. Garofalakis, Aristides Gionis, Rajeev Ras...
SEMWEB
2007
Springer
14 years 4 months ago
Revyu.com: a Reviewing and Rating Site for the Web of Data
Revyu.com is a live, publicly accessible reviewing and rating Web site, designed to be usable by humans whilst transparently generating machinereadable RDF metadata for the Semanti...
Tom Heath, Enrico Motta
ICDE
2006
IEEE
146views Database» more  ICDE 2006»
14 years 11 months ago
Query Selection Techniques for Efficient Crawling of Structured Web Sources
The high quality, structured data from Web structured sources is invaluable for many applications. Hidden Web databases are not directly crawlable by Web search engines and are on...
Ping Wu, Ji-Rong Wen, Huan Liu, Wei-Ying Ma
IJCAI
2007
13 years 11 months ago
Learning Semantic Descriptions of Web Information Sources
The Internet is full of information sources providing various types of data from weather forecasts to travel deals. These sources can be accessed via web-forms, Web Services or RS...
Mark James Carman, Craig A. Knoblock