Sciweavers

1605 search results - page 11 / 321
» Automatic Set Instance Extraction using the Web
Sort
View
DEBU
2000
95views more  DEBU 2000»
13 years 7 months ago
Accurately and Reliably Extracting Data from the Web: A Machine Learning Approach
A critical problem in developing information agents for the Web is accessing data that is formatted for human use. We have developed a set of tools for extracting data from web si...
Craig A. Knoblock, Kristina Lerman, Steven Minton,...
CN
1999
148views more  CN 1999»
13 years 7 months ago
Automatic RDF Metadata Generation for Resource Discovery
Automatic metadata generation may provide a solution to the problem of inconsistent, unreliable metadata describing resources on the Web. The Resource Description Framework (RDF [...
Charlotte Jenkins, Mike Jackson, Peter Burden, Jon...
SIGMOD
2003
ACM
190views Database» more  SIGMOD 2003»
14 years 26 days ago
Extracting Structured Data from Web Pages
Many web sites contain large sets of pages generated using a common template or layout. For example, Amazon lays out the author, title, comments, etc. in the same way in all its b...
Arvind Arasu, Hector Garcia-Molina
EMNLP
2008
13 years 9 months ago
A Casual Conversation System Using Modality and Word Associations Retrieved from the Web
In this paper we present a textual dialogue system that uses word associations retrieved from the Web to create propositions. We also show experiment results for the role of modal...
Shinsuke Higuchi, Rafal Rzepka, Kenji Araki
CIKM
2009
Springer
14 years 2 months ago
Helping editors choose better seed sets for entity set expansion
Sets of named entities are used heavily at commercial search engines such as Google, Yahoo and Bing. Acquiring sets of entities typically consists of combining semi-supervised exp...
Vishnu Vyas, Patrick Pantel, Eric Crestan