Sciweavers

1287 search results - page 178 / 258
» Integrating document and data retrieval based on XML
Sort
View
DGO
2006
134views Education» more  DGO 2006»
13 years 10 months ago
Next steps in near-duplicate detection for eRulemaking
Large volume public comment campaigns and web portals that encourage the public to customize form letters produce many near-duplicate documents, which increases processing and sto...
Hui Yang, Jamie Callan, Stuart W. Shulman
JAIR
2008
173views more  JAIR 2008»
13 years 9 months ago
Creating Relational Data from Unstructured and Ungrammatical Data Sources
In order for agents to act on behalf of users, they will have to retrieve and integrate vast amounts of textual data on the World Wide Web. However, much of the useful data on the...
Matthew Michelson, Craig A. Knoblock
CIKM
2003
Springer
14 years 2 months ago
Tracking changes in user interests with a few relevance judgments
Keeping track of changes in user interests from a document stream with a few relevance judgments is not an easy task. To tackle this problem, we propose a novel method that integr...
Dwi H. Widyantoro, Thomas R. Ioerger, John Yen
SIGMOD
1998
ACM
127views Database» more  SIGMOD 1998»
14 years 1 months ago
ARIADNE: A System for Constructing Mediators for Internet Sources
The Web is based on a browsing paradigm that makes it di cult to retrieve and integrate data from multiple sites. Today, the only way to achieve this integration is by building sp...
José Luis Ambite, Naveen Ashish, Greg Baris...
EDBT
2006
ACM
106views Database» more  EDBT 2006»
14 years 9 months ago
DPTree: A Distributed Pattern Tree Index for Partial-Match Queries in Peer-to-Peer Networks
Abstract. Partial-match queries return data items that contain a subset of the query keywords and order the results based on the statistical properties of the matched keywords. The...
Dyce Jing Zhao, Dik Lun Lee, Qiong Luo