Sciweavers

232 search results - page 42 / 47
» Query-related data extraction of hidden web documents
Sort
View
WSDM
2010
ACM
204views Data Mining» more  WSDM 2010»
14 years 2 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...
ICDIM
2006
IEEE
14 years 1 months ago
Creating an Historical Archive Ontology: Guidelines and Evaluation
Ontologies have been proven invaluable tools both for the semantic web and for personal information management. In the context of a historical archive an ontology may provide mean...
Elena Torou, Akrivi Katifori, Costas Vassilakis, G...
FINTAL
2006
13 years 11 months ago
A Text Mining Approach for Definition Question Answering
This paper describes a method for definition question answering based on the use of surface text patterns. The method is specially suited to answer questions about person's po...
Claudia Denicia-Carral, Manuel Montes-y-Góm...
VLDB
2002
ACM
91views Database» more  VLDB 2002»
13 years 7 months ago
DTD-Directed Publishing with Attribute Translation Grammars
We present a framework for publishing relational data in XML with respect to a fixed DTD. In data exchange on the Web, XML views of relational data are typically required to confo...
Michael Benedikt, Chee Yong Chan, Wenfei Fan, Raje...
CIKM
2005
Springer
14 years 1 months ago
Versatile structural disambiguation for semantic-aware applications
In this paper, we propose a versatile disambiguation approach which can be used to make explicit the meaning of structure based information such as XML schemas, XML document struc...
Federica Mandreoli, Riccardo Martoglia, Enrico Ron...