Sciweavers

» Using the Structure of HTML Documents to Improve Retrieval
TLSDKCS
2010
13 years 2 months ago
Improving Retrievability and Recall by Automatic Corpus Partitioning
Abstract. With increasing volumes of data, much effort has been devoted to finding the most suitable answer to an information need. However, in many domains, the question whether a...
Shariq Bashir, Andreas Rauber
DOCENG
2007
ACM
13 years 11 months ago
Elimination of junk document surrogate candidates through pattern recognition
A surrogate is an object that stands for a document and enables navigation to that document. Hypermedia is often represented with textual surrogates, even though studies have show...
Eunyee Koh, Daniel Caruso, Andruid Kerne, Ricardo ...
ICML
2002
IEEE
14 years 8 months ago
Kernels for Semi-Structured Data
Semi-structured data such as XML and HTML is attracting considerable attention. It is important to develop various kinds of data mining techniques that can handle semistructured d...
Hisashi Kashima, Teruo Koyanagi
ERCIMDL
2006
Springer
13 years 11 months ago
The Use of Summaries in XML Retrieval
Abstract. The availability of the logical structure of documents in content-oriented XML retrieval can be beneficial for users of XML retrieval systems. However, research into struc...
Zoltán Szlávik, Anastasios Tombros, ...
ICSM
2002
IEEE
14 years 10 days ago
Documenting Pattern Use in Java Programs
Design patterns are widely recognized as important software development methods. Their use as software understanding tools, though generally acknowledged, has been scarcely explore...
Marco Torchiano