Sciweavers

2763 search results - page 399 / 553
» Retrieval of Ottoman documents
Sort
View
SIGIR
2003
ACM
15 years 9 months ago
Fractal summarization: summarization based on fractal theory
In this paper, we introduce the fractal summarization model based on the fractal theory. In fractal summarization, the important information is captured from the source text by ex...
Christopher C. Yang, Fu Lee Wang
DEXAW
1999
IEEE
95views Database» more  DEXAW 1999»
15 years 8 months ago
An XML-Based, 3-Tier Scheme for Integrating Heterogeneous Information Sources to the WWW
The phenomenal growth that the WWW currently experiences necessitates the integration of various types of information sources to its platform. We present an open, extensible multi...
Costas Petrou, Stathes Hadjiefthymiades, Drakoulis...
DIS
2007
Springer
15 years 10 months ago
Unsupervised Spam Detection Based on String Alienness Measures
We propose an unsupervised method for detecting spam documents from Web page data, based on equivalence relations on strings. We propose 3 measures for quantifying the alienness (...
Kazuyuki Narisawa, Hideo Bannai, Kohei Hatano, Mas...
JCDL
2004
ACM
121views Education» more  JCDL 2004»
15 years 10 months ago
Enabling interoperability for autonomous digital libraries: an API to citeseer services
We introduce CiteSeer-API, a public API to CiteSeer-like services. CiteSeer-API is SOAP/WSDL based and allows for easy programatical access to all the specific functionalities off...
Yves Petinot, C. Lee Giles, Vivek Bhatnagar, Prade...
BTW
2003
Springer
140views Database» more  BTW 2003»
15 years 9 months ago
An Ontology for Domain-oriented Semantic Similarity Search on XML Data
Abstract: Query languages for XML such as XPath or XQuery support Boolean retrieval where a query result is a (possibly restructured) subset of XML elements or entire documents tha...
Anja Theobald