Sciweavers

IPM
2008
141views more  IPM 2008»
14 years 13 days ago
Towards a unified approach to document similarity search using manifold-ranking of blocks
Document similarity search (i.e. query by example) aims to retrieve a ranked list of documents similar to a query document in a text corpus or on the Web. Most existing approaches...
Xiaojun Wan, Jianwu Yang, Jianguo Xiao
BMCBI
2007
163views more  BMCBI 2007»
14 years 13 days ago
A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluati
Background: A huge amount of biomedical textual information has been produced and collected in MEDLINE for decades. In order to easily utilize biomedical information in the free t...
Illhoi Yoo, Xiaohua Hu, Il-Yeol Song
IJDAR
2008
92views more  IJDAR 2008»
14 years 14 days ago
Mobile Retriever: access to digital documents from their physical source
In this paper we describe an image based document retrieval system which runs on camera enabled mobile devices. "Mobile Retriever" aims to seamlessly link physical and di...
Xu Liu, David S. Doermann
ENTCS
2006
116views more  ENTCS 2006»
14 years 14 days ago
How Recent is a Web Document?
One of the most important aspects of a Web document is its up-to-dateness or recency. Up-to-dateness is particularly relevant to Web documents because they usually contain content...
Bo Hu, Florian Lauck, Jan Scheffczyk
CORR
2006
Springer
100views Education» more  CORR 2006»
14 years 14 days ago
Authorised Translations of Electronic Documents
A concept is proposed to extend authorised translations of documents to electronically signed, digital documents. Central element of the solution is an electronic seal, embodied a...
Jan Piechalski, Andreas U. Schmidt
CORR
2006
Springer
71views Education» more  CORR 2006»
14 years 14 days ago
Using NLP to build the hypertextuel network of a back-of-the-book index
Relying on the idea that back-of-the-book indexes are traditional devices for navigation through large documents, we have developed a method to build a hypertextual network that h...
Touria Aït El Mekki, Adeline Nazarenko
CORR
2006
Springer
100views Education» more  CORR 2006»
14 years 14 days ago
Automatic annotation of multilingual text collections with a conceptual thesaurus
Automatic annotation of documents with controlled vocabulary terms (descriptors) from a conceptual thesaurus is not only useful for document indexing and retrieval. The mapping of...
Bruno Pouliquen, Ralf Steinberger, Camelia Ignat
CORR
2006
Springer
178views Education» more  CORR 2006»
14 years 14 days ago
A tool set for the quick and efficient exploration of large document collections
: We are presenting a set of multilingual text analysis tools that can help analysts in any field to explore large document collections quickly in order to determine whether the do...
Camelia Ignat, Bruno Pouliquen, Ralf Steinberger, ...
CLEIEJ
2008
72views more  CLEIEJ 2008»
14 years 16 days ago
Measuring Contribution of HTML Features in Web Document Clustering
Documents in HTML format have many features to analyze, from the terms in special sections to the phrases that appear in the whole document. However, it is important to decide whi...
Esteban Meneses, Oldemar Rodríguez-Rojas
CORR
2010
Springer
106views Education» more  CORR 2010»
14 years 16 days ago
The WebContent XML Store
In this article, we describe the XML storage system used in the WebContent project. We begin by advocating the use of an XML database in order to store WebContent documents, and w...
Benjamin Nguyen, Spyros Zoupanos