document | Sciweavers

200

IPM
2008

141views more IPM 2008»

Towards a unified approach to document similarity search using manifold-ranking of blocks

15 years 6 months ago

Document similarity search (i.e. query by example) aims to retrieve a ranked list of documents similar to a query document in a text corpus or on the Web. Most existing approaches...

Xiaojun Wan, Jianwu Yang, Jianguo Xiao

claim paper

Read More »

196

click to vote

BMCBI
2007

163views more BMCBI 2007»

A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluati

15 years 6 months ago

Download www.biomedcentral.com

Background: A huge amount of biomedical textual information has been produced and collected in MEDLINE for decades. In order to easily utilize biomedical information in the free t...

Illhoi Yoo, Xiaohua Hu, Il-Yeol Song

claim paper

Read More »

167

click to vote

IJDAR
2008

92views more IJDAR 2008»

Mobile Retriever: access to digital documents from their physical source

15 years 6 months ago

Download rii.ricoh.com

In this paper we describe an image based document retrieval system which runs on camera enabled mobile devices. "Mobile Retriever" aims to seamlessly link physical and di...

Xu Liu, David S. Doermann

claim paper

Read More »

177

click to vote

ENTCS
2006

116views more ENTCS 2006»

How Recent is a Web Document?

15 years 6 months ago

Download www.icsi.berkeley.edu

One of the most important aspects of a Web document is its up-to-dateness or recency. Up-to-dateness is particularly relevant to Web documents because they usually contain content...

Bo Hu, Florian Lauck, Jan Scheffczyk

claim paper

Read More »

153

click to vote

CORR
2006
Springer

100views Education» more CORR 2006»

Authorised Translations of Electronic Documents

15 years 6 months ago

Download icsa.cs.up.ac.za

A concept is proposed to extend authorised translations of documents to electronically signed, digital documents. Central element of the solution is an electronic seal, embodied a...

Jan Piechalski, Andreas U. Schmidt

claim paper

Read More »

127

click to vote

CORR
2006
Springer

71views Education» more CORR 2006»

Using NLP to build the hypertextuel network of a back-of-the-book index

15 years 6 months ago

Download hal.archives-ouvertes.fr

Relying on the idea that back-of-the-book indexes are traditional devices for navigation through large documents, we have developed a method to build a hypertextual network that h...

Touria Aït El Mekki, Adeline Nazarenko

claim paper

Read More »

171

click to vote

CORR
2006
Springer

100views Education» more CORR 2006»

Automatic annotation of multilingual text collections with a conceptual thesaurus

15 years 6 months ago

Download langtech.jrc.it

Automatic annotation of documents with controlled vocabulary terms (descriptors) from a conceptual thesaurus is not only useful for document indexing and retrieval. The mapping of...

Bruno Pouliquen, Ralf Steinberger, Camelia Ignat

claim paper

Read More »

192

click to vote

CORR
2006
Springer

178views Education» more CORR 2006»

A tool set for the quick and efficient exploration of large document collections

15 years 6 months ago

Download langtech.jrc.ec.europa.eu

: We are presenting a set of multilingual text analysis tools that can help analysts in any field to explore large document collections quickly in order to determine whether the do...

Camelia Ignat, Bruno Pouliquen, Ralf Steinberger, ...

claim paper

Read More »

138

click to vote

CLEIEJ
2008

72views more CLEIEJ 2008»

Measuring Contribution of HTML Features in Web Document Clustering

15 years 6 months ago

Download www.clei.cl

Documents in HTML format have many features to analyze, from the terms in special sections to the phrases that appear in the whole document. However, it is important to decide whi...

Esteban Meneses, Oldemar Rodríguez-Rojas

claim paper

Read More »

183

click to vote

CORR
2010
Springer

106views Education» more CORR 2010»

The WebContent XML Store

15 years 6 months ago

Download www-rocq.inria.fr

In this article, we describe the XML storage system used in the WebContent project. We begin by advocating the use of an XML database in order to store WebContent documents, and w...

Benjamin Nguyen, Spyros Zoupanos

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers