document collections

234

SDM
2012
SIAM

247views Data Mining» more SDM 2012»

Simplex Distributions for Embedding Data Matrices over Time

13 years 9 months ago

Early stress recognition is of great relevance in precision plant protection. Pre-symptomatic water stress detection is of particular interest, ultimately helping to meet the chal...

Kristian Kersting, Mirwaes Wahabzada, Christoph R&...

claim paper

Read More »

207

click to vote

ICDAR
2011
IEEE

199views Document Analysis» more ICDAR 2011»

Word Retrieval in Historical Document Using Character-Primitives

14 years 6 months ago

Download www.icdar2011.org

Word searching and indexing in historical document collections is a challenging problem because, characters in these documents are often touching or broken due to degradation/agei...

Partha Pratim Roy, Jean-Yves Ramel, Nicolas Ragot

claim paper

Read More »

197

click to vote

ICDAR
2011
IEEE

213views Document Analysis» more ICDAR 2011»

Browsing Heterogeneous Document Collections by a Segmentation-Free Word Spotting Method

14 years 6 months ago

Download www.icdar2011.org

—In this paper, we present a segmentation-free word spotting method that is able to deal with heterogeneous document image collections. We propose a patch-based framework where p...

Marçal Rusiñol, David Aldavert, Rica...

claim paper

Read More »

179

click to vote

AAAI
2011

139views Intelligent Agents» more AAAI 2011»

Exploiting Phase Transition in Latent Networks for Clustering

14 years 6 months ago

Download www-personal.umich.edu

In this paper, we model the pair-wise similarities of a set of documents as a weighted network with a single cutoff parameter. Such a network can be thought of an ensemble of unwe...

Vahed Qazvinian, Dragomir R. Radev

claim paper

Read More »

191

click to vote

EMNLP
2010

170views Natural Language Processing» more EMNLP 2010»

Staying Informed: Supervised and Semi-Supervised Multi-View Topical Analysis of Ideological Perspective

15 years 4 months ago

Download www.cs.cmu.edu

With the proliferation of user-generated articles over the web, it becomes imperative to develop automated methods that are aware of the ideological-bias implicit in a document co...

Amr Ahmed, Eric P. Xing

claim paper

Read More »

197

click to vote

JUCS
2008

167views more JUCS 2008»

A Generic Architecture for the Conversion of Document Collections into Semantically Annotated Digital Archives

15 years 6 months ago

Download www.jucs.org

: Mass digitization of document collections with further processing and semantic annotation is an increasing activity among libraries and archives at large for preservation, browsi...

Josep Lladós, Dimosthenis Karatzas, Joan Ma...

claim paper

Read More »

182

click to vote

CORR
2006
Springer

132views Education» more CORR 2006»

Navigating multilingual news collections using automatically extracted information

15 years 6 months ago

Download langtech.jrc.it

We are presenting a text analysis tool set that allows analysts in various fields to sieve through large collections of multilingual news items quickly and to find information that...

Ralf Steinberger, Bruno Pouliquen, Camelia Ignat

claim paper

Read More »

180

click to vote

AVI
2000

174views Software Engineering» more AVI 2000»

A Modular Approach for Exploring the Semantic Structure of Technical Document Collections

15 years 8 months ago

Download www-i5.informatik.rwth-aachen.de

The identification and analysis of an enterprise's knowledge available in a documented form is a key element of knowledge management. Visual methods which allow easy access t...

Andreas Becks, Stefan Sklorz, Matthias Jarke

claim paper

Read More »

170

click to vote

ACL
2006

99views Computational Linguistics» more ACL 2006»

Are These Documents Written from Different Perspectives? A Test of Different Perspectives Based on Statistical Distribution Dive

15 years 8 months ago

Download acl.ldc.upenn.edu

In this paper we investigate how to automatically determine if two document collections are written from different perspectives. By perspectives we mean a point of view, for examp...

Wei-Hao Lin, Alexander G. Hauptmann

claim paper

Read More »

202

click to vote

SDM
2007
SIAM

187views Data Mining» more SDM 2007»

Topic Models over Text Streams: A Study of Batch and Online Unsupervised Learning

15 years 8 months ago

Download www-users.cs.umn.edu

Topic modeling techniques have widespread use in text data mining applications. Some applications use batch models, which perform clustering on the document collection in aggregat...

Arindam Banerjee, Sugato Basu

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers