Sciweavers

58 search results - page 5 / 12
» Analyzing Large Collections of Electronic Text Using OLAP
Sort
View
SDM
2007
SIAM
187views Data Mining» more  SDM 2007»
13 years 9 months ago
Topic Models over Text Streams: A Study of Batch and Online Unsupervised Learning
Topic modeling techniques have widespread use in text data mining applications. Some applications use batch models, which perform clustering on the document collection in aggregat...
Arindam Banerjee, Sugato Basu
CIMCA
2005
IEEE
14 years 1 months ago
Topic-Based Audience Metrics for Internet Marketing by Combining Ontologies and Output Page Mining
In Internet marketing, Web audience analysis is essential to understanding the visitors’ needs. However, the existing analysis tools fail to deliver summarized and conceptual me...
Jean-Pierre Norguet, Esteban Zimányi
SIGIR
2005
ACM
14 years 1 months ago
Indexing emails and email threads for retrieval
Electronic mail poses a number of unusual challenges for the design of information retrieval systems and test collections, including informal expression, conversational structure,...
Yejun Wu, Douglas W. Oard
ICDAR
1995
IEEE
13 years 11 months ago
Efficient analysis of complex diagrams using constraint-based parsing
This paper describes substantial advances in the analysis (parsing) of diagrams using constraint grammars. The addition of set types to the grammar and spatial indexing of the dat...
Robert P. Futrelle, Nikos Nikolakis
LREC
2008
139views Education» more  LREC 2008»
13 years 9 months ago
Words in Contexts: Digital Editions of Literary Journals in the "AAC - Austrian Academy Corpus"
In this paper two highly innovative digital editions will be presented. For the creation and the implementation of these editions the latest developments within corpus research ha...
Hanno Biber, Evelyn Breiteneder, Karlheinz Mö...