Search Sciweavers | Sciweavers

213 search results - page 5 / 43

» Combining Statistics and Semantics for Word and Document Clu...

click to vote

NLDB
2007
Springer

94views Natural Language Processing» more NLDB 2007»

Selecting Labels for News Document Clusters

14 years 1 months ago

Download knoesis.wright.edu

This work deals with determination of meaningful and terse cluster labels for News document clusters. We analyze a number of alternatives for selecting headlines and/or sentences o...

Krishnaprasad Thirunarayan, Trivikram Immaneni, Ma...

claim paper

Read More »

click to vote

IJCAI
2007

142views Artificial Intelligence» more IJCAI 2007»

Multi-Document Summarization by Maximizing Informative Content-Words

13 years 9 months ago

Download research.microsoft.com

We show that a simple procedure based on maximizing the number of informative content-words can produce some of the best reported results for multi-document summarization. We ﬁr...

Wen-tau Yih, Joshua Goodman, Lucy Vanderwende, His...

claim paper

Read More »

click to vote

CIS
2005
Springer

186views Applied Computing» more CIS 2005»

Concept Chain Based Text Clustering

14 years 1 months ago

Download dm.thss.tsinghua.edu.cn

Diﬀerent from familiar clustering objects, text documents have sparse data spaces. A common way of representing a document is as a bag of its component words, but the semantic re...

Shaoxu Song, Jian Zhang, Chunping Li

claim paper

Read More »

click to vote

SIGIR
1999
ACM

153views Information Technology» more SIGIR 1999»

Probabilistic Latent Semantic Indexing

14 years 23 hour ago

Download www.cs.brown.edu

Probabilistic Latent Semantic Indexing is a novel approach to automated document indexing which is based on a statistical latent class model for factor analysis of count data. Fit...

Thomas Hofmann

claim paper

Read More »

click to vote

SIGIR
2008
ACM

101views Information Technology» more SIGIR 2008»

Knowledge transformation from word space to document space

13 years 7 months ago

Download ranger.uta.edu

In most IR clustering problems, we directly cluster the documents, working in the document space, using cosine similarity between documents as the similarity measure. In many real...

Tao Li, Chris H. Q. Ding, Yi Zhang 0005, Bo Shao

claim paper

Read More »

« Prev « First page 5 / 43 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers