Sciweavers

106 search results - page 16 / 22
» Document Representation and Dimension Reduction for Text Clu...
Sort
View
VDA
2010
185views Visualization» more  VDA 2010»
13 years 10 months ago
Visualizing multidimensional data through granularity-dependent spatialization
Spatialization is a special kind of visualization that projects multidimensional data into low-dimensional representational spaces by making use of spatial metaphors. Spatializati...
Sofia Kontaxaki, Eleni Tomai, Margarita Kokla, Mar...
WWW
2010
ACM
14 years 2 months ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han
SDM
2011
SIAM
370views Data Mining» more  SDM 2011»
12 years 10 months ago
Sparse Latent Semantic Analysis
Latent semantic analysis (LSA), as one of the most popular unsupervised dimension reduction tools, has a wide range of applications in text mining and information retrieval. The k...
Xi Chen, Yanjun Qi, Bing Bai, Qihang Lin, Jaime G....
CIKM
2000
Springer
13 years 12 months ago
Dimensionality Reduction and Similarity Computation by Inner Product Approximations
—As databases increasingly integrate different types of information such as multimedia, spatial, time-series, and scientific data, it becomes necessary to support efficient retri...
Ömer Egecioglu, Hakan Ferhatosmanoglu
EMNLP
2008
13 years 9 months ago
Acquiring Domain-Specific Dialog Information from Task-Oriented Human-Human Interaction through an Unsupervised Learning
We describe an approach for acquiring the domain-specific dialog knowledge required to configure a task-oriented dialog system that uses human-human interaction data. The key aspe...
Ananlada Chotimongkol, Alexander I. Rudnicky