Sciweavers

59 search results - page 8 / 12
» An extension of PLSA for document clustering
Sort
View
CIKM
2006
Springer
13 years 11 months ago
Multi-task text segmentation and alignment based on weighted mutual information
Text segmentation is important for text analysis, while text alignment is to determine shared sub-topics among similar documents. Multi-task text segmentation and alignment is the...
Bingjun Sun, Ding Zhou, Hongyuan Zha, John Yen
CORR
2007
Springer
117views Education» more  CORR 2007»
13 years 7 months ago
Dirac Notation, Fock Space and Riemann Metric Tensor in Information Retrieval Models
Using Dirac Notation as a powerful tool, we investigate the three classical Information Retrieval (IR) models and some their extensions. We show that almost all such models can be...
Xing M. Wang
CIKM
2001
Springer
14 years 10 days ago
PowerDB-IR - Information Retrieval on Top of a Database Cluster
Our current concern is a scalable infrastructure for information retrieval (IR) with up-to-date retrieval results in the presence of frequent, continuous updates. Timely processin...
Torsten Grabs, Klemens Böhm, Hans-Jörg S...
WWW
2009
ACM
14 years 15 days ago
Extracting data records from the web using tag path clustering
Fully automatic methods that extract lists of objects from the Web have been studied extensively. Record extraction, the first step of this object extraction process, identifies...
Gengxin Miao, Jun'ichi Tatemura, Wang-Pin Hsiung, ...
ICTAI
2007
IEEE
14 years 2 months ago
Dragon Toolkit: Incorporating Auto-Learned Semantic Knowledge into Large-Scale Text Retrieval and Mining
The majority of text retrieval and mining techniques are still based on exact feature (e.g. words) matching and unable to incorporate text semantics. Many researchers believe that...
Xiaohua Zhou, Xiaodan Zhang, Xiaohua Hu