Sciweavers

Distributed IR for Digital Libraries
ICPR
2010
IEEE
13 years 7 months ago
Unsupervised Learning from Linked Documents
Documents in many corpora, such as digital libraries and webpages, contain both content and link information. In a traditional topic model which plays an important role in the uns...
Zhen Guo, Shenghuo Zhu, Yun Chi, Zhongfei Zhang, Y...
TKDE
2010
13 years 7 months ago
Probabilistic Topic Models for Learning Terminological Ontologies
Probabilistic topic models were originally developed and utilised for document modeling and topic extraction in Information Retrieval. In this paper we describe a new approach f...
Wang Wei, Payam M. Barnaghi, Andrzej Bargiela
TOIS
2010
13 years 7 months ago
Learning author-topic models from text corpora
We propose a new unsupervised learning technique for extracting information about authors and topics from large text collections. We model documents as if they were generated by a...
Michal Rosen-Zvi, Chaitanya Chemudugunta, Thomas L...
ICDM
2009
IEEE
14 years 3 months ago
Knowledge Discovery from Citation Networks
Knowledge discovery from scientific articles has received increasing attention recently since huge repositories are made available by the development of the Internet and digit...
Zhen Guo, Zhongfei Zhang, Shenghuo Zhu, Yun Chi, Y...
SIGIR
2004
ACM
14 years 2 months ago
Length normalization in XML retrieval
XML retrieval is a departure from standard document retrieval in which each individual XML element, ranging from italicized words or phrases to full blown articles, is a potential...
Jaap Kamps, Maarten de Rijke, Börkur Sigurbj...