Sciweavers

197 search results - page 22 / 40
» Learning Derived Words from Medical Corpora
Sort
View
ICML
2006
IEEE
14 years 8 months ago
Pachinko allocation: DAG-structured mixture models of topic correlations
Latent Dirichlet allocation (LDA) and other related topic models are increasingly popular tools for summarization and manifold discovery in discrete data. However, LDA does not ca...
Wei Li, Andrew McCallum
SIGIR
2009
ACM
14 years 2 months ago
Extracting structured information from user queries with semi-supervised conditional random fields
When search is against structured documents, it is beneficial to extract information from user queries in a format that is consistent with the backend data structure. As one step...
Xiao Li, Ye-Yi Wang, Alex Acero
ICADL
2005
Springer
112views Education» more  ICADL 2005»
14 years 1 months ago
A Method for Creating a High Quality Collection of Researchers' Homepages from the Web
This paper proposes a method for creating a high quality collection of researchers’ homepages. The proposed method consists of three phases: rough filtering of the possible web p...
Yuxin Wang, Keizo Oyama
LREC
2010
123views Education» more  LREC 2010»
13 years 9 months ago
Developing Morphological Analysers for South Asian Languages: Experimenting with the Hindi and Gujarati Languages
A considerable amount of work has been put into development of stemmers and morphological analysers. The majority of these approaches use hand-crafted suffix-replacement rules but...
Niraj Aswani, Robert J. Gaizauskas
TKDE
2010
284views more  TKDE 2010»
13 years 6 months ago
Unsupervised Semantic Similarity Computation between Terms Using Web Documents
Abstract— In this work, web-based metrics for semantic similarity computation between words or terms are presented and compared with the state-of-the-art. Starting from the funda...
Elias Iosif, Alexandros Potamianos