Sciweavers

139 search results - page 16 / 28
» Information-Theoretic Segmentation of Natural Language
Sort
View
NLPRS
2001
Springer
15 years 9 months ago
A Maximum Entropy Tagger with Unsupervised Hidden Markov Models
We describe a new tagging model where the states of a hidden Markov model (HMM) estimated by unsupervised learning are incorporated as the features in a maximum entropy model. Our...
Jun'ichi Kazama, Yusuke Miyao, Jun-ichi Tsujii
NLPRS
2001
Springer
15 years 9 months ago
Automatic Corpus-Based Extraction of Chinese Legal Terms
This paper reports on a study involving the automatic extraction of Chinese legal terms. We used a word segmented corpus of Chinese court judgments to extract salient legal expres...
Oi Yee Kwong, Benjamin K. Tsou
131
Voted
EACL
2006
ACL Anthology
15 years 6 months ago
Improving Probabilistic Latent Semantic Analysis with Principal Component Analysis
Probabilistic Latent Semantic Analysis (PLSA) models have been shown to provide a better model for capturing polysemy and synonymy than Latent Semantic Analysis (LSA). However, th...
Ayman Farahat, Francine Chen
EACL
1993
ACL Anthology
15 years 5 months ago
A Probabilistic Context-free Grammar for Disambiguation in Morphological Parsing
One of the major problems one is faced with when decomposing words into their constituent parts is ambiguity: the generation of multiple analyses for one input word, many of which...
Josée S. Heemskerk
165
Voted
SIGIR
2003
ACM
15 years 9 months ago
Domain-independent text segmentation using anisotropic diffusion and dynamic programming
This paper presents a novel domain-independent text segmentation method, which identifies the boundaries of topic changes in long text documents and/or text streams. The method c...
Xiang Ji, Hongyuan Zha