Sciweavers

» Models of English Text
EMNLP
2008
13 years 8 months ago
One-Class Clustering in the Text Domain
Having seen a news title "Alba denies wedding reports", how do we infer that it is primarily about Jessica Alba, rather than about weddings or reports? We probably reali...
Ron Bekkerman, Koby Crammer
LREC
2008
13 years 8 months ago
A Lightweight and Efficient Tool for Cleaning Web Pages
Originally conceived as a "naive" baseline experiment using traditional n-gram language models as classifiers, the NCLEANER system has turned out to be a fast and lightw...
Stefan Evert
ANLP
1997
13 years 8 months ago
A Maximum Entropy Approach to Identifying Sentence Boundaries
We present a trainable model for identifying sentence boundaries in raw text. Given a corpus annotated with sentence boundaries, our model learns to classify each occurrence of ., ...
Jeffrey C. Reynar, Adwait Ratnaparkhi
COLING
2008
13 years 8 months ago
Modeling Chinese Documents with Topical Word-Character Models
As Chinese text is written without word boundaries, effectively recognizing Chinese words is like recognizing collocations in English, substituting characters for words and words ...
Wei Hu, Nobuyuki Shimizu, Hiroshi Nakagawa, Huanye...
AAAI
2008
13 years 9 months ago
Text Beautifier: An Affective-Text Tool to Tailor Written Text
We have spelling and grammar checking tools available on today's word processors. But what they are missing is a tool that can recommend several possibilities of a given writ...
Fahim Kawsar, Shaikh Mostafa Al Masum, Mitsuru Ish...