This work provides algorithms and heuristics to index text documents by determining important topics in the documents. To index text documents, the work provides algorithms to gene...
Parsing a long sentence is very difficult, since long sentences often have conjunctions which result in ambiguities. If the conjunctive structures existing in a long sentence can ...
We present a nearest-neighbor algorithm for resolving prepositional phrase attachment ambiguities. Its performance is significantly higher than previous corpus-based methods for P...
Compounded words are a challenge for NLP applications such as machine translation (MT). We introduce methods to learn splitting rules from monolingual and parallel corpora. We eva...
This paper describes a Web-based English-Chinese concordance system, TotalRecall, developed to promote translation reuse and encourage authentic and idiomatic use in second langua...
Jian-Cheng Wu, Kevin C. Yeh, Thomas C. Chuang, Wen...