Sciweavers

563 search results - page 30 / 113
» Assessing the Quality of Natural Language Text Data
ADC
2010
Springer
13 years 3 months ago
Building a dynamic classifier for large text data collections
Due to the lack of in-built tools to navigate the web, people have to use external solutions to find information. The most popular of these are search engines and web directories....
Pavel Kalinov, Bela Stantic, Abdul Sattar
IJCNLP
2005
Springer
14 years 2 months ago
Chunking Using Conditional Random Fields in Korean Texts
We present a method of chunking in Korean texts using conditional random fields (CRFs), a recently introduced probabilistic model for labeling and segmenting sequence data. In a...
Yong-Hun Lee, Mi-Young Kim, Jong-Hyeok Lee
CICLING
2007
Springer
14 years 2 months ago
On the Impact of Lexical and Linguistic Features in Genre- and Domain-Based Categorization
Classification in genres and domains is a major field of research for Information Retrieval (scientific and technical watch, datamining, etc.) and the selection of app...
Guillaume Cleuziou, Céline Poudat
IJCNLP
2005
Springer
14 years 2 months ago
Acquiring Synonyms from Monolingual Comparable Texts
This paper presents a method for acquiring synonyms from monolingual comparable text (MCT). MCT denotes a set of monolingual texts whose contents are similar and can be obtained au...
Mitsuo Shimohata, Eiichiro Sumita
COLING
2000
13 years 10 months ago
Improving SMT quality with morpho-syntactic analysis
In the framework of statistical machine translation (SMT), correspondences between the words in the source and the target language are learned from bilingual corpora on the basis ...
Sonja Nießen, Hermann Ney