Sciweavers

IADIS
2008
13 years 9 months ago
Towards an Error-Free Stemming
S Stemming is a fundamental step in processing textual data preceding the tasks of information retrieval, text mining, and natural language processing. The common goal of stemming ...
Eiman Tamah Al-Shammari
FSMNLP
2008
Springer
13 years 9 months ago
Applying Finite State Morphology to Conversion Between Roman and Perso-Arabic Writing Systems
This paper presents a method for converting back and forth between the Perso-Arabic and a Romanized writing systems for Persian. Given a word in one writing system, we use finite ...
Jalal Maleki, Maziar Yaesoubi, Lars Ahrenberg
FSMNLP
2008
Springer
13 years 9 months ago
Learning with Weighted Transducers
Weighted finite-state transducers have been used successfully in a variety of natural language processing applications, including speech recognition, speech synthesis, and machine ...
Corinna Cortes, Mehryar Mohri
FSMNLP
2008
Springer
13 years 9 months ago
Finite State Models for the Generation of Large Corpora of Natural Language Texts
Domenico Cantone, Salvatore Cristofaro, Simone Far...
FSMNLP
2008
Springer
13 years 9 months ago
CLARIN and Free Open Source Finite-State Tools
Kimmo Koskenniemi, Anssi Yli-Jyrä
ESANN
2008
13 years 9 months ago
Factored sequence kernels
In this paper we propose an extension of sequence kernels to the case where the symbols that define the sequences have multiple representations. This configuration occurs in natura...
Pierre Mahé, Nicola Cancedda
EMNLP
2008
13 years 9 months ago
A Dependency-based Word Subsequence Kernel
This paper introduces a new kernel which computes similarity between two natural language sentences as the number of paths shared by their dependency trees. The paper gives a very...
Rohit J. Kate
EMNLP
2008
13 years 9 months ago
N-gram Weighting: Reducing Training Data Mismatch in Cross-Domain Language Model Estimation
In domains with insufficient matched training data, language models are often constructed by interpolating component models trained from partially matched corpora. Since the ngram...
Bo-June Paul Hsu, James R. Glass