Sciweavers

160 search results - page 7 / 32
» Tagging Spoken Language Using Written Language Statistics
Sort
View
CLIN
2000
13 years 8 months ago
Syntactic Annotation for the Spoken Dutch Corpus Project (CGN)
Of the ten million words of contemporary standard Dutch in the Spoken Dutch Corpus (Corpus Gesproken Nederlands, CGN), a selection of one million words of natural spoken language ...
Heleen Hoekstra, Michael Moortgat, Ineke Schuurman...
ICASSP
2010
IEEE
13 years 6 months ago
Spoken language translation from parallel speech audio: Simultaneous interpretation as SLT training data
In recent work, we proposed an alternative to parallel text as translation model (TM) training data: audio recordings of parallel speech (pSp), as it occurs in any communication s...
Matthias Paulik, Alex Waibel
CORR
1998
Springer
153views Education» more  CORR 1998»
13 years 7 months ago
Identifying Discourse Markers in Spoken Dialog
In this paper, we present a method for identifying discourse marker usage in spontaneous speech based on machine learning. Discourse markers are denoted by special POS tags, and t...
Peter A. Heeman, Donna K. Byron, James F. Allen
COLING
2010
13 years 2 months ago
Urdu and Hindi: Translation and sharing of linguistic resources
Hindi and Urdu share a common phonology, morphology and grammar but are written in different scripts. In addition, the vocabularies have also diverged significantly especially in ...
Karthik Visweswariah, Vijil Chenthamarakshan, Nand...
NAACL
2007
13 years 9 months ago
Tagging Icelandic Text using a Linguistic and a Statistical Tagger
We describe our linguistic rule-based tagger IceTagger, and compare its tagging accuracy to the TnT tagger, a state-of-theart statistical tagger, when tagging Icelandic, a morphol...
Hrafn Loftsson