IJCNLP 2004, Springer

Detecting Sentence Boundaries in Japanese Speech Transcriptions Using a Morphological Analyzer

Download www.ls.info.hiroshima-cu.ac.jp

We present a method for automatically detecting sentence boundaries (SBs) in Japanese speech transcriptions. Our method uses a Japanese morphological analyzer that assigns a cost to each candidate analysis and selects the one with the minimum cost as the best result. The rationale for using a morphological analyzer to identify candidate SBs is that the analyzer assigns lower costs to better sequences of morphemes. After the candidate SBs have been identified, unsuitable candidates are deleted using lexical information acquired from the training corpus. Our method achieved 77.24% precision, 88.00% recall, and an F-measure of 0.8277 on a corpus of lecture speech transcriptions in which the SBs are not given.
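
The cost-based idea in the abstract can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the toy lexicon, the part-of-speech tags, the connection costs, and in particular the criterion for proposing a candidate SB (splitting the text at a morpheme boundary lowers the total minimum cost). The abstract does not spell out the authors' exact criterion, and the second step, filtering candidates with lexical information from a training corpus, is not shown.

```python
from math import inf

# Hypothetical toy lexicon: surface form -> (part-of-speech tag, word cost).
LEXICON = {
    "今日": ("NOUN", 2),
    "明日": ("NOUN", 2),
    "雨": ("NOUN", 2),
    "晴れ": ("NOUN", 2),
    "は": ("PART", 1),
    "です": ("AUX", 1),
}

# Hypothetical connection costs between adjacent tags.
# BOS/EOS mark the beginning and end of the analyzed string.
CONNECT = {
    ("BOS", "NOUN"): 0,
    ("NOUN", "PART"): 0,
    ("PART", "NOUN"): 0,
    ("NOUN", "AUX"): 0,
    ("AUX", "EOS"): 0,
    ("AUX", "NOUN"): 4,  # assumed: a sentence-final form rarely precedes a noun
}
DEFAULT_CONNECT = 3  # assumed cost for any tag pair not listed above


def connect(prev_tag, tag):
    return CONNECT.get((prev_tag, tag), DEFAULT_CONNECT)


def analyze(text):
    """Return (minimum cost, morphemes) for text via a Viterbi-style search."""
    # best[i] maps the tag of the morpheme ending at offset i to (cost, path).
    best = [dict() for _ in range(len(text) + 1)]
    best[0]["BOS"] = (0, [])
    for i in range(len(text)):
        for prev_tag, (cost, path) in best[i].items():
            for surface, (tag, word_cost) in LEXICON.items():
                if text.startswith(surface, i):
                    j = i + len(surface)
                    new_cost = cost + connect(prev_tag, tag) + word_cost
                    if new_cost < best[j].get(tag, (inf, None))[0]:
                        best[j][tag] = (new_cost, path + [surface])
    finals = [(cost + connect(tag, "EOS"), path)
              for tag, (cost, path) in best[-1].items()]
    if not finals:
        raise ValueError("text cannot be segmented with the toy lexicon")
    return min(finals, key=lambda item: item[0])


def candidate_boundaries(text):
    """Offsets where analyzing the two halves separately is cheaper (assumed criterion)."""
    whole_cost, morphemes = analyze(text)
    candidates = []
    offset = 0
    for morpheme in morphemes[:-1]:
        offset += len(morpheme)
        left_cost, _ = analyze(text[:offset])
        right_cost, _ = analyze(text[offset:])
        if left_cost + right_cost < whole_cost:
            candidates.append(offset)
    return candidates


text = "今日は雨です明日は晴れです"
print(analyze(text))               # minimum-cost segmentation of the whole string
print(candidate_boundaries(text))  # character offsets proposed as candidate SBs
```

With these toy costs, the only split that lowers the total cost falls after the first "です", because the assumed AUX-to-NOUN connection is expensive while ending and restarting the analysis there is cheap. A real analyzer would take its word and connection costs from its own dictionary rather than from hand-picked values like these.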

Sachie Tajima, Hidetsugu Nanba, Manabu Okumura

IJCNLP 2004 | Japanese Morphological Analyzer | Morphological Analyzer | Speech Transcriptions |

Related Content

» Construction of linefeed insertion rules for lecture transcript and their evaluation

» A Grammar And A Parser For Spontaneous Speech

» Experiments on Sentence Boundary Detection

» Online Acquisition of Japanese Unknown Morphemes using Morphological Constraints

» A study in machine learning from imbalanced data for sentence boundary detection in speech

» Detection of Quotations and Inserted Clauses and Its Application to Dependency Structure A...

» Online Japanese Unknown Morpheme Detection using Orthographic Variation

» A Syllable Based Word Recognition Model for Korean Noun Extraction

» Automatic Discovery of Salient Segments in Imperfect Speech Transcripts

Post Info
Added	02 Jul 2010
Updated	02 Jul 2010
Type	Conference
Year	2004
Where	IJCNLP
Authors	Sachie Tajima, Hidetsugu Nanba, Manabu Okumura