ALISA: An automatic lightly supervised speech segmentation and alignment tool


Download www.cstr.ed.ac.uk

This paper describes the ALISA tool, which implements a lightly supervised method for sentence-level alignment of speech with imperfect transcripts. Its intended use is to enable the creation of new speech corpora from a multitude of resources in a language-independent fashion, thus avoiding the need to record or transcribe speech data. The method is designed so that it requires minimum user intervention and expert knowledge, and it is able to align data in languages which employ alphabetic scripts. It comprises a GMM-based voice activity detector and a highly constrained grapheme-based speech aligner. The method is evaluated objectively against a gold standard segmentation and transcription, as well as subjectively through building and testing speech synthesis systems from the retrieved data. Results show that on average, 70% of the original data is correctly aligned, with a word error rate of less than 0.5%. In one case, subjective listening tests show a statistically significant p...
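The abstract reports alignment quality as a word error rate (WER). As a rough illustration of that metric only, the sketch below computes WER in the usual way: the word-level edit distance (substitutions, insertions, deletions) between a reference transcript and a hypothesis, divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenisation are assumptions made for this example; this is not the evaluation code used for ALISA.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper for illustration; tokenisation is plain
    whitespace splitting, which is an assumption of this sketch.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substituted word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on a mat"))
```

Under this definition, a WER below 0.5% corresponds to fewer than one word error per 200 reference words.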

Adriana Stan, Yoshitaka Mamiya, Junichi Yamagishi,


Automated Reasoning | CSL 2016 |


Post Info

Added	01 Apr 2016
Updated	01 Apr 2016
Type	Journal
Year	2016
Where	CSL
Authors	Adriana Stan, Yoshitaka Mamiya, Junichi Yamagishi, Peter Bell, Oliver Watts, Robert A. J. Clark, Simon King
