The voice bank corpus: Design, collection and data analysis of a large regional accent speech database

Top Cited Papers

1 November 2013

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Abstract

The University of Edinburgh has started the development of a new speech database, the Voice Bank corpus, specifically designed for the creation of personalised synthetic voices for individuals with speech disorders. This corpus already constitutes the largest corpora of British English currently in existence, with more than 300 hours of recordings from approximately 500 healthy speakers. New recordings are continuously being made in order to get the best coverage of the different combinations of regional accents, social classes, age and gender across Britain. This paper describes the motivation and the processes involved in the design and recording of this corpus as well as some analysis of its content. The paper concludes with our future plans to further extend this corpus and to overcome its current limitations.

Keywords

This publication has 8 references indexed in Scilit:

Using HMM-based speech synthesis to reconstruct the voice of individuals with degenerative speech disorders
Published by International Speech Communication Association ,2012
Analysis of speaker clustering strategies for HMM-based speech synthesis
Published by International Speech Communication Association ,2012
On generating combilex pronunciations via morphological analysis
Published by International Speech Communication Association ,2010
Thousands of Voices for HMM-Based Speech Synthesis–Analysis and Application of TTS Systems Built on Various ASR Corpora
IEEE Transactions on Audio, Speech, and Language Processing, 2010
Analysis of Speaker Adaptation Algorithms for HMM-Based Speech Synthesis and a Constrained SMAPLR Adaptation Algorithm
IEEE Transactions on Audio, Speech, and Language Processing, 2009
WSJCAMO: a British English speech corpus for large vocabulary continuous speech recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Good‐turing frequency estimation without tears*
Journal of Quantitative Linguistics, 1995
A new readability yardstick.
Journal of Applied Psychology, 1948

Cited by 145 articles