The voice bank corpus: Design, collection and data analysis of a large regional accent speech database
Top Cited Papers
- 1 November 2013
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
The University of Edinburgh has started the development of a new speech database, the Voice Bank corpus, specifically designed for the creation of personalised synthetic voices for individuals with speech disorders. This corpus already constitutes the largest corpora of British English currently in existence, with more than 300 hours of recordings from approximately 500 healthy speakers. New recordings are continuously being made in order to get the best coverage of the different combinations of regional accents, social classes, age and gender across Britain. This paper describes the motivation and the processes involved in the design and recording of this corpus as well as some analysis of its content. The paper concludes with our future plans to further extend this corpus and to overcome its current limitations.Keywords
This publication has 8 references indexed in Scilit:
- Using HMM-based speech synthesis to reconstruct the voice of individuals with degenerative speech disordersPublished by International Speech Communication Association ,2012
- Analysis of speaker clustering strategies for HMM-based speech synthesisPublished by International Speech Communication Association ,2012
- On generating combilex pronunciations via morphological analysisPublished by International Speech Communication Association ,2010
- Thousands of Voices for HMM-Based Speech Synthesis–Analysis and Application of TTS Systems Built on Various ASR CorporaIEEE Transactions on Audio, Speech, and Language Processing, 2010
- Analysis of Speaker Adaptation Algorithms for HMM-Based Speech Synthesis and a Constrained SMAPLR Adaptation AlgorithmIEEE Transactions on Audio, Speech, and Language Processing, 2009
- WSJCAMO: a British English speech corpus for large vocabulary continuous speech recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Good‐turing frequency estimation without tears*Journal of Quantitative Linguistics, 1995
- A new readability yardstick.Journal of Applied Psychology, 1948