Using Text-to-Speech to Prototype Game Dialog

12 November 2018

journal article
research article
Published by Association for Computing Machinery (ACM) in Computers in Entertainment

Vol. 16 (4), 1-16
https://doi.org/10.1145/3276321

Abstract

Voice acting is common in computer games in many genres. The recording and processing of voice acting is a time-consuming process that involves, for instance, voice actors, directors, audio engineers, and game writers. Changes to the script of a game after the voice acting has been recorded are expensive. At the same time, playtests of games without voice acting may give different results than testing where it is present. This creates a situation where improvements identified from play testing are either ignored or leads to extensive re-recording of voice acting. This article presents a design science research project where text-to-speech (TTS) synthesis is used as a substitute for recorded voice acting in the early stages of game production. We propose a set of design principles that have been evaluated in a sharp game production. Our results indicate several benefits of using TTS as a prototyping tool: It can be a source of inspiration for game writers, it gives good estimations on timing and pacing of the game, and it allows for early tests of how the dialog will be perceived by players. The quality and characteristics of the voices provided by the TTS system play an important role in this process. The rapid development in the speech technology field opens many future possibilities.

Keywords

This publication has 26 references indexed in Scilit:

Software Development Processes for Games: A Systematic Literature Review
Communications in Computer and Information Science, 2014
Is Requirements Engineering Useless in Game Development?
Published by Springer Science and Business Media LLC ,2014
Controlling the uncontrollable: ‘Agile’ teams and illusions of autonomy in creative work
Work, Employment & Society, 2013
How Are Agile Methods and Practices Deployed in Video Game Development? A Survey into Finnish Game Studios
Lecture Notes in Business Information Processing, 2013
Games are not convergence: The lost promise of digital production and convergence
Convergence: The International Journal of Research into New Media Technologies, 2011
Statistical parametric speech synthesis
Speech Communication, 2009
Is text-to-speech synthesis ready for use in computer-assisted language learning?
Speech Communication, 2009
A Design Science Research Methodology for Information Systems Research
Journal of Management Information Systems, 2007
Idea Creation, Constructivism and Evolution as Key Characteristics in the Videogame Artifact Design Process
European Management Journal, 2006
Review of text-to-speech conversion for English
The Journal of the Acoustical Society of America, 1987

Cited by 5 articles