Public Perceptions Towards Synthetic Voice Technology

1 September 2021

journal article
research article
Published by SAGE Publications in Proceedings of the Human Factors and Ergonomics Society Annual Meeting

Vol. 65 (1), 1448-1452
https://doi.org/10.1177/1071181321651128

Abstract

Text-to-Speech (TTS) technologies have provided ways to produce acoustic approximations of human voices. However, recent advancements in machine learning (i.e., neural network TTS) have helped move beyond coarse mimicry and towards more natural-sounding speech. With only a small collection of recorded utterances, it is now possible to generate wholly synthetic voices indistinguishable from those of human speakers. While these new approaches to speech synthesis can help facilitate more seamless experiences with artificial agents, they also lower the barrier to entry for those seeking to perpetrate deception. As such, in the development of these technologies, it is important to anticipate potential harms and devise strategies to help mitigate against misuse. This paper presents findings from a 360-person survey that assessed public perceptions of synthetic voices, with a particular focus on how voice type and social scenarios impact ratings of trust. Findings have implications for the responsible deployment of synthetic speech technologies.

Keywords

This publication has 3 references indexed in Scilit:

Impact of auditory sense on trust and brand affect through auditory social interaction and control
Journal of Retailing and Consumer Services, 2020
The spread of true and false news online
Science, 2018
"Like Having a Really Bad PA"
Published by Association for Computing Machinery (ACM) ,2016

Cited by 2 articles