Immersive audio, capture, transport, and rendering: a review

Open Access

16 September 2021

journal article
review article
Published by Now Publishers in APSIPA Transactions on Signal and Information Processing

Vol. 10 (1)
https://doi.org/10.1017/atsip.2021.12

Abstract

Immersive audio has received significant attention in the past decade. The emergence of a few groundbreaking systems and events (Dolby Atmos, MPEG-H, VR/AR, AI) contributes to reshaping the landscape of this field, accelerating the mass market adoption of immersive audio. This review serves as a quick recap of some immersive audio background, end to end workflow, covering audio capture, compression, and rendering. The technical aspects of object audio and ambisonic will be explored, as well as other related topics such as binauralization, virtual surround, and upmix. Industry trends and applications are also discussed where user experience ultimately decides the future direction of the immersive audio technologies.

Keywords

This publication has 10 references indexed in Scilit:

Spleeter: a fast and efficient music source separation tool with pre-trained models
The Journal of Open Source Software, 2020
Fundamentals of a Parametric Method for Virtual Navigation Within an Array of Ambisonics Microphones
Journal of the Audio Engineering Society, 2020
Spatial Coding of Complex Object-Based Program Material
Journal of the Audio Engineering Society, 2019
3D Tune-In Toolkit: An open-source library for real-time binaural spatialisation
PLOS ONE, 2019
Capturing 360° Audio Using an Equal Segment Microphone Array (ESMA)
Journal of the Audio Engineering Society, 2019
A Fifty-Node Lebedev Grid And Its Applications To Ambisonics
Journal of the Audio Engineering Society, 2016
MPEG-H Audio—The New Standard for Universal Spatial/3D Audio Coding
Journal of the Audio Engineering Society, 2015
Energy-Preserving Ambisonic Decoding
Acta Acustica united with Acustica, 2012
Parametric Coding of Stereo Audio
EURASIP Journal on Advances in Signal Processing, 2005
High-fidelity multichannel audio coding with karhunen-loeve transform
IEEE Transactions on Speech and Audio Processing, 2003

Cited by 4 articles