Immersive audio, capture, transport, and rendering: a review
Open Access
- 16 September 2021
- journal article
- review article
- Published by Now Publishers in APSIPA Transactions on Signal and Information Processing
- Vol. 10 (1)
- https://doi.org/10.1017/atsip.2021.12
Abstract
Immersive audio has received significant attention in the past decade. The emergence of a few groundbreaking systems and events (Dolby Atmos, MPEG-H, VR/AR, AI) contributes to reshaping the landscape of this field, accelerating the mass market adoption of immersive audio. This review serves as a quick recap of some immersive audio background, end to end workflow, covering audio capture, compression, and rendering. The technical aspects of object audio and ambisonic will be explored, as well as other related topics such as binauralization, virtual surround, and upmix. Industry trends and applications are also discussed where user experience ultimately decides the future direction of the immersive audio technologies.Keywords
This publication has 10 references indexed in Scilit:
- Spleeter: a fast and efficient music source separation tool with pre-trained modelsThe Journal of Open Source Software, 2020
- Fundamentals of a Parametric Method for Virtual Navigation Within an Array of Ambisonics MicrophonesJournal of the Audio Engineering Society, 2020
- Spatial Coding of Complex Object-Based Program MaterialJournal of the Audio Engineering Society, 2019
- 3D Tune-In Toolkit: An open-source library for real-time binaural spatialisationPLOS ONE, 2019
- Capturing 360° Audio Using an Equal Segment Microphone Array (ESMA)Journal of the Audio Engineering Society, 2019
- A Fifty-Node Lebedev Grid And Its Applications To AmbisonicsJournal of the Audio Engineering Society, 2016
- MPEG-H Audio—The New Standard for Universal Spatial/3D Audio CodingJournal of the Audio Engineering Society, 2015
- Energy-Preserving Ambisonic DecodingActa Acustica united with Acustica, 2012
- Parametric Coding of Stereo AudioEURASIP Journal on Advances in Signal Processing, 2005
- High-fidelity multichannel audio coding with karhunen-loeve transformIEEE Transactions on Speech and Audio Processing, 2003