DIKUL - logo
E-resources
Peer reviewed Open access
  • MPEG Standards for Compress...
    Quackenbush, Schuyler R.; Herre, Jurgen

    Proceedings of the IEEE, 09/2021, Volume: 109, Issue: 9
    Journal Article

    The term "immersive audio" is frequently used to describe an audio experience that provides the listener the sensation of being fully immersed or "present" in a sound scene. This can be achieved via different presentation modes, such as surround sound (several loudspeakers horizontally arranged around the listener), 3D audio (with loudspeakers at, above, and below listener ear level), and binaural audio to headphones. This article provides an overview of two recent standards that support the bitrate-efficient carriage of high-quality immersive sound. The first is MPEG-H 3D audio, which is a versatile standard that supports multiple immersive sound signal formats (channels, objects, and higher order ambisonics) and is now being adopted in broadcast and streaming applications. The second is MPEG-I immersive audio, an extension of 3D audio, currently under development, which is targeted for virtual and augmented reality applications. This will support rendering of fully user-interactive immersive sound for three degrees of user movement three degrees of freedom (3DoF), i.e., yaw, pitch, and roll head movement, and for six degrees of user movement six degrees of freedom (6DoF), i.e., 3DoF plus translational <inline-formula> <tex-math notation="LaTeX">{x} </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">{y} </tex-math></inline-formula>, and <inline-formula> <tex-math notation="LaTeX">{z} </tex-math></inline-formula> user position movements.