Listening devices
Traditionally, high quality audio reproduction has been the domain of multi-speaker systems, often accompanied by one or more subwoofers. However, with the rise of online gaming and voice chat, many players have transitioned to headsets, many with integrated microphones.
Headphone audio will be the standard for immersive experiences into the future, as it provides better isolation, privacy, portability, and spatialization. For modern immersive experiences, especially with head tracking and user movement, speaker arrays do not provide an adequate audio experience. See below for an overview of listening devices and the characteristics that make them unique.
Headphones offer several significant advantages over free-field speaker systems for immersive audio:
- Acoustic isolation from the listener’s environment enhances realism and immersion.
- Head tracking is greatly simplified.
- HRTFs are more accurate since they don’t suffer from the “doubling down” of HRTF effects, i.e. sounds being modified from the simulated HRTF, and again by the listener’s actual body geometry.
- Access to controls while wearing an HMD is far more user friendly when those controls are physically attached to the listener.
- Microphones are ideally placed near the mouth, and subject to much less echo/feedback.
Headphones are available in a variety of types with various trade-offs:
As a general rule of thumb, closed back headphones offer the most isolation and bass response. However, the closed construction may lead to discomfort due to heat and weight, and they tend to offer less accurate reproduction due to internal resonance. Also, if placed on or over the ear, they cause the pinnae to impact sound reproduction slightly.
While acoustic isolation can help with immersion, it cuts listeners off from their environment so they may be unable to hear others entering the room, cell phone ringing, doorbell, et cetera. Whether that is a good thing or not is up to the individual.
Open back headphones are generally more accurate and comfortable than closed-back headphones, but they do not isolate listeners from the exterior environment, and broadcast to the surrounding environment as well. These are suitable for quiet areas devoted to a immersive experience, possibly in conjunction with a subwoofer.
As with closed back headphones, when placed on or over the ear, open back headphones allow the pinnae to impact sound reproduction slightly.
Earbuds (such as those that ship with cell phones or portable music players) are cheap, lightweight, and very portable, though they typically lack bass. Some models have surprisingly good frequency response, albeit with a steady roll off of bass frequencies. These are mostly ignored for spatialization. Most earbuds are poor at isolation.
In-ear monitors offer superior isolation from your environment, they are very lightweight, and have excellent frequency response over the entire range. They remove the effects of the listener’s pinnae from sound (unlike on-ear headphones).
Headphones, like all transducers, impart their own characteristics on signals which then add coloration to the sound which is undesirable for HRTF reproduction. The frequency response of headphones is designed to roughly mimic the character of listening to speakers in a room, for example, a harmon curve. This can be compensated by applying the inverse frequency response of the headphones, providing a “flat” frequency response for accurate HRTF reproduction. The Oculus Rift and Meta Quest HRTFs is designed for headphone playback and has a frequency response that is tailored to suit most headphones.
Until recently, the most common way to provide sound immersion was to surround the listener with speakers, such as a Dolby 5.1 or 7.1 speaker configuration. While partially effective for a fixed and narrow sitting position, speaker array systems suffer from the following:
- Imprecise imaging due to panning over large portions of the listening area.
- No elevation cues. Sounds only appear in a 360 degree circle around the listener.
- Assumption of immobile listener; in particular, no head tracking.
- Room effects such as reverberation and reflections impact the reproduced sound.
- Poor isolation means that outside sounds can intrude on the immersive experience.
It is doubtful that multi-speaker configurations will be common or effective for home immersive applications, though they may be viable for dedicated commercial installations.
Bluetooth has become a popular communication method of wireless audio broadcast. Unfortunately, modern Bluetooth implementations often incur significant latency, sometimes as high as 500 milliseconds. As a result, Bluetooth technology is not recommended for audio output.
Get started creating your immersive soundscape
If you’re ready to kick off the technical side of immersive audio design and engineering, check out the following technical documentation: