|
|
|
Angelo Farina (1), Lamberto Tronchin (2) |
|
|
|
|
|
(1) Industrial Engineering Dept., University of
Parma, Via delle Scienze 181/A |
|
43100 Parma, ITALY – HTTP://pcfarina.eng.unipr.it |
|
|
|
(2) DIENCA-CIARM, University of Bologna, Via
Risorgimento 42 |
|
40136 Bologna, Italy – HTTP://ciarm.ing.unibo.it |
|
|
|
|
|
|
This paper describes the processing required for
obtaining a realistic audible sound reproduction from the results of a
geometrical room acoustics program |
|
This overall process is usually known as
“Auralization”, and traditionally is performed through the binaural
technology (headphone reproduction) |
|
Here the process is generalized to many more
reproduction systems: Mono, Binaural, Stereo Dipole, Ambisonics and
Ambiophonics. |
|
The last three systems are loudspeaker-based |
|
|
|
|
In all these 5 cases, anyway, the auralization
is obtained by means of real-time convolution of dry signals with properly
derived impulse responses |
|
Convolution is now possible in real time, with
many simultaneous channels, directly on a low cost PC without any added
hardware, thanks to available free sofware, which outperforms traditional
DSP-based convolvers |
|
Consequently, this paper focuses on the
derivation of the proper sets of impulse responses for each reproduction
method, starting from the results of the geometrical room acoustic program. |
|
|
|
|
|
The multichannel convolution can be done for
free on a low cost PC nowadays. Two solutions are currently available: |
|
|
|
|
This piece of software has to read the result
files produced by the room acoustics simulation program (Ramsete 2), and
process them for deriving a multichannel set of impulse responses,
corresponding to those which could be recorded experimentally in a real
hall employing a multichannel microphone of the chosen kind. |
|
Actually the Audio Converter program developed
for the release version of Ramsete 2 supports these microphonic standards:
Coincident spherical harmonics (mono, 1st and 2nd order Ambisonics),
Binaural (Kemar), Stereo-Dipole (Kemar), Dual Stereo Dipole (Kemar),
Ambiophone (Pinnaless sphere dummy head) |
|
|
|
|
Ramsete can combine the results of several sound
sources emitting the same signal, with optional delay and equalization
(multi-source sound reinforcement systems): |
|
|
|
|
In each receiver point, Ramsete computes an
energetic impulse response in ten octave bands for each sound source (or
combination of multiple sources): |
|
|
|
|
The temporal density of the reflections
initially grows (theoretically with the square of time), but later it
starts reducing and eventually vanishes. |
|
|
|
|
First the discrete early reflections are
processed, taking into account their known arrival direction and exact
timing. |
|
For each discrete arrival, a Dirac’s delta is
generated at the exact arrival time, |
|
then it is convolved with the impulse response
of an octave-band equalizer which imposes the proper SPL value in the 10
octave bands, |
|
and finally it is convolved with the
multichannel impulse response of the selected type of microphone, chosen
depending on the direction of arrival. |
|
|
|
|
A local cartesian reference system is assumed
solidal with the listener head; X’ axis is pointing forward (nose), Y’axis
is pointing on the left ear, and the Z’ axis towards the top of the head. |
|
|
|
|
The Ramsete program saves the coordinates of
three points in the absolute reference system: |
|
- Pprov º (xprov, xprov, xprov)
= provenience point of the ray; |
|
- Orec º (xrec, yrec,
zrec) = receiver origin; |
|
- Ptarg º (xt,
yt, zt) = receiver target point. |
|
|
|
First of all, the position of the
provenience point of the ray is recomputed in the local reference system: |
|
Pprov º> P’prov (x’prov,
y’prov, z’prov) |
|
|
|
|
|
|
Elevation angle j : |
|
|
|
|
|
Azimuth angle q : |
|
|
|
|
0th Order can be listened to through
a single loudspeaker |
|
1st and 2nd orders must be
properly matrixed, for driving an Ambisonics array of loudspeakers
surrounding the listener |
|
|
|
|
Hardware decoders are obsolete nowadays. |
|
Free software is available for 1st
and 2nd order real-time decoding, driving a multichannel sound
board (thanks to Richard Furse) |
|
The decoding structure can be realized also by
means of a multichannel convolution software (BruteFir, Ambiovolver,
FIRreverb) |
|
|
|
|
In Ambisonics, the directive microphones IRs are
simply matter of changing the gain (and perhaps the polarity) of a Dirac’s
delta function |
|
In the binaural case, instead, a complex stereo
IR is required for any direction-of-arrival |
|
A large data base of stereo binaural IRs (HRTF)
is available from MIT-Medialab |
|
A proper routine has been developed for
interpolating the required IR starting from the knowledge of the direction
of arrival of each discrete early reflection. |
|
|
|
|
The complete set of binaural IRs measured on the
Kemar dummy head at MIT-Medialab |
|
|
|
|
Number of measurements at each elevation |
|
|
|
|
|
the weight Pi (i = 1,2,3) relative to
each HRTF is obtained calculating the opposite triangle area and dividing
it for the total initial triangle area; so we have P1+P2+P3
=1. |
|
|
|
|
At the exact arrival time of the received ray,
an averaged binaural IR is added to the global impulse response. |
|
The averaged IR is obtained by a frequency
domain interpolation between the three HRTF complex spectra: |
|
|
|
|
|
The interpolation is actually done in the
frequency domain, on the magnitude and unwrapped phase of the three HRTFs |
|
The octave-band amplitude equalization
corresponding to the room transfer function is also applied in the
frequency domain |
|
FFT and IFFT are used for converting between
time domain and frequency domain |
|
|
|
|
The cross-talk cancellation allows for the
replica of the recorded signals at the ears of the listener |
|
|
|
|
First, a binaural measurement is made in front
of the Stereo Dipole loudspeakers |
|
|
|
|
The regularization parameter, e, has to be
adjusted by trials |
|
|
|
|
Measured impulse responses h |
|
|
|
|
Computed long-FIR inverse filters f |
|
|
|
|
It is a four-channel system, in which a frontal
stereo dipole is employed for reproducing the sound coming from directions
located in the frontal hemispace, and the rear stereo dipole reproduces the
sound coming from the rear hemispace |
|
Usually the rear loudspeaker pair requires a
larger angle than the frontal one |
|
|
|
|
The system is based on two indipendently
designed groups of loudspeakers: |
|
a Stereo Dipole, responsible for the
reproduction only of the direct sound and early reflections coming from the
stage, |
|
a surround periphonic array, driven by real-time
convolution with room impulse responses (it can be quite irregularly
shaped) |
|
|
|
|
For any loudspeaker in the surround array, the
WXYZ channels of a B-format IR can be processed, extracting a single (mono)
response of a virtual microphone pointing along a given versor r (rx,
ry, rz): |
|
|
|
|
|
Auralization can be done nowadays not only by
the traditional binaural method (headphone listening), but also with modern
methods of loudspeaker presentation |
|
Ambiophonics revealed to give significant
advantages over the two surround systems which constitutes it (1st
order Ambisonics and Stereo Dipole). |
|
In cases of limitation in number of reproduction
channels, a Dual-Stereo-Dipole can also be very effective. |
|
When multiple listeners are required in a large
listening area, the preferred method is 2nd order Ambisonics. |
|
The computational power required by these
multichannel reproduction methods can be obtained cheaply by means of a
modern PC running one of the available free convolution softwares |
|
|