The HAS (Human Auditory System) has a finite frequency resolution, which basically means that weaker audio signal (maskee) becomes inaudible in the presence of (is masked by) a louder audio signal (masker), when they are close enough [1], in the frequency domain (and obviously in time, i.e., in the same chunk). When this happens, the subband [2] in which the maskee signal is placed can be quantized more severely without perceiving the quantization noise in the maskee subband (see Figure 1).
Find the subband with the highest energy:
Being \({\mathbf \Delta }_m\) the current QSS of the subband \({\mathbf w}_m\), compute the set of optimal1 QSSs as
Implement the algorithm described in Section 2 in a module named
simultaneous_masking.py
to be used in the InterCom.
Mark: 10 points.
[1] M. Bosi and R.E. Goldberd. Introduction to Digital Audio Coding and Standards. Kluwer Academic Publishers, 2003.
[2] M. Vetterli and J. Kovačević. Wavelets and Subband Coding. Prentice-hall, 1995.
1From a perceptual perspective.