[sdiy] Beat tones, Fourier analysis, and nonlinearities in the ear [was Phase shifts and instantaneous frequency]

Ian Fritz ijfritz at comcast.net
Wed Jul 16 13:40:24 CEST 2008


At 08:20 PM 7/15/2008, Aaron Lanterman wrote:

>Oh, the ear and auditory system is all sorts of nonlinear. I'm not
>arguing that. I just am unsure about the theory that a 100 Hz tone and
>a 201 Hz tone are multiplying in the ear and creating a 101 Hz spur
>that's beating against the 100 Hz tone, creating the 1 Hz that you
>hear.

This isn't something I made up.  It has been studied extensively, starting 
with Helmholtz's work in the 19th century and continuing up to more recent 
studies where probes were surgically inserted into the ear.


>You can directly _see_ the 1 Hz pattern in the waveform

Of *course* you can see the 1 Hz beats. There is an amplitude difference 
when sinusoids  are added together with different relative phases.  That 
doesn't mean there is a physical signal at 1 Hz.  In linear theory the only 
frequencies that can be present are the original tones and their sum and 
difference frequencies.  None of these is at 1 Hz.  It doesn't matter a bit 
what your eye sees.  The 1 Hz detuning is simply a convenient quasi-static 
method to demonstrate the perceived timbre change.  You would hear the same 
timbre change if the signals were at 100 Hz and 200.0001 Hz.  The 
comparison would be difficult because the change would be so gradual.  You 
can also hear the same timbre change if you record a superpositioned 100 Hz 
and 200 Hz signal with two different phases and make an A/B comparison. I 
guess nobody is going to understand this unless I do it for you.


>Consider playing a 439 Hz tone against a 441 Hz tone. Your ear
>perceives a 440 Hz tone modulated by a 1 Hz beat, so you'll hear two
>peaks per second, and if you plot a _short time_ Fourier transform of
>it with a small window, that's what you'll see. You'll see it in the
>spectrogram and you'll hear it with your ears, and you can describe
>that without ever having to multiply cos(439 t) with cos(441 t).

But in this case 2 Hz is the difference frequency of the two signals, so of 
*course* you see it in the spectrum and there is energy at that 
frequency.  But this is a qualitatively different situation.  Once again, 
with 100 Hz and 201 Hz there is *no* low frequency difference frequency.


>I think maybe an important distinction to make is about short-term vs.
>non-short term Fourier analysis.

Why?  We are only considering (quasi) steady tones.  But go ahead and do a 
short-time analysis of the 100 Hz + 201 Hz signal.  There won't be any 
significant change in the spectrum with time.  And if the small change 
there is bothers you, redo everything with 100 Hz and 200.1 Hz or 200.00001 
Hz.  You will hear the same timbre change.


>You could write down an equation for an exponentially damped sinusoid
>x(t). LIstening to it, you'll hear a tone that gradually decreases in
>volume.
>
>You could then write down its Fourier transform, which would look like
>two copies of 1/(a+j w) centered at +/- the frequency of your
>sinusoid. In this representation, time's been collapsed, and you
>imagine the wave is made up of the continuum of frequencies - in fact,
>there'd be frequency content at ALL frequencies. But you don't
>perceive all those frequencies.

Because they are very weak and masked by the main signal.  If the envelope 
is short enough that the various frequency components are strong enough to 
be physically perceived then you *do* hear them.  There is a distinct click 
or pop (as I'm sure everyone knows from using EGs with short A/D times).


>Neither the time-domain or frequency-domain equation fully describe
>what we perceive. For that, a short-time Fourier analysis is best.

Again, the signals are very slowly varying (quasi static). I don't see any 
need for short-time Fourier analysis.

   Ian 




More information about the Synth-diy mailing list