Pure intonation vs. 12 equal temperament musical intervals

Given a musical sound X, its pitch is characterized by the frequency f(X) of its sound wave. For example, if we regard the pitch as a musical note, then the pitch of frequency 2\times\mathrm{freq}(X) corresponds to what is called an octave higher above X; so we have had this feeling of consonance in between two frequencies differing by a multiple of 2. Now a thesis here is that our minds’ ears are musically interested in the ratios of frequencies, and this is the basis of the usual octave division in western music: if you are going to divide pitches between the note X and its higher octave into 12 notes, then it might be better to put this notes in a geometric progression, i.e. the k-th note has frequency \mathrm{freq}(X) \times 2^{\frac{k-1}{12}}. In this manner, the ratio between any two consecutive notes is equal to a universal amount, 2^{1/12}. Pianos, classical guitars, saxophones, and other instruments all obey this tuning, known as 12 equal temperament (12-ET) tuning.

Yet, in the meantime you may have heard of the consonance quality of the perfect fifth interval, e.g. when you play the notes G and C of an octave, that the note G in an octave has the frequency \frac{3}{2}\times \mathrm{freq}(C) for the note C in the same octave, and it’s said that it is because of this simplest ratio of \frac{3}{2} in between them that the two notes sound so resonant to our ears (I checked with Google Gemini now, it said it too). But is the ratio actually equal to \frac{3}{2}? Doing the math from the above paragraph, we have that the perfect fifth interval G is 8 divisions (called semitones officially) higher than C, we have k=8 in the above formula and

\mathrm{freq}(G)=\mathrm{freq}(C)\times 2^{7/12} \simeq \mathrm{freq}(C) \times 1.498

so the actual ratio is not 1.5! What’s happening here?

Well first off, the numbers 1.498 is so close to 1.5 so that \mathrm{freq}(C)\times 1.498 and \mathrm{freq}(C) \times 1.5 are sonically indistinguishable to our ears (or you are blessed with divine ears), hence it might be the case that actually 1.498 is the number of magical quality providing the consonance rather than 1.5. We still haven’t ruled out the saying “simple ratios (like \frac{3}{2}) of notes \implies consonance between the notes”, but we have to further investigate it.

Simple ratios indeed make physical sense through the harmonic series and overtones: If we play a note, say C, on the piano, it generates various sine waves (also called harmonics) of frequencies \mathrm{freq}(C) \times n for all n \in \mathbb{Z}, each of which with some loudness (the loudness of these auxiliary waves depends on the shape of the original wave, determining the timbre of the instrument). So we are hearing all of the frequencies \mathrm{freq}(C), 2\mathrm{freq}(C), 3\mathrm{freq}(C), 4\mathrm{freq}(C),\dots to some extent, and in most natural sounds, the loudness of the present frequencies tend to decrease as we go further in this sequence. Now notice that the auxiliary wave of frequency 3 \mathrm{freq}(C) is \frac{3}{2} away from the wave of frequency 2 \mathrm{freq}(C), i.e. we are hearing an interval of perfect fifth just by playing one note on the piano. Similarly, our mind is hearing a ratio of \frac{4}{3} through the third and fourth frequencies. And a ratio of \frac{5}{3}, etc. So if we agree that a note on piano sounds pleasant, then these observations might lead us to consider 1.5 as the correct ratio for the consonance of the perfect fifth, rather than 1.498, and further to define a simple ratio as “a ratio whose numerator and denominator appear soon in the sequence 1,2,3,4\dots“. Pure intonation, also called Just intonation (JI), is a tuning system based on this philosophy. Here’s a table of the justly tuned C major scale.

NoteCDEFGABC
Ratio from C1/19/85/44/33/25/315/82/1
Cents020438649870288410881200

where cent is just computing the power of 2 in the geometric series approach by \mathrm{freq}(X) = \mathrm{freq}(C) \times 2^{\frac{\mathrm{cent}(X)}{1200}}. So now we can compare the last row and see the difference between the 12 ET system where the cent values for the notes in the C major scale are respectively 0, 200,400,500,700,900,1100 and 1200. Here’s Claire de Lune played in just intonation.

In fact, harmonic-series-based approaches are among the earliest ones used for tuning, dating back to Pythagoras at the latest (see Pythagorean tuning which is similar in idea to just intonation). The dots on the soundboard of the Chinese instrument Guqin indicates some of the harmonic positions. The just intonation and Pythagorean tuning where widely used in western music even in the 15th century. Considering the trajectory of the discussion, we might now be turning the unjustifiability objection towards the 12 ET system instead of just intonation.

Here’s a major issue with just intonation. Suppose we started with the root note C and formed our intervals based on the table above, and let’s say we want to play in the key of D major. As before, the perfect fifth interval humanly has an important quality to our scale, which is now the note A. What is the ratio of D to its perfect fifth A in this tuning? Using their ratios with C we can compute the ratio as \frac{5 /3}{9/ 8}=\frac{40}{27}\simeq1.481; i.e. in just intonation, the perfect fifth of C has ratio 1.5 with it (by construction) whereas the perfect fifth of D has ratio 1.481 with D. Human perception of pitch is roughly based on the logarithm of ratios, e.g. based on cents, with about 5-10 cents threshold for identifying pitch change (not a universal number though; it depends on the interval too, e.g. for the octave the threshold is about 2 cents for a trained ear), so converting to cents we have \log _2(1.5)\times 1200 \simeq 701.955 versus \log _2\left( 1.481 \right)\times 1200 \simeq 680.448, whose difference is over 21 cents—way bigger than the threshold of 5! This means that if we are to use various scales in our just intonation system, the various perfect fifth intervals don’t provide a uniform sonic experience. On the other hand, clearly the 12 ET system, or any other equally divided octave system for that matter, doesn’t have this issue by construction. In fact equal temperament systems came to the scene to remedy the inconsistent interval experience. Back in time not much key modulation was felt to be necessary; the black and white alignment of the piano keys based on the C major scale to me resembles an era of commitment to single scale music. Bach seems to be among the first to use equal temperament tunings to explore key changes. A reason that 12 is a good number for an ET system in my view is that it contains notes with ratios pretty close to simple ratios with the root frequency, i.e. we have the \frac{3}{2} (the perfect fifth) and \frac{4}{3} (major fourth) by only 2 cents off.

One last thing I would like to point out is that 12 ET has been practiced prevalently for no more than 500 years, and before that time, it was considered dissonant and unpleasant. When I now listen to a major third in just intonation, it sounds strange and off to me; you may also have felt the same in listening to the Claire de Lune above. This demonstrates that human apprehension of sonic beauty is shaped by our cultural influences and our experiential memory. If you are interested in other tuning systems with different philosophies, check out Xenharmonic.


Discover more from Investigations in Forms

Subscribe to get the latest posts sent to your email.

Leave a comment