/ home / articles / harmony
Harmony and Waveforms

Introduction

An interesting question, which musicians often overlook, is "what makes a chord sound good ?". In other words, why a particular combination of notes sounds better than another one. With "sounds better" I mean "is percieved as more desiderable" or "gives more pleasure to the listener"... choose the definition that best matches for you.

This article attempts to give at least some hints for answering this simple question.

Sound

By sound, we commonly mean the vibrations that travel through air and can be heard by humans. However, scientists and engineers use a wider definition of sound that includes low and high frequency vibrations in air that cannot be heard, and vibrations that travel through all forms of matter, gases, liquids and solids. The matter that supports the sound is called the medium. Sound propagates as waves of alternating pressure, causing local regions of compression and rarefaction. Particles in the medium are displaced by the wave and oscillate.

As a wave, sound is characterized by the properties of waves including frequency, wavelength, period, amplitude and velocity or speed.

The range of frequencies that humans can hear is approximately between 20 Hz and 20,000 Hz. This range is by definition the audible spectrum, but some people (particularly women) can hear above 20,000 Hz. This range varies by individual and generally shrinks with age, mostly in the upper part of the spectrum. The ear is most sensitive to frequencies around 3,500 Hz. Sound above 20,000 Hz is known as ultrasound; sound below 20 Hz as infrasound.

Waveforms

Variations in air pressure against the ear drum, and the subsequent physical and neurological processing and interpretation, give rise to the experience called "sound". Most sound that people recognize as "musical" is dominated by periodic or regular vibrations rather than non-periodic ones (called a definite pitch), and we refer to the transmission mechanism as a "sound wave". In a very simple case, the sound of a sine wave, which is considered to be the most basic model of a sound waveform, causes the air pressure to increase and decrease in a regular fashion, and is heard as a very "pure" tone.

pure tone
Pure tone at 1 Hz.

Pitch

The rate at which the air pressure varies governs the frequency of the tone, which is also measured in oscillations per second, or hertz. Frequency is a primary determinate of the perceived pitch.

Pitches are often labeled using scientific pitch notation or some combination of a letter and a number representing a fundamental frequency. Human pitch perception is logarithmic with respect to fundamental frequency: the perceived distance between the pitches "220 Hz" and "440 Hz" is the same as the perceived distance between the pitches "440 Hz" and "880 Hz".

The distance between two pitches related by a frequency factor of 2 is called octave. A sound oscillating at 440 Hz (that is, 440 times per second) is called A (A4) and is being used as reference in the widely adopted musical pitch standard (American Standard). All the pitches obtained by multiplying or dividing 440 by 2 are still called A, just belonging to higher or lower octave.

Obviously there is an infinity of other pitches between two adiacent A. However we choose to use only some of them to compose music and the reason of this choice will be depicted later in this document.

Simultaneous sounds

Whenever two different pitches are played at the same time, their sound waves interact with each other - the highs and lows in the air pressure reinforce each other to produce a different sound wave. As a result, any given sound wave which is more complicated than a sine wave can, nonetheless, be modelled by many different sine waves of the appropriate frequencies and amplitudes (a frequency spectrum). In humans the hearing apparatus (composed of the ears and brain) can isolate these tones and hear them distinctly. When two or more tones are played at once, a single variation of air pressure at the ear "contains" the pitches of each, and the ear and/or brain isolate and decode them into distinct tones.

octave
Octave: the sum of pure sounds at 1 Hz and 2 Hz. The overall wave still vibrates at 1 Hz.

There are mathematic instruments that are able to decompose complex sounds into combinations of pure sounds. One of such instruments is the Fourier Transform. The spectrum obtained by applying the Fourier Transform to the octave waveform in the figure above would have two peaks (Dirac's deltas) placed at 1 Hz and 2 Hz, that is it would decompose the complex waveform into its pure sinusoidal components.

octave fourier transform
Qualitative Fourier Transform of an octave interval.

Harmonics and Timbre

The "natural" sounds usually aren't pure. The waveforms aren't necessairly sinusoidal but may be, for example, square or triangular. However, as long as the frequency of the waveform is well defined, the perceived pitch is the same. The perceived property that changes with the waveform is called timbre.

Non pure waveforms can be still decomposed in combinations of pure sounds by the means of the Fourier Transform. An interesting property of the FT is that the spectrum of a periodic signal (waveform) is made up of peaks that "sample up" the spectrum of the waveform obtained by truncating the original one to a single period. The detailed explaination of the Fourier Transform properties go beyond the scope of this document and can be found out by consulting a book about Signal Theory. It is sufficient to state that the peaks are placed at integer multiplies of the fundamental frequency f (so at 2f, 3f, 4f etc..). The peaks at 2f, 4f, 8f, 16f etc (powers of 2) are octaves while the others are just partial harmonics.

periodic signal fourier transform
Very Qualitative Fourier Transform of a periodic triangle wave

Thus when the original sound sources are perfectly periodic, the note consists of several related sine waves (which mathematically add to each other) called the fundamental and the harmonics, partials, or overtones. The sounds have harmonic frequency spectra. The lowest frequency present is the fundamental, and is the frequency that the entire wave vibrates at. The overtones vibrate faster than the fundamental, but must vibrate at integer multiples of the fundamental frequency in order for the total wave to be exactly the same each cycle. Real instruments are close to periodic, but the frequencies of the overtones are slightly imperfect, so the shape of the wave changes slightly over time.

Finally, the combination of non perfectly periodic instruments, non sinusoidal sources and complex interactions with the transmitting medium (air) lets us perceive a large set of frequencies when a single note is played. The spectrum of such a sound is usually still made up of peaks but in certain cases may even end up being continous.

violin spectra

The spectrum of a playing violin.
The frequency is on the Y axis while X axis rappresents time.
The bright lines along the bottom are fundamental sounds while the upper ones are harmonics and overtones.

Harmony

If two pure notes are simultaneously played, with frequency ratios that are simple fractions (e.g. 2/1, 3/2 or 5/4), then the composite wave will still be periodic with a short period, and the combination will sound consonant. For instance, a note vibrating at 200 Hz and a note vibrating at 300 Hz (a perfect fifth, or 3/2 ratio, above 200 Hz) will add together to make a wave that repeats at 100 Hz: every 1/100 of a second, the 300 Hz wave will repeat thrice and the 200 Hz wave will repeat twice. Note that the total wave repeats at 100 Hz, but there is not actually a 100 Hz sinusoidal component present.

pure tone
Perfect fifth: the sum of pure tones at 1 Hz and 1.5 Hz. The wave repeats at 0.5 Hz.

Additionally, when considering real instruments, the notes will have their harmonics and partials. It ends up that when the two notes played have simple fraction frequency ratios then they also will have many of the same partials. For instance, a note with a fundamental frequency of 200 Hz will have harmonics at (200,) 400, 600, 800, 1000, 1200, ... A note with fundamental frequency of 300 Hz will have harmonics at (300,) 600, 900, 1200, 1500, ... The two notes have the harmonics 600 and 1200 in common, and more will coincide further up the series. The combination of composite waves with short fundamental frequencies and shared or closely related partials is what causes the sensation of harmony.

violin perfect fifth spectra

The spectrum of a violin playing first the fundamental note and then a perfect fifth above it.
The shared partials are highlighted by white lines.

The Natural Scale

For several (historic and physic) reasons we choose a set of pitches between two octaves to form our contemporary music. The notes choosen turn out to relatiely simple frequency ratios when played together and thus sound "naturally" consonant.

The natural scale is attributed to the Greek philosopher Aristoxenus Tarentinus. After fixing the frequency of the first note - the C of the scale - the frequencies of the other notes are determined from the ratios indicated in the following table. On the last C the following octave begins and the operation can be repeated. The following table shows the ratios between the frequencies of all the notes of the scale and the fixed frequency of the first note of the scale.

CDEFGABC
19/85/44/33/25/315/82

The interval between C and G is our perfect fifth as depicted above.

There is one major problem with the natural scale and the just intonation derived from it. The ratio of the frequencies of two notes which differ for one tone is not always the same. Consequently a certain melody cannot be played starting from a random note of the scale. For instance, a melody starting with the two notes C and D (ratio 9/8) cannot be transposed one tone higher, since the ratio of the frequencies of E and of D is very near ((5/4)/(9/8) = 10/9), but not equal to 9/8.

Equal temperament

To obviate this inconveniency, we today use the so-called equal temperament, which constitutes the compromise adopted in modern western music. Earlier western music used other compromises.

Equal temperament is a scheme of musical tuning in which the octave is divided into a series of equal steps (equal frequency ratios). The best known example of such a system is twelve-tone equal temperament which is nowadays used in most Western music. Other equal temperaments do exist (some music has been written in 19-TET and 31-TET for example, Arabian and eastern styled music is based on a 24-tone equal temperament).

Mathematically, each successive pitch is related to the previous pitch by a factor of the twelfth root of 2. That is, the ratio between the frequencies of any two successive pitches is 1.05946309436. This is also the value of the ratio of the widths of two consecutive frets on modern guitars. The twelfth fret divides the string in two exact halves (octave).

The first "usable" pitch above the reference A is thus 440 * 1.05946309436 = 466.1637615184.

12-TET (Tone Equal Temperament) allows the use of integer notation and modulo 12, and this allows for proofs concerning pitch.

The following table shows the values of the intervals of 12 TET, along with one interval from just intonation that each approximates, and the percentage by which they differ:

Name Decimal value Just intonation interval Percent difference
Unison 1.000000 1 = 1.000000 0.00%
Minor second 1.059463 16/15 = 1.066667 0.68%
Major second 1.122462 9/8 = 1.125000 0.23%
Minor third 1.189207 6/5 = 1.200000 0.91%
Major third 1.259921 5/4 = 1.250000 0.79%
Perfect fourth 1.334840 4/3 = 1.333333 0.11%
Diminished fifth 1.414214 7/5 = 1.400000 1.02%
Perfect fifth 1.498307 3/2 = 1.500000 0.11%
Minor sixth 1.587401 8/5 = 1.600000 0.79%
Major sixth 1.681793 5/3 = 1.666667 0.90%
Minor seventh 1.781797 16/9 = 1.777778 0.23%
Major seventh 1.887749 15/8 = 1.875000 0.68%
Octave 2.000000 2/1 = 2.000000 0.00%

Obviously the 12-TET leads to "non perfectly simple" frequency ratios when notes are played together and thus leads to slightly dissonant chords. However, the dissonances aren't easily audible.

perfect and well tempered fifth comparison
Above, a perfect fifth, Below, a 12-TET fifth

perfect and well tempered fifth comparison
Above, a perfect fifth, Below, a 12-TET fifth



here
we're consonant
there
blog
articles
software
pragmaware
pragma
photos
contact
login
past
h 0x06f4b6
u 0x068510
p 0x0004f3
future
working on it...