<-- Back to Previous Page | TOC | Next Chapter --> |
Chapter 3: The Frequency DomainSection 3.6: Some Alternatives to the FFT
|
||
We know that in previous sections weve made it sound like the FFT is the only game in town, but thats not entirely true. Historically,the FFT was the first, most important, and most widely used algorithm for frequency domain analysis. Musicians, especially, have taken a liking to it because there are some well-known references on its implementation, and computer musicians especially have gained a lot of experience in its nuances. Its fairly simple to program, and there are a lot of existing software programs to use if you get lazy. Also, it works pretty well (emphasis on the word "pretty"). The FFT breaks up a sound into sinusoids, but there may be reasons why youd like to break a sound into other sorts of basic sounds. These alternate transforms can work better, for some purposes, than the FFT. (A transform is just something that takes a list of the sample values and turns the values into a new list of numbers that describes a possible new way to add up different basic sounds.) For example, wavelet transforms (sometimes called dynamic base transforms) can modify the resolution for different frequency ranges, unlike the FFT, which has a constant bandwidth (and thus is, by definition, less sensitive to lower frequenciesthe important ones than to higher ones!). Wavelets also use a variety of different analysis waveforms (as opposed to the FFT, which only uses sinusoids) to get a better representation of a signal. Unfortunately, wavelet transforms are still a bit uncommon in computer software, since they are, in general, harder to implement than FFTs. One of the big problems is deciding which wavelet to use, and in what frequency bandwidth. Often, that decision-making process can be more important (and time-consuming) than the actual transform! Wavelets are a pretty hot topic though, so well probably be seeing some wavelet-based techniques emerge in the near future. McAulay-Quatieri (MQ) AnalysisAnother interesting approach to a more organized, information-rich set of transforms is the extended McAulay-Quatieri (MQ) analysis algorithm. This is used in the popular Macintosh program Lemur Pro, written by the brilliant computer music researcher Kelly Fitz. MQ analysis works a lot like normal FFTsit tries to figure out how a sound can be re-created using a number of sine waves. However, Lemur Pro is an enhancement of more traditional FFT-based software in that it uses the resulting sine waves to extract frequency tracks from a sound. These tracks attempt to follow particular components of the sound as they change over time. That is, MQ not only represents the amplitude trajectories of partials, but also tries to describe the ways that spectral components change in frequency over time. This is a fundamentally different way of describing spectra, and a powerful one. MQ is an excellent idea, as it allows the analyst a much higher level of data representation and provides a more perceptually significant view of the sound. A number of more advanced FFT-based programs also implement some form of frequency tracking in order to improve the results of the analysis, similar to that of MQ analysis. For example, even though a sine wave might sweep from 1 kHz to 2 kHz, it would still be contained in one MQ track (rather than fading in and out of FFT bins, forming what are called lobes). An MQ-type analysis is often more useful for sound work than regular FFT analysis, since it has sonic cognition ideas built into it. For example, the ear tends to follow frequencies in time, much like an MQ analysis, and in our perception loud frequencies tend to mask neighboring quieter frequencies (as do MQ analyses). These kinds of sophisticated analysis techniques, where frequency and amplitude are isolated, suggest purely graphical manipulation of sounds, and well show some examples of that in Section 5.7. |
<-- Back to Previous Page | Next Chapter --> |
©Burk/Polansky/Repetto/Roberts/Rockmore. All rights reserved.