Mercurial > hg > qm-dsp
comparison ext/kissfft/README @ 184:76ec2365b250
Bring in kissfft into this repo (formerly a subrepo, but the remote is not responding)
| author | Chris Cannam |
|---|---|
| date | Tue, 21 Jul 2015 07:34:15 +0100 |
| parents | |
| children |
comparison
equal
deleted
inserted
replaced
| 183:7ab3539e92e3 | 184:76ec2365b250 |
|---|---|
| 1 KISS FFT - A mixed-radix Fast Fourier Transform based up on the principle, | |
| 2 "Keep It Simple, Stupid." | |
| 3 | |
| 4 There are many great fft libraries already around. Kiss FFT is not trying | |
| 5 to be better than any of them. It only attempts to be a reasonably efficient, | |
| 6 moderately useful FFT that can use fixed or floating data types and can be | |
| 7 incorporated into someone's C program in a few minutes with trivial licensing. | |
| 8 | |
| 9 USAGE: | |
| 10 | |
| 11 The basic usage for 1-d complex FFT is: | |
| 12 | |
| 13 #include "kiss_fft.h" | |
| 14 | |
| 15 kiss_fft_cfg cfg = kiss_fft_alloc( nfft ,is_inverse_fft ,0,0 ); | |
| 16 | |
| 17 while ... | |
| 18 | |
| 19 ... // put kth sample in cx_in[k].r and cx_in[k].i | |
| 20 | |
| 21 kiss_fft( cfg , cx_in , cx_out ); | |
| 22 | |
| 23 ... // transformed. DC is in cx_out[0].r and cx_out[0].i | |
| 24 | |
| 25 free(cfg); | |
| 26 | |
| 27 Note: frequency-domain data is stored from dc up to 2pi. | |
| 28 so cx_out[0] is the dc bin of the FFT | |
| 29 and cx_out[nfft/2] is the Nyquist bin (if exists) | |
| 30 | |
| 31 Declarations are in "kiss_fft.h", along with a brief description of the | |
| 32 functions you'll need to use. | |
| 33 | |
| 34 Code definitions for 1d complex FFTs are in kiss_fft.c. | |
| 35 | |
| 36 You can do other cool stuff with the extras you'll find in tools/ | |
| 37 | |
| 38 * multi-dimensional FFTs | |
| 39 * real-optimized FFTs (returns the positive half-spectrum: (nfft/2+1) complex frequency bins) | |
| 40 * fast convolution FIR filtering (not available for fixed point) | |
| 41 * spectrum image creation | |
| 42 | |
| 43 The core fft and most tools/ code can be compiled to use float, double, | |
| 44 Q15 short or Q31 samples. The default is float. | |
| 45 | |
| 46 | |
| 47 BACKGROUND: | |
| 48 | |
| 49 I started coding this because I couldn't find a fixed point FFT that didn't | |
| 50 use assembly code. I started with floating point numbers so I could get the | |
| 51 theory straight before working on fixed point issues. In the end, I had a | |
| 52 little bit of code that could be recompiled easily to do ffts with short, float | |
| 53 or double (other types should be easy too). | |
| 54 | |
| 55 Once I got my FFT working, I was curious about the speed compared to | |
| 56 a well respected and highly optimized fft library. I don't want to criticize | |
| 57 this great library, so let's call it FFT_BRANDX. | |
| 58 During this process, I learned: | |
| 59 | |
| 60 1. FFT_BRANDX has more than 100K lines of code. The core of kiss_fft is about 500 lines (cpx 1-d). | |
| 61 2. It took me an embarrassingly long time to get FFT_BRANDX working. | |
| 62 3. A simple program using FFT_BRANDX is 522KB. A similar program using kiss_fft is 18KB (without optimizing for size). | |
| 63 4. FFT_BRANDX is roughly twice as fast as KISS FFT in default mode. | |
| 64 | |
| 65 It is wonderful that free, highly optimized libraries like FFT_BRANDX exist. | |
| 66 But such libraries carry a huge burden of complexity necessary to extract every | |
| 67 last bit of performance. | |
| 68 | |
| 69 Sometimes simpler is better, even if it's not better. | |
| 70 | |
| 71 FREQUENTLY ASKED QUESTIONS: | |
| 72 Q: Can I use kissfft in a project with a ___ license? | |
| 73 A: Yes. See LICENSE below. | |
| 74 | |
| 75 Q: Why don't I get the output I expect? | |
| 76 A: The two most common causes of this are | |
| 77 1) scaling : is there a constant multiplier between what you got and what you want? | |
| 78 2) mixed build environment -- all code must be compiled with same preprocessor | |
| 79 definitions for FIXED_POINT and kiss_fft_scalar | |
| 80 | |
| 81 Q: Will you write/debug my code for me? | |
| 82 A: Probably not unless you pay me. I am happy to answer pointed and topical questions, but | |
| 83 I may refer you to a book, a forum, or some other resource. | |
| 84 | |
| 85 | |
| 86 PERFORMANCE: | |
| 87 (on Athlon XP 2100+, with gcc 2.96, float data type) | |
| 88 | |
| 89 Kiss performed 10000 1024-pt cpx ffts in .63 s of cpu time. | |
| 90 For comparison, it took md5sum twice as long to process the same amount of data. | |
| 91 | |
| 92 Transforming 5 minutes of CD quality audio takes less than a second (nfft=1024). | |
| 93 | |
| 94 DO NOT: | |
| 95 ... use Kiss if you need the Fastest Fourier Transform in the World | |
| 96 ... ask me to add features that will bloat the code | |
| 97 | |
| 98 UNDER THE HOOD: | |
| 99 | |
| 100 Kiss FFT uses a time decimation, mixed-radix, out-of-place FFT. If you give it an input buffer | |
| 101 and output buffer that are the same, a temporary buffer will be created to hold the data. | |
| 102 | |
| 103 No static data is used. The core routines of kiss_fft are thread-safe (but not all of the tools directory). | |
| 104 | |
| 105 No scaling is done for the floating point version (for speed). | |
| 106 Scaling is done both ways for the fixed-point version (for overflow prevention). | |
| 107 | |
| 108 Optimized butterflies are used for factors 2,3,4, and 5. | |
| 109 | |
| 110 The real (i.e. not complex) optimization code only works for even length ffts. It does two half-length | |
| 111 FFTs in parallel (packed into real&imag), and then combines them via twiddling. The result is | |
| 112 nfft/2+1 complex frequency bins from DC to Nyquist. If you don't know what this means, search the web. | |
| 113 | |
| 114 The fast convolution filtering uses the overlap-scrap method, slightly | |
| 115 modified to put the scrap at the tail. | |
| 116 | |
| 117 LICENSE: | |
| 118 Revised BSD License, see COPYING for verbiage. | |
| 119 Basically, "free to use&change, give credit where due, no guarantees" | |
| 120 Note this license is compatible with GPL at one end of the spectrum and closed, commercial software at | |
| 121 the other end. See http://www.fsf.org/licensing/licenses | |
| 122 | |
| 123 A commercial license is available which removes the requirement for attribution. Contact me for details. | |
| 124 | |
| 125 | |
| 126 TODO: | |
| 127 *) Add real optimization for odd length FFTs | |
| 128 *) Document/revisit the input/output fft scaling | |
| 129 *) Make doc describing the overlap (tail) scrap fast convolution filtering in kiss_fastfir.c | |
| 130 *) Test all the ./tools/ code with fixed point (kiss_fastfir.c doesn't work, maybe others) | |
| 131 | |
| 132 AUTHOR: | |
| 133 Mark Borgerding | |
| 134 Mark@Borgerding.net |
