silvet: constant-q-cpp/src/ext/kissfft/README annotate

annotate constant-q-cpp/src/ext/kissfft/README @ 372:af71cbdab621 tip

Update bqvec code

author	Chris Cannam
date	Tue, 19 Nov 2019 10:13:32 +0000
parents	5d0a2ebb4d17
children

rev	line source
Chris@366	1 KISS FFT - A mixed-radix Fast Fourier Transform based up on the principle,
Chris@366	2 "Keep It Simple, Stupid."
Chris@366	3
Chris@366	4 There are many great fft libraries already around. Kiss FFT is not trying
Chris@366	5 to be better than any of them. It only attempts to be a reasonably efficient,
Chris@366	6 moderately useful FFT that can use fixed or floating data types and can be
Chris@366	7 incorporated into someone's C program in a few minutes with trivial licensing.
Chris@366	8
Chris@366	9 USAGE:
Chris@366	10
Chris@366	11 The basic usage for 1-d complex FFT is:
Chris@366	12
Chris@366	13 #include "kiss_fft.h"
Chris@366	14
Chris@366	15 kiss_fft_cfg cfg = kiss_fft_alloc( nfft ,is_inverse_fft ,0,0 );
Chris@366	16
Chris@366	17 while ...
Chris@366	18
Chris@366	19 ... // put kth sample in cx_in[k].r and cx_in[k].i
Chris@366	20
Chris@366	21 kiss_fft( cfg , cx_in , cx_out );
Chris@366	22
Chris@366	23 ... // transformed. DC is in cx_out[0].r and cx_out[0].i
Chris@366	24
Chris@366	25 free(cfg);
Chris@366	26
Chris@366	27 Note: frequency-domain data is stored from dc up to 2pi.
Chris@366	28 so cx_out[0] is the dc bin of the FFT
Chris@366	29 and cx_out[nfft/2] is the Nyquist bin (if exists)
Chris@366	30
Chris@366	31 Declarations are in "kiss_fft.h", along with a brief description of the
Chris@366	32 functions you'll need to use.
Chris@366	33
Chris@366	34 Code definitions for 1d complex FFTs are in kiss_fft.c.
Chris@366	35
Chris@366	36 You can do other cool stuff with the extras you'll find in tools/
Chris@366	37
Chris@366	38 * multi-dimensional FFTs
Chris@366	39 * real-optimized FFTs (returns the positive half-spectrum: (nfft/2+1) complex frequency bins)
Chris@366	40 * fast convolution FIR filtering (not available for fixed point)
Chris@366	41 * spectrum image creation
Chris@366	42
Chris@366	43 The core fft and most tools/ code can be compiled to use float, double,
Chris@366	44 Q15 short or Q31 samples. The default is float.
Chris@366	45
Chris@366	46
Chris@366	47 BACKGROUND:
Chris@366	48
Chris@366	49 I started coding this because I couldn't find a fixed point FFT that didn't
Chris@366	50 use assembly code. I started with floating point numbers so I could get the
Chris@366	51 theory straight before working on fixed point issues. In the end, I had a
Chris@366	52 little bit of code that could be recompiled easily to do ffts with short, float
Chris@366	53 or double (other types should be easy too).
Chris@366	54
Chris@366	55 Once I got my FFT working, I was curious about the speed compared to
Chris@366	56 a well respected and highly optimized fft library. I don't want to criticize
Chris@366	57 this great library, so let's call it FFT_BRANDX.
Chris@366	58 During this process, I learned:
Chris@366	59
Chris@366	60 1. FFT_BRANDX has more than 100K lines of code. The core of kiss_fft is about 500 lines (cpx 1-d).
Chris@366	61 2. It took me an embarrassingly long time to get FFT_BRANDX working.
Chris@366	62 3. A simple program using FFT_BRANDX is 522KB. A similar program using kiss_fft is 18KB (without optimizing for size).
Chris@366	63 4. FFT_BRANDX is roughly twice as fast as KISS FFT in default mode.
Chris@366	64
Chris@366	65 It is wonderful that free, highly optimized libraries like FFT_BRANDX exist.
Chris@366	66 But such libraries carry a huge burden of complexity necessary to extract every
Chris@366	67 last bit of performance.
Chris@366	68
Chris@366	69 Sometimes simpler is better, even if it's not better.
Chris@366	70
Chris@366	71 FREQUENTLY ASKED QUESTIONS:
Chris@366	72 Q: Can I use kissfft in a project with a ___ license?
Chris@366	73 A: Yes. See LICENSE below.
Chris@366	74
Chris@366	75 Q: Why don't I get the output I expect?
Chris@366	76 A: The two most common causes of this are
Chris@366	77 1) scaling : is there a constant multiplier between what you got and what you want?
Chris@366	78 2) mixed build environment -- all code must be compiled with same preprocessor
Chris@366	79 definitions for FIXED_POINT and kiss_fft_scalar
Chris@366	80
Chris@366	81 Q: Will you write/debug my code for me?
Chris@366	82 A: Probably not unless you pay me. I am happy to answer pointed and topical questions, but
Chris@366	83 I may refer you to a book, a forum, or some other resource.
Chris@366	84
Chris@366	85
Chris@366	86 PERFORMANCE:
Chris@366	87 (on Athlon XP 2100+, with gcc 2.96, float data type)
Chris@366	88
Chris@366	89 Kiss performed 10000 1024-pt cpx ffts in .63 s of cpu time.
Chris@366	90 For comparison, it took md5sum twice as long to process the same amount of data.
Chris@366	91
Chris@366	92 Transforming 5 minutes of CD quality audio takes less than a second (nfft=1024).
Chris@366	93
Chris@366	94 DO NOT:
Chris@366	95 ... use Kiss if you need the Fastest Fourier Transform in the World
Chris@366	96 ... ask me to add features that will bloat the code
Chris@366	97
Chris@366	98 UNDER THE HOOD:
Chris@366	99
Chris@366	100 Kiss FFT uses a time decimation, mixed-radix, out-of-place FFT. If you give it an input buffer
Chris@366	101 and output buffer that are the same, a temporary buffer will be created to hold the data.
Chris@366	102
Chris@366	103 No static data is used. The core routines of kiss_fft are thread-safe (but not all of the tools directory).
Chris@366	104
Chris@366	105 No scaling is done for the floating point version (for speed).
Chris@366	106 Scaling is done both ways for the fixed-point version (for overflow prevention).
Chris@366	107
Chris@366	108 Optimized butterflies are used for factors 2,3,4, and 5.
Chris@366	109
Chris@366	110 The real (i.e. not complex) optimization code only works for even length ffts. It does two half-length
Chris@366	111 FFTs in parallel (packed into real&imag), and then combines them via twiddling. The result is
Chris@366	112 nfft/2+1 complex frequency bins from DC to Nyquist. If you don't know what this means, search the web.
Chris@366	113
Chris@366	114 The fast convolution filtering uses the overlap-scrap method, slightly
Chris@366	115 modified to put the scrap at the tail.
Chris@366	116
Chris@366	117 LICENSE:
Chris@366	118 Revised BSD License, see COPYING for verbiage.
Chris@366	119 Basically, "free to use&change, give credit where due, no guarantees"
Chris@366	120 Note this license is compatible with GPL at one end of the spectrum and closed, commercial software at
Chris@366	121 the other end. See http://www.fsf.org/licensing/licenses
Chris@366	122
Chris@366	123 A commercial license is available which removes the requirement for attribution. Contact me for details.
Chris@366	124
Chris@366	125
Chris@366	126 TODO:
Chris@366	127 *) Add real optimization for odd length FFTs
Chris@366	128 *) Document/revisit the input/output fft scaling
Chris@366	129 *) Make doc describing the overlap (tail) scrap fast convolution filtering in kiss_fastfir.c
Chris@366	130 *) Test all the ./tools/ code with fixed point (kiss_fastfir.c doesn't work, maybe others)
Chris@366	131
Chris@366	132 AUTHOR:
Chris@366	133 Mark Borgerding
Chris@366	134 Mark@Borgerding.net

Mercurial > hg > silvet

annotate constant-q-cpp/src/ext/kissfft/README @ 372:af71cbdab621 tip