dicklyon@534: CARFAC Design Doc
dicklyon@534: by "Richard F. Lyon" <dicklyon@google.com>
dicklyon@567: updated 24 May 2012 (v.237)
dicklyon@534: 
dicklyon@534: The CAR-FAC (cascade of asymmetric resonators with fast-acting
dicklyon@534: compression) is a cochlear model implemented as an efficient sound
dicklyon@534: processor, for mono, stereo, or multi-channel sound inputs.  This file
dicklyon@534: describes aspects of the software design.
dicklyon@534: 
dicklyon@563: The implementation will include equivalent Matlab, C++, Python, and
dicklyon@534: perhaps other versions.  They should all be based on the same set of
dicklyon@534: classes (or structs) and functions, with roughly equivalent
dicklyon@534: functionality and names where possible.  The examples here are Matlab,
dicklyon@534: in which the structs are typeless, but in other languages each will be
dicklyon@534: class or type.
dicklyon@534: 
dicklyon@534: The top-level class is CARFAC.  A CARFAC object knows how to design
dicklyon@534: its details from a modest set of parameters, and knows how to process
dicklyon@534: sound signals to produce "neural activity patterns" (NAPs).  It
dicklyon@534: includes various sub-objects representing the parameters, designs, and
dicklyon@534: states of different parts of the CAR-FAC model.
dicklyon@534: 
dicklyon@563: The CARFAC class includes a vector of "ear" objects -- one for mono,
dicklyon@563: two for stereo, or more.  
dicklyon@563: 
dicklyon@563: The three main subsystems of the EAR are the cascade of asymmetric
dicklyon@534: resonators (CAR), the inner hair cell (IHC), and the automatic gain
dicklyon@565: control (AGC).  These are not intended to work independently, but
dicklyon@565: each part has three groups of data associated with it.  The CARFAC 
dicklyon@565: stores instances of the "params" that drive the design, and the
dicklyon@565: "coeffs" and "state" are stored per ear:
dicklyon@534: 
dicklyon@534:   CAR_params, CAR_coeffs, CAR_state
dicklyon@534:   IHC_params, IHC_coeffs, IHC_state
dicklyon@534:   AGC_params, AGC_coeffs, AGC_state
dicklyon@534: 
dicklyon@534: These names can be used both for the classes, and slightly modified
dicklyon@534: for the member variables, arguments, temps, etc.
dicklyon@534: 
dicklyon@534: The params are inputs that specify the system; the coeffs are things
dicklyon@534: like filter coefficients, things computed once and used at run time;
dicklyon@534: the state is whatever internal state is needed between running the
dicklyon@534: model on samples or segments of sound waveforms, such as the state
dicklyon@534: variables of the digital filters.
dicklyon@534: 
dicklyon@565: At construction ("design") time, params are provided, and ears with coeffs 
dicklyon@565: are created.  The created CARFAC stores the params that it was designed
dicklyon@565: from; there is no state yet:
dicklyon@534: 
dicklyon@534:   CF = CARFAC_Design(CAR_params, IHC_params, AGC_params)
dicklyon@534: 
dicklyon@534: 
dicklyon@534: On "channels" and "ears":
dicklyon@534: 
dicklyon@534: Stages of the cascade are called "channels", and there are n_ch of
dicklyon@534: them (n_ch is determined from other parameters, but we could also make
dicklyon@534: a function to pick params to get a desired n_ch).  Since we already
dicklyon@534: used "channels" for that, sound input channels (one for monaural, two
dicklyon@534: for binaural, or more) are called "ears", and there are n_ears of them.
dicklyon@534: The parameter n_ears is set when state is initialized, but
dicklyon@534: not earlier at design time; it is not in params since it doesn't
dicklyon@534: affect coefficient design).
dicklyon@534: 
dicklyon@565: Multi-ear designs usually have the same coeffs across several ears, but 
dicklyon@565: always need separate state.  
dicklyon@565: The coeffs are kept separate per ear in case someone wants to
dicklyon@565: modify one or both to simulate asymmetric hearing loss or such.
dicklyon@565: 
dicklyon@565: The only  place the ears interact is in the AGC.  The function that
dicklyon@565: closes the AGC loop can "mix" their states, making an inter-ear
dicklyon@565: coupling.  Other than that, the functions that update the CAR and IHC 
dicklyon@565: and AGC states are simple single-ear functions (methods of the ear
dicklyon@565: class?).
dicklyon@534: 
dicklyon@534: 
dicklyon@534: Data size and performance:
dicklyon@534: 
dicklyon@534: The coeffs and states are several kilobytes each, since they store a
dicklyon@565: handful (10 or so) of floating-point values (at 4 or 8 bytes each) for
dicklyon@534: each channel (typically 60 to 100 channels); that 240 to 800 bytes per
dicklyon@534: coefficient and per state variable.  That's the entire data memory
dicklyon@534: footprint; most of it is accessed at every sample time (hopefully it
dicklyon@534: will all fit and have good hit rate in a typical 32 KB L1 d-cache).
dicklyon@534: In Matlab we use doubles (8-byte), but in C++ we intend to use floats
dicklyon@534: (4-byte) and SSE (via Eigen) for higher performance.  Alternate
dicklyon@534: implementations are OK.
dicklyon@534: 
dicklyon@534: 
dicklyon@534: Run-time strategy:
dicklyon@534: 
dicklyon@534: To support real-time applications, sound can be processed in short
dicklyon@534: segments, producing segments of NAPs:
dicklyon@534: 
dicklyon@534:   [NAPs, CF] = CARFAC_Run_Segment(CF, input_waves)
dicklyon@534: 
dicklyon@534: Here the plurals ("NAPs" and "input_waves") suggest multiple ears.  
dicklyon@534: These can be vectors, with length 1 for mono, or Matlab
dicklyon@534: multi-D arrays with a singleton last dimension.
dicklyon@534: 
dicklyon@534: The CF's states are updated such that there will be no glitch when a next
dicklyon@534: segment is processed.  Segments are of arbitrary positive-integer
dicklyon@534: length, not necessarily equal.  It is not inefficient to use a segment
dicklyon@534: length of 1 sample if results are needed with very low latency.
dicklyon@534: 
dicklyon@534: Internally, CARFAC_Run updates each part of the state, one sample at a
dicklyon@565: time.  First the CAR, IHC, and AGC are updated ("stepped") for all ears:
dicklyon@534: 
dicklyon@534:   [car_out, CAR_state] = CARFAC_CAR_Step(sample,  CAR_coeffs, CAR_state)
dicklyon@534:   [ihc_out, IHC_state] = CARFAC_IHC_Step(car_out, IHC_coeffs, IHC_state)
dicklyon@565:   [AGC_state, updated] = CARFAC_AGC_Step(ihc_out, AGC_coeffs, AGC_state)
dicklyon@534: 
dicklyon@534: The AGC filter mostly runs at a lower sample rate (by a factor of 8 by 
dicklyon@534: default). Usually it just accumulates its input and returns quickly. The 
dicklyon@534: boolean "updated" indicates whether the AGC actually did some work and 
dicklyon@534: has a new output that needs to be used to "close the loop" and modify 
dicklyon@534: the CAR state. When it's true, there's one more step, involving both AGC 
dicklyon@534: and CAR states, so it's a function on the CARFAC: 
dicklyon@534: 
dicklyon@534:   CF = CARFAC_Close_AGC_Loop(CF)
dicklyon@534: 
dicklyon@534: In Matlab, these functions return a modified copy of the state or
dicklyon@534: CARFAC; that's why we have the multiple returned values.  In languages
dicklyon@534: that do more than just call-by-value, a reference will be passed so
dicklyon@534: the state can be modified in place instead.
dicklyon@534: 
dicklyon@534: 
dicklyon@534: C++ Eigen strategy:
dicklyon@534: 
dicklyon@534: The mapping from Matlab's "parallel" operations on vectors of values
dicklyon@534: into C++ code will be done by using Eigen
dicklyon@534: (http://eigen.tuxfamily.org/) arrays, which support similar
dicklyon@534: source-level parallel arithmetic, so we don't have to write loops over
dicklyon@534: channels.  Eigen also allows fairly efficient compilation to SSE or
dicklyon@534: Arm/Neon instructions, which can do four float operations per cycle.
dicklyon@534: So we should be able to easily get to efficient code on various
dicklyon@534: platforms.
dicklyon@534: 
dicklyon@534: If there are similar strategies available in other languages, we should 
dicklyon@534: use them.  Python may have a route to using Eigen
dicklyon@534: (http://eigen.tuxfamily.org/index.php?title=PythonInterop).