Evaluation and comparison of different machine learning approaches to auditory spectro-temporal receptive field estimation
© Meyer et al; licensee BioMed Central Ltd. 2011
Published: 18 July 2011
The linear spectro-temporal receptive field (STRF) is a well-known approach to describe which features are encoded by auditory cortical neurons . It is defined as the linear filter that, when convolved with the envelope of a stimulus, gives a linear estimate of the spike rate evoked by that stimulus. A common STRF estimation method is reverse correlation, also known as spike-triggered average (STA), where the stimulus parts preceding the spikes are averaged in a specific time window. Linear regression approaches estimate an STRF based on the ensemble-averaged response spike rate. In  we have shown that the linear STRF model can be reformulated in terms of linear classification and a novel method using a support vector machine (SVM) classifier has been presented. Hence, given a set of stimuli with evoked response ensembles measured for a neuron the STRF of that specific neuron can be estimated using any of these methods.
Reverse correlation, linear regression and the SVM-based approach are evaluated using neural recordings from the primary auditory cortex of mongoelan gerbils  and synthetic data created using an inhomogeneous Linear-nonlinear Poisson (LNP) model with refractory period . In the LNP model, STRFs with different characteristics are used in the linear stage of the model and the average spike rate and the number of trials are varied. Complexes of frequency modulated (FM) sweeps as described in  are used as stimuli for both types of data. All methods are tested and evaluated using 5-fold cross-validation (CV). Model complexity in terms of Principal Components (PC) for linear regression and the SVM classifier is determined by using the least complex model within one standard deviation of the maximum mean coherence between estimated and predicted spike rate. The resulting STRFs are evaluated using mean coherence, the area under the receiver operating curve (AUC) for single spike classification and STRF variability for the different CV folds.
- Theunissen FE, Sen K, Doupe AJ: Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural sounds. J Neurosci. 2000, 20: 2315-2331.PubMedGoogle Scholar
- Happel MFK, Müller SG, Anemüller J, Ohl FW: Predictability of STRFs in auditory cortex neurons depends on stimulus class. Interspeech. 2008, 670.Google Scholar
- Meyer AF, Happel MFK, Ohl FW, Anemüller J: Estimation of spectro-temporal receptive fields based on linear support vector machine classification. BMC Neuroscience. 2009, 10: P147-10.1186/1471-2202-10-S1-P147.View ArticleGoogle Scholar
- Schwartz O, Pillow JW, Rust NC, Simoncelli EP: Spike-triggered neural characterization. J Vis. 2006, 6: 484-507. 10.1167/6.4.13.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.