Scene Classification: Geiger et al

Overview

This is a contribution to the IEEE AASP Challenge on classification of acoustic scenes. From the 30 second long highly variable recordings, spectral, cepstral, energy and voicing-related audio features are extracted. A sliding window approach is used to obtain statistical functionals of the low-level features on short segments. SVM are used for classification of these short segments, and a majority voting scheme is employed to get a decision for the whole recording. On the official development set of the challenge, an accuracy of 73 % is achieved. A feature analysis using the t-statistic showed that mainly Mel spectra were the most relevant features.

Related publications

J. T. Geiger, B. Schuller, and G. Rigoll, “Recognising acoustic scenes with large-scale audio feature extraction and SVM,” 2013.: [More Details] [BIBT_EX] [URL (ext.)]; @techreport{j2013a, author = {Jürgen T. Geiger and Bjoern Schuller and Gerhard Rigoll}, title = {Recognising acoustic scenes with large-scale audio feature extraction and SVM}, year = {2013} }
J. T. Geiger, B. Schuller, and G. Rigoll, “Large-Scale Audio Feature Extraction and SVM for Acoustic Scene Classification,” in WASPAA, 2013, p. 4.: [More Details] [BIBT_EX] [URL (ext.)]; @inproceedings{j2013a, author = {Jürgen T. Geiger and Bjoern Schuller and Gerhard Rigoll}, title = {Large-Scale Audio Feature Extraction and SVM for Acoustic Scene Classification}, booktitle = {WASPAA}, month = {10}, pages = {4}, publisher = {IEEE}, year = {2013} }

Members

Manager: Dan Stowell, Emmanouil Benetos, Jürgen Geiger, Mark Plumbley

IEEE AASP D-CASE Challenge Code Submissions »

Overview

Related publications

Members