160 likes | 306 Views
This research project focuses on analyzing audio data to identify music genres using statistical, signal processing, and machine learning methods. It explores techniques such as Fourier Transform for spectral analysis and fractal dimension measurements for genre identification. By employing feed-forward neural networks, the study examines the complexity and structure of music, distinguishing between genres via various spectral aggregations. Results indicate a correlation between frequency and magnitude properties, providing insights into effective music classification methods and future avenues for research in machine learning and signal processing.
E N D
Josiah Boning TJHSST Senior Research Project Computer Systems Lab, 2007-2008 Music Analysis
Purpose • Framework for study of audio data • Components: • Statistical • Signal Processing • Machine Learning • Uses: • Aircraft identification • Speech recognition and synthesis • And more... • Ideal: computer recognition of music
Background • Bigarelle and Iost (1999) • Music genre can be identified by fractal dimension • Basilie et al. (2004) • Music genre can be identified by machine learning algorithms • Used discrete MIDI data
Methods • Data Processing • Spectral Analysis: Fourier Transform • Fractal Dimension: Variation and ANAM Methods • Machine Learning • Feed-Forward Neural Network
Time domain to frequency domain Spectral Analysis – Fourier Transform
Spectrogram • Many Fourier transforms sequentially
Frequency Aggregate • Horizontal sum of spectrogram
Magnitude Aggregate • Vertical sum of spectrogram • Can perform second Fourier transform
Fractal Dimension • Bigerelle and Iost – used to distinguish genre • Variation Method: • ANAM Method:
Fractal Dimension • Inaccurate calculations • Correct values around 1.6-1.9 • Variation: ~1.16 • ANAM: ~2.25 • Interpolation between samples didn’t help
Machine Learning • Neural networks • Feed-Forward
Neurons • Each node performs weighted, rescaled sum of input values • Scaling: Sigmoid function
Results • Frequency aggregate – songs are similar!
Results • Magnitude and frequency: negative correlation • Not shared bywhite noise • Fourier transform of magnitude aggregate • Meaningless
Where next? • Move code from driver to Fourier module • Otherwise, well organized • Fix fractal dimension calculations • Neural network learning algorithms • Beat detection • Other signal processing techniques