69
loading...
This website collects cookies to deliver better user experience
librosa
and essentia
. librosa
is an API for feature extraction and processing data in Python. librosa.feature.mfcc
is a method that simplifies the process of obtaining MFCCs by providing arguments to set the number of frames, hop length, number of MFCCs and so on. Based on the arguments that are set, a 2D array is returned.essentia
is a full function workflow environment for high and low level features, facilitating audio input, preprocessing and statistical analysis of output. It was written in C++ with Python binding and exports data in YAML or JSON format.essentia.standard.MFCC
function has a parameter to fix the number of coefficients in the MFCC but processes the entire file in one go returning a 1D array. The library however also has a FrameGenerator
method that takes in other parameters which could make it yield similar results with librosa
.FrameGenerator
method to set other parameters like the hop length, number of frames and number of MFCCs to be the same as those used with librosa. Also, the sample rate and windowing type were modified to be the same for both libraries.essentia
was still about 2 times faster than librosa
(this was the primary metric I wanted to compare). However, I also noticed something else. The MFCCs did not look the same.0.9019551277160645
0.9127510786056519
0.90
and 0.94
. 69