Main Content

Audio Processing Using Deep Learning

Extend deep learning workflows with audio and speech processing applications

Apply deep learning to audio and speech processing applications by using Deep Learning Toolbox™ together with Audio Toolbox™. For signal processing applications, see Signal Processing Using Deep Learning. For applications in wireless communications, see Wireless Communications Using Deep Learning.

Apps

Signal LabelerLabel signal attributes, regions, and points of interest, and extract features

Functions

expand all

audioDatastoreDatastore for collection of audio files
audioDataAugmenterAugment audio data
audioFeatureExtractorStreamline audio feature extraction
openl3EmbeddingsExtract OpenL3 feature embeddings
pitchnnEstimate pitch with deep learning neural network
vggishEmbeddingsExtract VGGish feature embeddings
classifySoundClassify sounds in audio signal
crepeCREPE neural network
crepePreprocessPreprocess audio for CREPE deep learning network
crepePostprocessPostprocess output of CREPE deep learning network
openl3OpenL3 neural network
openl3EmbeddingsExtract OpenL3 feature embeddings
openl3PreprocessPreprocess audio for OpenL3 feature extraction
pitchnnEstimate pitch with deep learning neural network
vggishVGGish neural network
vggishEmbeddingsExtract VGGish feature embeddings
vggishPreprocessPreprocess audio for VGGish feature extraction
yamnetYAMNet neural network
yamnetGraphGraph of YAMNet AudioSet ontology
yamnetPreprocessPreprocess audio for YAMNet classification

Blocks

VGGishVGGish embeddings extraction network
VGGish EmbeddingsExtract VGGish embeddings
YAMNetYAMNet sound classification network
Sound ClassifierClassify sounds in audio signal
OpenL3OpenL3 embeddings extraction network
OpenL3 EmbeddingsExtract OpenL3 embeddings

Topics