Imbalanced Audio Dataset for Deep Learning Classification

Nicholas Ang

12 Jul 2021

1 Answer

Updated 30 Jul 2021

24 Views (30 days)

Follow Question

Show older comments

0 votes

Hi, I am trying to use audio data from interviews for binary classification through converting my dataset into spectrograms before feeding into CNN for classification. Firstly, the audio data have different duration i.e., 7 min-30 min and the dataset is imbalanced. I am aware of techniques such as SMOTE and oversampling of minority classes, but I am lost on how to oversample my minority class. Should I convert into spectrogram before oversampling and are there any ways to do it? Thanks!

0 Comments
Show -2 older comments Hide -2 older comments

Follow Question

Answers (1)

Vineet Joshi on 30 Jul 2021

0 votes

The following documentation talks about data augmentation for audio data. It covers examples on how to create custom pipelines and functions such as pitch shifting, time shifting, and time stretching.

Data Augmentation

Hope this helps you.

Thanks

0 Comments
Show -2 older comments Hide -2 older comments

Products

MATLAB

Release

R2021a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Imbalanced Audio Dataset for Deep Learning Classification

0 Comments
Show -2 older comments Hide -2 older comments

Answers (1)

0 Comments
Show -2 older comments Hide -2 older comments

Categories

Products

Release

Tags

Community Treasure Hunt

Imbalanced Audio Dataset for Deep Learning Classification

0 Comments Show -2 older comments Hide -2 older comments

Answers (1)

0 Comments Show -2 older comments Hide -2 older comments

Categories

Products

Release

Tags

See Also

Community Treasure Hunt

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments