How can I accelerate deep learning training using GPU?

123 views (last 30 days)
I've made a simple neural network.
It classifies MNIST handwritten digits using fully connected layers:
lgraph_2 = [ ...
    imageInputLayer([28 28 1])
    fullyConnectedLayer(512)
    reluLayer
    fullyConnectedLayer(256)
    reluLayer
    fullyConnectedLayer(128)
    reluLayer
    fullyConnectedLayer(10)
    softmaxLayer
    classificationLayer];
And the training options are:
miniBatchSize = 10;
valFrequency = 5;
options = trainingOptions('sgdm', ...
    'MiniBatchSize',miniBatchSize, ...
    'MaxEpochs',5, ...
    'InitialLearnRate',3e-4, ...
    'Shuffle','every-epoch', ...
    'ValidationData',augimdsValidation, ...
    'ValidationFrequency',valFrequency, ...
    'Verbose',true, ...
    'Plots','training-progress', ...
    'ExecutionEnvironment','parallel');
I expected that training would be much faster when using a GPU.
But when I train this network on a MacBook (single CPU), it takes about 1 hour for around 2500 iterations, and when I use my desktop with an RTX 2080 Ti, it takes much longer to train.
MATLAB detects my GPU properly (I checked the GPU information using gpuDevice).
I don't know how I can accelerate the training process.
Thank you in advance.

Answers (2)

Joss Knight on 2 Jun 2019
Your mini-batch size is far too small. You're not going to get any benefit of GPU over CPU with that little GPU utilisation. Increase it to 512 or 1024, or higher (MNIST is a toy network - you could probably train the whole thing in a single mini-batch).
Also, the ExecutionEnvironment option you're looking for is gpu or auto, not parallel. parallel may be slowing things down in your case, if you have a second supported graphics card.
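For example, a minimal sketch of the revised options with a larger mini-batch and the GPU execution environment (assuming the same augimdsValidation datastore and a single supported GPU; 512 is just one of the values suggested above):
miniBatchSize = 512;                      % much larger batch so the GPU stays busy
valFrequency = 5;
options = trainingOptions('sgdm', ...
    'MiniBatchSize',miniBatchSize, ...
    'MaxEpochs',5, ...
    'InitialLearnRate',3e-4, ...
    'Shuffle','every-epoch', ...
    'ValidationData',augimdsValidation, ...
    'ValidationFrequency',valFrequency, ...
    'Verbose',true, ...
    'Plots','training-progress', ...
    'ExecutionEnvironment','gpu');        % 'auto' also works; it uses the GPU when one is available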
  2 Comments
Ali Al-Saegh on 22 Oct 2020
Hello Knight,
I'd like to know how things work when a GPU is used for deep learning: is the CPU also involved in the training process, or in any other work? It would be great if I could get some explanation of that!
Also, is it possible to measure the overhead time required for transferring data between memory and the GPU?
Umar on 30 Jun 2024
Hi Ali,
When a GPU is utilized for deep learning tasks, the CPU is involved in managing the overall process, such as loading data, initializing the network, and coordinating communication between the CPU and GPU. The CPU handles tasks that are not suitable for parallel processing on the GPU, ensuring a smooth workflow during training.
To measure the overhead time for data transfer between memory and GPU, you can use profiling tools like MATLAB Profiler or NVIDIA's profiling tools. These tools can help identify bottlenecks in data transfer, allowing you to optimize your code for efficient data handling and minimize overhead time.
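For a rough measurement from within MATLAB itself, here is a minimal sketch (assuming Parallel Computing Toolbox is installed; X is a hypothetical array standing in for one mini-batch of images):
X = rand(28,28,1,512,'single');   % hypothetical mini-batch-sized input
gd = gpuDevice;                   % select/query the current GPU
tic
Xg = gpuArray(X);                 % host-to-GPU copy
wait(gd);                         % block until the GPU has finished the transfer
transferTime = toc;
fprintf('Host-to-GPU transfer took %.4f s\n', transferTime);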
Let us know if you need further assistance.



Shivam Sardana on 29 May 2019
Edited: KSSV on 27 May 2021
Assuming a CUDA® enabled NVIDIA® GPU with compute capability 3.0 or higher and Parallel Computing Toolbox™ are installed, consider changing 'ExecutionEnvironment' to 'gpu'. You can refer to the documentation to see if this helps.
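For example, a quick sketch to confirm the GPU is visible and supported before training (again assuming Parallel Computing Toolbox is installed):
g = gpuDevice;   % selects and queries the current GPU
fprintf('Using %s (compute capability %s)\n', g.Name, g.ComputeCapability);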
  1 Comment
NOSHEEN SOHAIL on 23 Oct 2019
I'm facing almost the same issue. My GeForce GTX 1080 GPU does not show any training progress: no update on a single iteration or even a single epoch, and I have waited and watched for 3 days. Or is it just much slower than CPU training?
How is this happening? Instead of computing faster, it shows nothing; I only hear a loud noise from my GPU card, but the training plot does not progress.
All requirements are met: a CUDA® enabled NVIDIA® GPU with compute capability 3.0 or higher and Parallel Computing Toolbox™ are installed, and I have changed 'ExecutionEnvironment' to 'gpu'.
Please help.

