Can we train only the classification layer when doing transfer learning with a pre-trained network?

Hi,
The question is: can we train only the classification layer when doing transfer learning with a pre-trained network? I want to speed up my training by keeping the feature-extraction layers (the base model) as they are and only replacing and retraining the classification layers.
The equivalent in Keras (Python) is: base_model.trainable = False
If this is possible in MATLAB, please let me know how. Your help is appreciated.
Cheers
Sud
  2 Comments
Greg Heath on 17 Jul 2020
See if it is now possible to assign different learning rates to different layers. I wasn't able to do so some time ago.
Greg
Sud Sudirman on 17 Jul 2020
What we can do is set the InitialLearnRate to a very small number (e.g., 1e-4) and set the WeightLearnRateFactor and BiasLearnRateFactor of the fully connected layer before the final classification layer to a large number (e.g., 20), as demonstrated here: https://uk.mathworks.com/help/deeplearning/gs/get-started-with-transfer-learning.html and in the sketch below. This approach seems to approximate the one you suggested, Greg.
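For illustration, a minimal sketch of that approach, assuming GoogLeNet and an imageDatastore named imdsTrain (both placeholders; any pretrained network with known layer names would work similarly):

% Load a pretrained network and get its layer graph.
net = googlenet;
lgraph = layerGraph(net);
% Replace the last learnable layer and the classification output layer.
numClasses = numel(categories(imdsTrain.Labels));
newFC = fullyConnectedLayer(numClasses, ...
    'Name','new_fc', ...
    'WeightLearnRateFactor',20, ...  % learn faster in the new layer
    'BiasLearnRateFactor',20);
lgraph = replaceLayer(lgraph,'loss3-classifier',newFC);
lgraph = replaceLayer(lgraph,'output',classificationLayer('Name','new_output'));
% A small global learning rate keeps the transferred layers nearly fixed.
options = trainingOptions('sgdm','InitialLearnRate',1e-4);
netTransfer = trainNetwork(imdsTrain,lgraph,options);

(The images in imdsTrain are assumed to already match the network's input size; otherwise wrap the datastore in an augmentedImageDatastore.)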
However, I was thinking about something more along the lines of what is described in https://uk.mathworks.com/help/deeplearning/ug/extract-image-features-using-pretrained-network.html (sketched below). In this approach, I can record the activations of the last layer before the fully connected layers (the bottleneck features) and use them to train a classifier. But for some reason, I cannot use these features to train a network consisting of {SequenceInputLayer + FCNN + ClassificationLayer}. Odd.
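A rough sketch of that feature-extraction approach, assuming AlexNet and image datastores named imdsTrain/imdsTest (placeholders), with a multiclass SVM standing in for the final classifier:

net = alexnet;
inputSize = net.Layers(1).InputSize;
% Resize the images to the network's input size.
augimdsTrain = augmentedImageDatastore(inputSize(1:2),imdsTrain);
augimdsTest  = augmentedImageDatastore(inputSize(1:2),imdsTest);
% Record the bottleneck features at a layer before the final FC layer.
layer = 'fc7';
featuresTrain = activations(net,augimdsTrain,layer,'OutputAs','rows');
featuresTest  = activations(net,augimdsTest, layer,'OutputAs','rows');
% Train a multiclass SVM on the frozen features and evaluate it.
classifier = fitcecoc(featuresTrain,imdsTrain.Labels);
accuracy = mean(predict(classifier,featuresTest) == imdsTest.Labels);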


Accepted Answer

Srivardhan Gadila on 17 Jul 2020
To freeze the weights of a particular layer of your network, set its WeightLearnRateFactor and BiasLearnRateFactor properties to zero. Refer to fullyConnectedLayer - Learn Rate and Regularization, convolution2dLayer - Learn Rate and Regularization, and lstmLayer - Learn Rate and Regularization.
layer.WeightLearnRateFactor = 0;
layer.BiasLearnRateFactor = 0;
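To freeze the whole base model rather than a single layer, one possible sketch (assuming a SeriesNetwork such as AlexNet, where net.Layers is a plain Layer array; numClasses is a placeholder for your class count):

net = alexnet;
layers = net.Layers;
% Zero the learn-rate factors of every layer with learnable weights,
% so only the replacement layers below are actually updated.
for i = 1:numel(layers)
    if isprop(layers(i),'WeightLearnRateFactor')
        layers(i).WeightLearnRateFactor = 0;
        layers(i).BiasLearnRateFactor = 0;
    end
end
% Replace the classification head (layers 23:25 are fc8/softmax/output in AlexNet).
layers(23) = fullyConnectedLayer(numClasses);
layers(24) = softmaxLayer;
layers(25) = classificationLayer;

For a DAG network you would instead edit the layerGraph and swap the modified layers back in with replaceLayer. Note that zeroing the learn-rate factors skips the weight updates, but the forward and backward passes still run through the frozen layers, so training is not as fast as pure feature extraction.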
  2 Comments
Mathieu on 2 Jul 2021
Hi,
OK, it works, but the training seems relatively slow; I expected it to be quicker. With your method, is the gradient still calculated for all layers?



Release: R2020a
