Why is there NaN in the weights of Convolutional layer in the deeplab V3+ semantic segmentation network

Question

Zhuofan Zheng on 8 Jul 2019

0
Link

Direct link to this question

https://in.mathworks.com/matlabcentral/answers/470679-why-is-there-nan-in-the-weights-of-convolutional-layer-in-the-deeplab-v3-semantic-segmentation-netw

Answered: Ganesh Regoti on 30 Jul 2019

I use the example in https://ww2.mathworks.cn/help/vision/examples/semantic-segmentation-using-deep-learning.html

however, I want to use the deeplab to segmentate the resomte sensing images. but it has four channel so I change the first conv layer's size from 3 to 4.

It did works, but the accuracy did not increase. I stoped the training, finding that the weights in the first conv layer became NaN. The loss is not NaN, I feel so confused that why this happens?

I have tried several times, all this problem occured.