I am training a deep residual network with 10 hidden layers on game data.
Does anyone have an idea why I don't see any overfitting here? Both training and test loss are still decreasing after 100 epochs of training.
https://imgur.com/Tf3DIZL
Just a couple of suggestions:
learning_rate is not tuned; try increasing it (and probably the number of epochs as well, if you implement some kind of 'smart' learning-rate reduction).
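To make the 'smart' lr-reduce idea concrete, here is a minimal sketch of one common variant, reduce-on-plateau: start with a larger learning rate and cut it whenever the test loss stops improving. The class name ReduceOnPlateau, its parameters, and the loss values in the demo are all made up for illustration; they are not taken from your setup. Most frameworks ship an equivalent (for example torch.optim.lr_scheduler.ReduceLROnPlateau in PyTorch or the ReduceLROnPlateau callback in Keras), so in practice you would probably use that instead of rolling your own.

```python
class ReduceOnPlateau:
    """Hypothetical helper: shrink the learning rate when the loss plateaus."""

    def __init__(self, lr, factor=0.5, patience=3, min_lr=1e-6):
        self.lr = lr                # current learning rate
        self.factor = factor        # multiply lr by this on a plateau
        self.patience = patience    # epochs without improvement to tolerate
        self.min_lr = min_lr        # never go below this value
        self.best = float("inf")    # best (lowest) loss seen so far
        self.bad_epochs = 0         # consecutive epochs without improvement

    def step(self, val_loss):
        """Call once per epoch with the test/validation loss; returns the lr to use next."""
        if val_loss < self.best:
            self.best = val_loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
            if self.bad_epochs > self.patience:
                self.lr = max(self.lr * self.factor, self.min_lr)
                self.bad_epochs = 0
        return self.lr


if __name__ == "__main__":
    # Made-up loss curve, only to show how the scheduler reacts.
    scheduler = ReduceOnPlateau(lr=0.1, factor=0.5, patience=2)
    for epoch, loss in enumerate([1.0, 0.8, 0.7, 0.7, 0.7, 0.7, 0.65]):
        lr = scheduler.step(loss)
        print(f"epoch {epoch}: loss={loss:.2f} lr={lr}")
```

With a schedule like this you can afford a more aggressive initial learning_rate, since it gets reduced automatically once progress stalls. That is also why more epochs may be needed: the later, smaller-step phases need time to pay off.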