I don’t know what are you talking on or asking about.
It is training on good way.
Give me some tips or more detail to understand for me.
If you will pay attention on the image, you will see it achieved a training loss of 0.51 in only 0.02 epochs outof 2. And in only 140 steps. That’s the problem.
