Reports

Check out a userscript which highlights deleted posts. GitHub

79512915

Date: 2025-03-16 17:03:12

Score: 1

Natty:

After some investigation, I have found the problem. The model included two Dropout layers, which are active during training but disabled during evaluation. This was affecting the final accuracy in evaluate, as the model sees all connections in use during inference. After removing the Dropout layers, the model was able to train correctly.

Reasons:

Has code block (-0.5):
Self-answer (0.5):
Single line (0.5):
Low reputation (0.5):

Posted by: bardulia