Your model size is too small. Make it 4/5 hidden layers with 258 hidden node in each layer. Then you can see the difference.