Concise Implementation of Multilayer Perceptron

http://d2l.ai/chapter_multilayer-perceptrons/mlp-concise.html

Can someone explain why there is a sudden dip in the plot?

[image: plot of training/test accuracy showing a sudden dip]

I guess it is caused by accidentally testing on what you haven’t trained well yet…
@goldpiggy

[image: "Sem título" (Untitled), another training plot with a dip]
I want to know too; my sudden dip is different from yours.

According to that scale, the dip corresponds to a drop in accuracy of around 0.05.

The test set can behave differently under the current set of parameters after that epoch. By chance, the current parameters may give improved accuracy on the training set but reduced accuracy on the test set.

Where that dip happens (it need not happen at all) depends on the initial state of your parameters.
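To see this, here is a minimal sketch (not the book's code; it uses synthetic data standing in for Fashion-MNIST) that trains the same concise MLP twice with different random seeds and records test accuracy per epoch. The curves generally differ, which illustrates that where, and whether, a dip appears depends on the random initialization:

```python
# Hypothetical sketch: seed-dependent test-accuracy curves for a concise MLP.
import torch
from torch import nn

def run(seed, epochs=5):
    torch.manual_seed(seed)
    # Synthetic two-class data (a stand-in for the real dataset).
    X_train = torch.randn(512, 20)
    y_train = (X_train.sum(dim=1) > 0).long()
    X_test = torch.randn(256, 20)
    y_test = (X_test.sum(dim=1) > 0).long()

    # Concise MLP definition, as in the chapter's high-level-API style.
    net = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 2))
    loss = nn.CrossEntropyLoss()
    trainer = torch.optim.SGD(net.parameters(), lr=0.1)

    test_accs = []
    for _ in range(epochs):
        trainer.zero_grad()
        loss(net(X_train), y_train).backward()
        trainer.step()
        # Evaluate on held-out data after each epoch.
        with torch.no_grad():
            acc = (net(X_test).argmax(dim=1) == y_test).float().mean().item()
        test_accs.append(acc)
    return test_accs

for seed in (0, 1):
    print(seed, [round(a, 3) for a in run(seed)])
```

Running this a few times with different seeds, you will typically see the test-accuracy curve wobble at different epochs, so a dip in one run and not another is expected behavior, not a bug.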

Same for me. It seems close to the meaning of ‘overfitting’.

Same for me. I just tried Runtime &gt; Run all, and the output matched the module’s. Sorry for not answering your “why” question.