Concise Implementation of Softmax Regression

https://d2l.ai/chapter_linear-classification/softmax-regression-concise.html

Hi @mli I get this error when I import the libraries needed for this module:

from d2l import mxnet as d2l
from mxnet import gluon, init, npx
from mxnet.gluon import nn
npx.set_np()
---------------------------------------------------------------------------
ImportError                               Traceback (most recent call last)
<ipython-input-2-2bc514450bf4> in <module>
----> 1 from d2l import mxnet as d2l
      2 from mxnet import gluon, init, npx
      3 from mxnet.gluon import nn
      4 npx.set_np()

ImportError: cannot import name 'mxnet' from 'd2l' (/Users/xyz/miniconda3/envs/d2l/lib/python3.7/site-packages/d2l/__init__.py)

Is the first import line correct? I see this error in at least all the notebooks for the linear networks module. I think it should be just:

import d2l

Hi @S_X, I ran the code multiple times and it went through smoothly. Please check whether you have d2l installed by running

!pip list | egrep d2l

in your Jupyter notebook. It should return the current version of d2l, such as d2l 0.13.2.

If not, please run

!pip install d2l

in your Jupyter notebook.
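You can also check the version from inside Python. A minimal sketch (this assumes the installed d2l package exposes a __version__ attribute, which recent releases do):

# Check the installed d2l version from a notebook cell
# (assumes d2l defines __version__, as recent releases do).
import d2l
print(d2l.__version__)  # e.g. 0.13.2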

Hi @goldpiggy,
It worked fine for me last week. I then downloaded the latest version of the d2l notebooks and this problem showed up. The issue is:

from d2l import mxnet as d2l

I think it should be

import d2l

In from d2l import mxnet as d2l, I don't think the intent is to import the mxnet library from d2l and name it d2l! Please correct me if I'm mistaken.

Please update to the latest version of d2l with pip install d2l==0.13.2 -f https://d2l.ai/whl.html. We updated the way d2l is imported in order to support multiple backends; from d2l import mxnet as d2l imports the MXNet backend.
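For reference, the backend-specific import patterns look like this (only one backend is used per notebook; the PyTorch and TensorFlow lines are the patterns used by the corresponding versions of the book):

# MXNet backend, as used in this chapter's notebook
from d2l import mxnet as d2l
from mxnet import gluon, init, npx
from mxnet.gluon import nn
npx.set_np()

# The PyTorch version of the notebook would instead start with:
# from d2l import torch as d2l

# The TensorFlow version of the notebook would instead start with:
# from d2l import tensorflow as d2l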


Thanks! That helped!

For the question #2 at the end:

Why might the test accuracy decrease again after a while? How could we fix this?

When I run 30 epochs, I see a small dip in test accuracy. Is this what is being referred to? I don’t see a big decrease in test accuracy.
[plot: train/test accuracy over 30 epochs]

Hi @S_X, did you run the code

d2l.train_ch3(net, train_iter, test_iter, loss, num_epochs, trainer)

multiple times without reinitializing the net?

If a model keeps being trained for too many epochs after it has converged, it can start "overfitting" (http://d2l.ai/chapter_multilayer-perceptrons/underfit-overfit.html).

Hi @goldpiggy,
No, I ran the following:

num_epochs = 30
d2l.train_ch3(net, train_iter, test_iter, loss, num_epochs, trainer)

This is the correct way, right?

Hi @S_X! Since your initial loss is quite low, it looks like you may have run this cell:

num_epochs = 30
d2l.train_ch3(net, train_iter, test_iter, loss, num_epochs, trainer)

multiple times with the same defined net.

If you keep training the same net, the model continues optimizing from the parameters it learned in the previous run. So even though the epoch axis on the plot reads "0-30", it effectively corresponds to "30-60", "60-90", and so on.

Does it make sense to you?
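If you want each run to start from scratch without restarting the whole notebook, you can reinitialize the parameters before training. A minimal sketch, assuming the net, loss, and trainer defined earlier in this notebook (force_reinit is Gluon's flag for re-running initialization on parameters that were already initialized):

# Reset the parameters so training starts from scratch;
# force_reinit=True is required because the net was already initialized.
net.initialize(init.Normal(sigma=0.01), force_reinit=True)

num_epochs = 30
d2l.train_ch3(net, train_iter, test_iter, loss, num_epochs, trainer)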

Hi @goldpiggy
Yes, you are right. I ran with the same net. That explains it.
When I reinitialize the whole notebook and run for more epochs, I can see the test accuracy drop relative to the training accuracy. Thanks for your help!

Result of 30 epochs after re-initializing all the variables:
[plot: train/test accuracy over 30 epochs after reinitialization]

Great! My pleasure to help!

I am not able to run the Huber loss for regression.

We will solve it! Thanks!
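In the meantime, note that Gluon already ships a Huber loss, so it may help to first check that it runs on regression-shaped data on its own. A minimal sketch, assuming the MXNet backend used in this chapter (the arrays below are hypothetical predictions and targets; rho controls where the loss switches from quadratic to linear):

# Minimal check of Gluon's built-in Huber loss on regression-shaped data.
from mxnet import np, npx
from mxnet.gluon import loss as gloss
npx.set_np()

huber = gloss.HuberLoss(rho=1.0)
y_hat = np.array([0.1, 2.0, -1.5])  # hypothetical predictions
y = np.array([0.0, 1.0, -1.0])      # hypothetical targets
print(huber(y_hat, y))              # per-example Huber loss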