lr, num_epochs = 0.01, 10 d2l.train_ch6(net, train_iter, test_iter, num_epochs, lr)
training on cpu
no other results?
Please use a GPU for deep nets. A CPU could be 100 times slower than a GPU.
For the AlexNet on Fashion-MNIST, a GPU takes ~ 20 seconds per epoch, which means a CPU would take 2000 seconds ~ 30 minutes.
@ChenYangyao thank for your reply. I don’t have gpus. I’m using colab for learning.
And now I can’t find any exercitations because of my finance undergraduate diploma.
@StevenJokes I think colab lets you use a GPU for free? Tools -> Change runtime type -> choose ‘GPU’ under Hardware Accelerator
I have known it. But 12 hours will disconnect.
And you can find why colab is not so friendly in my discussion:
And there is an issue of python 3.6:
aha, I see. maybe you can try the kaggle.com free kernel? It allows 30 hours a week of GPU use for its contests I think.
I’ll try. @Nish
If I change Dataset then how do I change the below statement like dataset of Animals?
train_iter, test_iter = d2l.load_data_fashion_mnist(batch_size, resize=224)
For Exercise 4, I’m guessing the dominant part of computation is the block of fully connected layers at the end. Is the dominant part of the memory footprint the largest convolution layer?
If you are using custom dataset then you might have to use the
torch.utils.data.Dataset to create a custom class. Writing Custom Datasets, DataLoaders and Transforms — PyTorch Tutorials 1.9.0+cu102 documentation
- Try increasing the number of epochs. Compared with LeNet, how are the results different? Why?
Finally was able to “train” it but it looks more like a heartbeat than anything
- The result is a very noisy function, it is like its not getting trained at all.
AlexNet may be too complex for the Fashion-MNIST dataset.
- Try simplifying the model to make the training faster, while ensuring that the accuracy
does not drop significantly.
- done but accuracy has still not improved.
- Design a better model that works directly on 28 × 28 images.
Modify the batch size, and observe the changes in accuracy and GPU memory.
- more batch size more consumption
Analyze computational performance of AlexNet.
- any idea how to do it ?
What is the dominant part for the memory footprint of AlexNet?
- Linear network and 3 cnns
What is the dominant part for computation in AlexNet?
- the neuralnetwork 3 cnns
How about memory bandwidth when computing the results?
- more than 100 mb
Apply dropout and ReLU to LeNet-5. Does it improve? How about preprocessing?
lenet_5 = nn.Sequential(nn.Conv2d(1, 6, kernel_size=5, padding=2), nn.ReLU(), nn.AvgPool2d(kernel_size=2, stride=2), nn.Conv2d(6, 16, kernel_size=5), nn.ReLU(), nn.AvgPool2d(kernel_size=2, stride=2), nn.Flatten(), nn.Linear(16 * 5 * 5, 120), nn.ReLU(), nn.Dropout(p=0.5), nn.Linear(120, 84), nn.ReLU(), nn.Linear(84, 10))
its not training well as well.
FInally able to train , here is the network summary
it took me 6m 21 sec to get an accuracy of 88.7, in just 3 epochs.