Padding and Stride

mli · May 29, 2020, 9:21pm

http://d2l.ai/chapter_convolutional-neural-networks/padding-and-strides.html

StevenJokes · August 7, 2020, 3:31pm

Q:
1.
4 * 4:

2 * 2:

2. TODO:
3. stride each 2*sampling rate?
4. Look big picture. To avoid overfitting and accerlate the training.

Nicholas_Kim · September 5, 2020, 12:36pm

Hi all.
This is self-paced course for me to understand more about the ML.
If it is NOT allowed to add comments/questions except for the officially registered students, I will not write anything here further.

@StevenJokes
3. stride each 2*sampling rate
You mean down sampling by 2?

@Everybody
4. Cannot understand the question clearly.
Just saying that it can reduce the calculation of hidden layer seems to be too simple & obvious answer. Expecting other answers?

StevenJokes · September 5, 2020, 12:50pm

@Nicholas_Kim
I guess it is. I’m not sure about it.

goldpiggy · September 8, 2020, 9:01pm

Hi @Nicholas_Kim,

You are almost right! Strides help remove unnecessary computational burdensome and help to reduce the feature dimension more quickly.

codebat · January 22, 2021, 11:47am

Hi all,
In the paragraph after equation 6.3.1 it says:
Assuming that 𝑘ℎ is odd here, we will pad 𝑝ℎ/2 rows on both sides of the height. If 𝑘ℎ is even, one possibility is to pad ⌈𝑝ℎ/2⌉ rows on the top of the input and ⌊𝑝ℎ/2⌋ rows on the bottom. We will pad both sides of the width in the same way.
Is even and odd switched here? should it be
Assuming that 𝑘ℎ is even here, we will pad 𝑝ℎ/2 rows on both sides of the height. If 𝑘ℎ is odd, one possibility is to pad ⌈𝑝ℎ/2⌉ rows on the top of the input and ⌊𝑝ℎ/2⌋ rows on the bottom. We will pad both sides of the width in the same way.

Ignore- It shouldn’t be cause we are preserving the input dimension.

fanbyprinciple · August 20, 2021, 6:20am

Exercises

For the last example in this section, use mathematics to calculate the output shape to see if
it is consistent with the experimental result.

it is consistent, |(8 -3 + 0 + 3)/3| , |(8-5+1+4)/4|

Try other padding and stride combinations on the experiments in this section.

hmm tried

For audio signals, what does a stride of 2 correspond to?

it might be two time peridod long

What are the computational benefits of a stride larger than 1

efficiency in calculation,downsampling

Shreyas_Jaiswal · March 8, 2022, 7:23pm

I think you need to change the equ. 6.3.2. If we use padding there should 2ph and 2pw as we pad 0 to top bottom and left right respectively, So the output shape should be something like this:
((nh - kh + 2ph + sh) // sh) x ((nw-kw+2pw+sw) // sw).

pandalabme · August 25, 2023, 3:34pm

My solutions to the exs: 7.3