Hello, I noticed that there is no ReLU activation after the second convolutional layer in module ‘b2’ in the PyTorch version. Is it missing?
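For reference, the block I mean looks roughly like this (a sketch from memory, not necessarily the book's exact code; the commented-out nn.ReLU() is the activation that seems to be missing):

```python
from torch import nn

b2 = nn.Sequential(
    nn.Conv2d(64, 64, kernel_size=1),
    nn.ReLU(),
    nn.Conv2d(64, 192, kernel_size=3, padding=1),
    # nn.ReLU(),  # <- no activation here after the second convolution?
    nn.MaxPool2d(kernel_size=3, stride=2, padding=1))
```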
I was also confused by this, so I tried to find the answer in the original paper.
The structure of GoogLeNet is laid out in Table 1 of the original paper.
In inception(3a), #3×3 reduce is the number of channels produced by the 1×1 convolution in the second path (i.e., the input to its 3×3 convolution), and #3×3 is the number of output channels of that path. In this chapter, the author only tells us that (reduced channels)/(previous layer's output channels) = 96/192 = 1/2, and #5×5 reduce is computed the same way (16/192 = 1/12). It is hard to find a rule, because the ratios change to 1/2 and 1/8 (128/256 and 32/256) in the second Inception block.
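To make the arithmetic concrete (the channel numbers are from Table 1 of the paper; the ratio printout is just my own check):

```python
# Reduce-layer ratios for the first two Inception blocks.
blocks = {
    "inception(3a)": (192, 96, 16),   # input, #3x3 reduce, #5x5 reduce
    "inception(3b)": (256, 128, 32),
}
for name, (c_in, r3, r5) in blocks.items():
    print(f"{name}: 3x3 reduce {r3}/{c_in} = {r3 / c_in:.3f}, "
          f"5x5 reduce {r5}/{c_in} = {r5 / c_in:.3f}")
# inception(3a): 3x3 reduce 96/192 = 0.500, 5x5 reduce 16/192 = 0.083
# inception(3b): 3x3 reduce 128/256 = 0.500, 5x5 reduce 32/256 = 0.125
```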
If you want to know why the structure looks like that, the paper does not seem to give a reason. I think it came from a lot of trial and error.
Lastly, I think the more important idea is that we can use several parallel convolutions (1×1, 3×3, 5×5) with fewer channels each, instead of directly using a single 5×5 or 3×3 convolution; a rough weight count below shows why.
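Here is a back-of-the-envelope comparison for the 5×5 path of inception(3a), ignoring biases (my own arithmetic, using the channel numbers from the paper):

```python
# Direct 5x5 conv: 192 input channels -> 32 output channels.
direct = 192 * 32 * 5 * 5                      # 153,600 weights
# 1x1 reduce to 16 channels, then the same 5x5 conv.
reduced = 192 * 16 * 1 * 1 + 16 * 32 * 5 * 5   # 3,072 + 12,800 = 15,872
print(direct, reduced, round(direct / reduced, 1))  # 153600 15872 9.7
```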
As for the ratios themselves, the chapter says: "A scale of 1/2 and 1/8 respectively suffices."
Since the author used the word “suffices”, I think these numbers can be chosen empirically.
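If the numbers really are empirical, it is easy to experiment with them by making the per-path channel counts arguments of the block. A minimal sketch of a four-path Inception block in PyTorch, following the paper's description (my own layout, not necessarily the book's exact code):

```python
import torch
from torch import nn
from torch.nn import functional as F

class Inception(nn.Module):
    """Four parallel paths whose outputs are concatenated along channels."""
    def __init__(self, in_channels, c1, c2, c3, c4):
        super().__init__()
        # Path 1: a single 1x1 convolution.
        self.p1_1 = nn.Conv2d(in_channels, c1, kernel_size=1)
        # Path 2: 1x1 reduce, then 3x3 convolution.
        self.p2_1 = nn.Conv2d(in_channels, c2[0], kernel_size=1)
        self.p2_2 = nn.Conv2d(c2[0], c2[1], kernel_size=3, padding=1)
        # Path 3: 1x1 reduce, then 5x5 convolution.
        self.p3_1 = nn.Conv2d(in_channels, c3[0], kernel_size=1)
        self.p3_2 = nn.Conv2d(c3[0], c3[1], kernel_size=5, padding=2)
        # Path 4: 3x3 max pooling, then 1x1 projection.
        self.p4_1 = nn.MaxPool2d(kernel_size=3, stride=1, padding=1)
        self.p4_2 = nn.Conv2d(in_channels, c4, kernel_size=1)

    def forward(self, x):
        p1 = F.relu(self.p1_1(x))
        p2 = F.relu(self.p2_2(F.relu(self.p2_1(x))))
        p3 = F.relu(self.p3_2(F.relu(self.p3_1(x))))
        p4 = F.relu(self.p4_2(self.p4_1(x)))
        return torch.cat((p1, p2, p3, p4), dim=1)

# inception(3a) from Table 1: try changing the reduce channels (96, 16)
# to test scales other than 1/2 and 1/12.
block = Inception(192, 64, (96, 128), (16, 32), 32)
```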