Multiple Input and Output Channels

mli · May 29, 2020, 9:22pm

http://d2l.ai/chapter_convolutional-neural-networks/channels.html

sunchunqiang · October 31, 2020, 6:54am

I am a beginner. For a layer with multiple output channels, it seems that every channel's parameters are initialized the same way, yet the channels end up with different kernels. How does that happen?

goldpiggy · November 2, 2020, 11:11pm

Hi @sunchunqiang, yes, you are right that every channel uses the same initialization method. However, the initializer samples randomly from the given distribution, so even though we use the same initializer, each draw gives a different result.
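
To make this concrete, here is a minimal NumPy sketch. The shapes, the seed, and the standard deviation of 0.01 are assumptions chosen for illustration and are not taken from the chapter. A single call draws every weight independently, so the kernels of the two output channels already differ before training starts:

```python
import numpy as np

rng = np.random.default_rng(0)  # seed chosen arbitrarily, only for reproducibility

c_o, c_i, kh, kw = 2, 3, 2, 2
# One call draws every weight independently from the same normal distribution
K = rng.normal(0.0, 0.01, size=(c_o, c_i, kh, kw))

# The two output-channel kernels share a distribution but not their values
print(np.array_equal(K[0], K[1]))  # False
```

If the kernels did start out identical, they would receive identical gradients and stay identical, which is why random sampling is used.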

Sandeep_Gupta · December 4, 2020, 9:01pm

Quick question about Figure 6.4.1, multi-channel 2D convolutions:

In practice, for a standard conv2D layer, do we usually use the same 2D kernel for each input channel, or do we use a 3 × k_h × k_w kernel, which would mean that there are different parameters for each channel? The diagram shows an example of the latter, but it seems like actual implementations use the first.
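
For reference, the computation in the figure can be sketched in plain NumPy as below. The helper names mirror the ones used in the chapter, but this is a rewritten sketch rather than the book's exact code. Each input channel is paired with its own k_h × k_w slice of the kernel, which is also how a standard (non-grouped) conv2D layer stores its weights, as far as I know:

```python
import numpy as np

def corr2d(X, K):
    """2D cross-correlation of a single-channel input X with a 2D kernel K."""
    h, w = K.shape
    Y = np.zeros((X.shape[0] - h + 1, X.shape[1] - w + 1))
    for i in range(Y.shape[0]):
        for j in range(Y.shape[1]):
            Y[i, j] = (X[i:i + h, j:j + w] * K).sum()
    return Y

def corr2d_multi_in(X, K):
    """Pair each input channel with its own kernel slice, then sum the results."""
    return sum(corr2d(x, k) for x, k in zip(X, K))

# Two input channels, so the kernel has shape (2, 2, 2): one 2x2 slice per channel
X = np.array([[[0., 1, 2], [3, 4, 5], [6, 7, 8]],
              [[1., 2, 3], [4, 5, 6], [7, 8, 9]]])
K = np.array([[[0., 1], [2, 3]],
              [[1., 2], [3, 4]]])
print(corr2d_multi_in(X, K))
```

With these values the output is [[56, 72], [104, 120]]. Sharing a single 2D kernel across all input channels would be a different, more constrained layer.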

six · February 8, 2021, 8:11pm

Ex. 6.4.6 (last question of chapter)
Is it OK for Y1 != Y2 when the kernel size goes from 1x1 to 2x2? Or should the 2x2 version match what the iterative cross-correlation function computes?

This is what I have so far:

# Exercise 6
# Assumes np, X, and corr2d_multi_in_out are already defined as in the chapter
def corr2d_multi_in_out_2x2(X, K):
    c_i, h, w = X.shape
    c_o, c_ii, kh, kw = K.shape
    assert c_ii == c_i, "Kernel channel dimensions don't match input"
    X = X.reshape((c_i, h * w))
    K = K.reshape((c_o, kh * kw, c_i))
    Y = np.dot(K, X)  # Matrix multiplication in the fully-connected layer
    Y = Y.sum(axis=2) # input channel dimension
    return Y.reshape((c_o, kh, kw))


K = np.random.normal(0, 1, (2, 3, 2, 2))
Y1 = corr2d_multi_in_out_2x2(X, K)
print(Y1.shape)
Y2 = corr2d_multi_in_out(X, K)
print(Y2.shape)

JH.Lam · April 1, 2021, 4:08pm

I think a plain matrix product won't work for arbitrary kernel dimensions. The conv2d implementation sheds some light on the details.
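
A direct reshape-and-multiply only lines up for 1x1 kernels. One common workaround, which is not from this thread, is to first unfold every input patch into a column (often called im2col) and then do a single matrix product. The sketch below assumes stride 1 and no padding; the function names and the loop-based reference are written here for illustration and are not the chapter's code. With this approach the two results should match:

```python
import numpy as np

def corr2d_multi_in_out_loop(X, K):
    """Reference version: explicit loops over output channels and positions."""
    c_o, c_i, kh, kw = K.shape
    _, h, w = X.shape
    oh, ow = h - kh + 1, w - kw + 1
    Y = np.zeros((c_o, oh, ow))
    for o in range(c_o):
        for i in range(oh):
            for j in range(ow):
                Y[o, i, j] = (X[:, i:i + kh, j:j + kw] * K[o]).sum()
    return Y

def corr2d_multi_in_out_matmul(X, K):
    """Unfold patches into columns (im2col), then use one matrix product."""
    c_o, c_i, kh, kw = K.shape
    _, h, w = X.shape
    oh, ow = h - kh + 1, w - kw + 1
    # Each column holds one flattened (c_i, kh, kw) input patch
    cols = np.stack([X[:, i:i + kh, j:j + kw].reshape(-1)
                     for i in range(oh) for j in range(ow)], axis=1)
    # (c_o, c_i*kh*kw) @ (c_i*kh*kw, oh*ow) -> (c_o, oh*ow)
    Y = K.reshape(c_o, -1) @ cols
    return Y.reshape(c_o, oh, ow)

rng = np.random.default_rng(0)  # arbitrary seed
X = rng.normal(size=(3, 3, 3))
K = rng.normal(size=(2, 3, 2, 2))
Y1 = corr2d_multi_in_out_matmul(X, K)
Y2 = corr2d_multi_in_out_loop(X, K)
print(Y1.shape, np.allclose(Y1, Y2))  # (2, 2, 2) True
```

Note that the output has spatial size (h - kh + 1) × (w - kw + 1), not kh × kw; the two only coincide here because a 3x3 input with a 2x2 kernel happens to give a 2x2 output.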

tardebut · August 14, 2021, 2:45pm

Hi there,

Nothing is said in this chapter about the activation function. It may come later. Is it applied to the aggregation (sum) of the channel-specific convolutions (multichannel input, Fig. 6.4.1), or before the sum? Does it matter?

Omar
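
In the usual formulation, the activation is applied to the layer's output, that is, after summing over the input channels (and adding the bias). The order does matter for a nonlinear function such as ReLU. A tiny sketch with made-up numbers (the values and variable names are hypothetical):

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0)

# Hypothetical per-input-channel cross-correlation results at one output position
per_channel = np.array([2.0, -3.0])

after_sum = relu(per_channel.sum())   # activation applied to the aggregated value
before_sum = relu(per_channel).sum()  # activation applied to each channel first

print(after_sum, before_sum)  # 0.0 2.0
```

Applying the activation before the sum would give a different function, not just a reordering of the same one.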