Deferred Initialization

Q1. specify the input dimensions to the first layer? I looked at the nn.Dense documentation but I did not find how to set both the input and output dimensions.

Q2. We get a runtime error, correct?

Q3. Don’t we use padding, if, for example, we have sentences of varying length? I guess the author here means using the same params on different inputs.

@anirudh I do not see a PyTorch implementation for this section. Is it because deferred initialization is not possible with Pytorch ? or can we use the LazyLinear for the same functionality?

Yes, @sushmit86 Deferred init is not applicable in PyTorch.