Because the last hidden state of the encoder serves as the initial hidden state of the decoder: $$s_0$$.
How did you change from GRU to LSTM? I got an error!
Thanks for your help.
Because the last hidden state of the encoder serves as the initial hidden state of the decoder: $$s_0$$.
How did you change from GRU to LSTM? I got an error!
Thanks for your help.