Recurrent Neural Network Implementation from Scratch

astonzhang · June 29, 2020, 9:54pm

https://d2l.ai/chapter_recurrent-neural-networks/rnn-scratch.html

HarryDC · August 13, 2020, 12:41pm

There seems to be a broken reference in section 8.5.3 in the second paragraph " :numref: sec_mlp"

goldpiggy · August 15, 2020, 12:09am

Hi @HarryDC, eagle eyes! Can you post an issue on our github? Thanks for contributing!

astonzhang · September 18, 2020, 2:38am

Thanks. Now it’s fixed. See comments in https://github.com/d2l-ai/d2l-en/issues/1448

zhangjiekui · November 19, 2020, 4:57am

the second traning (use_random_iter=True) uses the model trained by the first time:
(1)the ppl is from about 2.0
(2)the curve is bumpy

So misleading

StevenJokess · November 19, 2020, 7:16am

I argee with “uses the model trained by the first time”. Thank you for reminding us.
@zhangjiekui
I think use_random_iter=True maybe is a way to converge better when it is hard to converge our model.

ljppro · October 24, 2021, 8:03am

There is a statement H, = state in function rnn.
I’ve never seen such a usage. What does it mean?
I asked some colleagues around me, but nobody knows.
Can somebody kindly explain it to me?
Many thanks!

dhern023 · November 13, 2021, 4:19pm

@ljppro
This statement unpacks an iterable of length one

x, = [2]
>>x
2

qbaza · January 29, 2022, 5:51am

The second training still uses the same data iterator which is sequential not random. Am I missing something?