https://d2l.ai/chapter_natural-language-processing-pretraining/approx-training.html
Exercise 2) Regarding to this exercise, I was wondering if the following steps would answer it or if I’m losing something:
1 Like
but how does formula-15.2.9 have anything to do with formula-15.1.4?