Large-Scale Pretraining with Transformers

https://d2l.ai/chapter_attention-mechanisms-and-transformers/large-pretraining-transformers.html

Fig. 11.9.7.

“Je suis malade” = “I am sick”

Yes.
And “He’s calm” = “Il est calme”; “Elle court” = “She’s running”.