The Dataset for Pretraining Word Embeddings

https://zh.d2l.ai/chapter_natural-language-processing-pretraining/word-embedding-dataset.html