Finding Synonyms and Analogies

Is there any theoretical proof explaining why word embedding algorithms can find synonyms and analogies using pretrained vectors? Or is there no interpretability for this problem?

If you look at the likelihood function in word embeddings such as word2vec, the exponent is the dot product of the center word vector and the context word vectors. If two center words are interchangeable (e.g., synonyms), they tend to appear with the same context words, so maximizing the likelihood pushes both toward large dot products with the same context vectors. As a result, the two center-word vectors end up close to each other, which is why nearest-neighbor search in the embedding space surfaces synonyms.
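To make the "exponent is a dot product" point concrete, here is a minimal sketch of the skip-gram probability, assuming toy randomly initialized embedding tables `V` (center vectors) and `U` (context vectors); these names and the vocabulary size are illustrative, not from any specific library.

```python
import numpy as np

np.random.seed(0)
vocab_size, dim = 5, 4
# Toy embedding tables (assumption: random values just for illustration).
V = np.random.randn(vocab_size, dim)  # center-word vectors v_c
U = np.random.randn(vocab_size, dim)  # context-word vectors u_o

def skipgram_prob(center, context):
    """P(context | center) = exp(u_o . v_c) / sum_i exp(u_i . v_c)."""
    logits = U @ V[center]   # dot product of v_c with every context vector
    logits -= logits.max()   # subtract max for numerical stability
    p = np.exp(logits)
    return p[context] / p.sum()

# Sanity check: probabilities over the vocabulary sum to 1.
total = sum(skipgram_prob(0, o) for o in range(vocab_size))
```

If two center words had nearly identical vectors in `V`, `skipgram_prob` would assign them nearly identical context distributions, which is exactly the interchangeability the training objective rewards.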