In the findSynonyms
method, word2vec
it does not calculate the cosine similarity v1・vi / |v1| |vi|
, instead it calculates v1・vi / |vi|
where v1
is the vector of the query word, and vi
is the vector of candidate words. Therefore, the value sometimes exceeds 1. To find closer words, there is no need to divide by |v1|
, because it is permanent.
source
share