I would suggest replacing every low-frequency word (especially words that occur only once) with an <unseen> token, and then training the classifier on that modified data. At classification time, whenever you encounter a word that was not in the training data, query the model for <unseen> instead.
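A minimal sketch of that preprocessing step, assuming documents are already tokenized into lists of words. The names `build_vocab`, `replace_rare`, and the `min_count` threshold are hypothetical and only for illustration; the classifier itself is left out, since any model trained on the mapped tokens would do.

```python
from collections import Counter

UNSEEN = "<unseen>"  # placeholder token for rare and unknown words


def build_vocab(docs, min_count=2):
    """Return the set of words that occur at least min_count times in docs."""
    counts = Counter(word for doc in docs for word in doc)
    return {word for word, count in counts.items() if count >= min_count}


def replace_rare(doc, vocab):
    """Map every word outside vocab to the UNSEEN token."""
    return [word if word in vocab else UNSEEN for word in doc]


# Toy training data (assumed, not from the original question).
train_docs = [
    ["the", "cat", "sat"],
    ["the", "dog", "sat"],
    ["the", "cat", "ran"],
]

# Words seen only once ("dog", "ran") fall out of the vocabulary.
vocab = build_vocab(train_docs, min_count=2)

# Train the classifier on this mapped data so it learns statistics for UNSEEN.
train_mapped = [replace_rare(doc, vocab) for doc in train_docs]

# At classification time, apply the same mapping to the incoming text.
query_mapped = replace_rare(["the", "zebra", "sat"], vocab)
```

Because the same mapping is applied to training and query data, a word the model has never seen is handled with the statistics learned from the rare training words.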