Smoothing in python NLTK

Question

I am using the Naive Bayes classifier in Python to classify text. Does NLTK provide any smoothing methods to avoid zero probabilities for unseen words? Thanks in advance!

+4

python nltk smoothing

Aikin Nov 13 '12 at 6:20

1 answer

oroszgy · Answer 1 · 2012-11-15T12:51:09+0000

I would suggest replacing all low-frequency words (especially those that occur only once) with <unseen>, and then training the classifier on the modified data. At classification time, whenever a word does not appear in the training data, query the model for <unseen> instead.
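A minimal sketch of that preprocessing, in plain Python so it runs without NLTK installed. The helper names (build_vocab, replace_rare, to_features), the min_count threshold, the "contains(...)" feature naming, and the toy training data are all assumptions made for illustration, not part of NLTK or the original answer.

```python
from collections import Counter

UNSEEN = "<unseen>"  # placeholder token, as suggested in the answer


def build_vocab(tokenized_docs, min_count=2):
    """Return the set of words that occur at least min_count times."""
    counts = Counter(word for doc in tokenized_docs for word in doc)
    return {word for word, n in counts.items() if n >= min_count}


def replace_rare(tokens, vocab):
    """Map every token outside the vocabulary to the UNSEEN placeholder."""
    return [tok if tok in vocab else UNSEEN for tok in tokens]


def to_features(tokens):
    """Bag-of-words presence features as a dict (hypothetical naming scheme)."""
    return {"contains({})".format(tok): True for tok in tokens}


# Toy training data (made up for this example): (tokens, label) pairs.
train = [
    (["good", "great", "movie"], "pos"),
    (["good", "fun", "movie"], "pos"),
    (["bad", "boring", "movie"], "neg"),
    (["bad", "awful", "plot"], "neg"),
]

# Words seen only once ("great", "fun", "boring", ...) fall out of the vocabulary.
vocab = build_vocab([tokens for tokens, _ in train], min_count=2)

# Training featuresets now contain <unseen> wherever a rare word stood.
train_featuresets = [
    (to_features(replace_rare(tokens, vocab)), label) for tokens, label in train
]

# At classification time, apply the same mapping to new text.
test_tokens = replace_rare(["good", "unknownword"], vocab)
test_features = to_features(test_tokens)
```

With NLTK installed, train_featuresets should be in the (feature dict, label) form that nltk.NaiveBayesClassifier.train expects, and test_features could then be passed to the trained classifier's classify method. Because <unseen> appears in the training data, the model has a probability estimate for it in each class.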
