, . , ?
.
Then I would make a list of topics and assign words and phrases that would fall into this topic, and then match them with tags. The problem is that you can get more than one topic in each article.
Perhaps the best way would be to use some form of Bayesian classifiers to determine which topic best describes the article. This will require you to train the system first.
This method is used to determine if the message is spam or not.
This article may help.
source
share