Smart way to create a tag cloud? - Python

I created a content aggregator and would like to add a tag cloud representing current trends.

Unfortunately, this is rather complicated, because I have to look for keywords that represent the context of each article.

For example, words such as "I", "were", "amazing", and "nice" have nothing to do with the context.


Help would be greatly appreciated! :)

2 answers

Use NLTK and, in particular, its Stopwords Corpus:

In addition to regular content words, there is another class of words called stop words, which perform important grammatical functions but are unlikely to be interesting by themselves. These include prepositions, complementizers, and determiners. NLTK comes bundled with the Stopwords Corpus, a list of 2400 stop words across 11 different languages (including English).
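As a rough illustration of the idea, here is a minimal sketch that drops stop words and counts what is left. The function name `top_terms`, the small `FALLBACK_STOP_WORDS` set, and the sample text are made up for this example; the fallback set is only a stand-in so the snippet runs without downloading anything.

```python
import re
from collections import Counter

# Hypothetical stand-in list so the sketch runs without any downloads.
# It is far smaller than a real stop word list.
FALLBACK_STOP_WORDS = {
    "i", "a", "an", "the", "is", "was", "were", "and",
    "of", "to", "in", "it", "this", "that",
}

def top_terms(text, stop_words=FALLBACK_STOP_WORDS, n=10):
    """Return the n most frequent non-stop-word terms as (term, count) pairs."""
    # Lowercase the text and split it into simple alphabetic tokens.
    words = re.findall(r"[a-z']+", text.lower())
    stop = set(stop_words)
    # Skip stop words and one-character tokens, then count the rest.
    counts = Counter(w for w in words if w not in stop and len(w) > 1)
    return counts.most_common(n)

article = (
    "The aggregator was slow. I think the aggregator is a Python "
    "aggregator, and Python is nice."
)
print(top_terms(article, n=2))
```

With NLTK installed, the stand-in set could be replaced by the real list: after a one-time `nltk.download("stopwords")`, pass `nltk.corpus.stopwords.words("english")` as `stop_words`. Words like "amazing" or "nice" are probably not in that list, so a custom set of extra words to ignore may still be needed.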


NLTK can help you analyze the content to select the appropriate terms.


Source: https://habr.com/ru/post/1304679/

