Software to find the most common unique words in a file

I remember once visiting a website that would present a resume book / novel in the most interesting way. It will show the list that was most often repeated in this book and was unique / unusual. In other words, words of maximum frequency will be displayed in it, but not such general words as me, you, etc. Then he needs to have things like showing a phrase, if it is repeated often. For example, Treasure Island would probably have such words as: pirates, assault, battle, treasure, pieces of eight, island, Long John Silver, Jim, omen, etc.

It was the most interesting way to quickly get a good idea of ​​whether I want to read this book or not. I can no longer find this site. So I thought about finding software that would do the job. I have several pdf and doc books that I would like to analyze. Does anyone know of a good tool / software that can do this?

Of course, I could probably do it myself, but it would be nice not to reinvent the wheel. So my question is, " do you know about any such software? "

Thanks,
Mugen
(BOOK)

+3
source share
1 answer
+1

Source: https://habr.com/ru/post/1764408/


All Articles