[Caveat] This is not a programming issue, but it is something that so often occurs when processing a language that I am sure it will be useful to the community.
Does anyone have a good list of uninteresting (English) words that have been checked more than randomly? This will include all prepositions, conjunctions, etc. Words that may have semantic meaning, but are often found in every sentence, regardless of subject. From time to time I made my own lists for personal projects, but they were ad-hoc; I constantly add words that I forget when they enter.
source
share