" ASCII", , ? - . 128 (.. 0,127) "ASCII"; , 128..255, - ASCII, cp437. " ASCII", cp437 - .
But I was distracted. Your question is not about character encoding, about filtering, but the filter should be based on the properties of the characters: is it a letter, number, control character? Most modern programming languages ββprovide methods or functions for obtaining such information, and most of them also provide support for regular expressions. As for what you should filter, or you should filter in general, only you can know.
It looks like you need to learn more about character encoding and Unicode. Start here.
source
share