Your observation is not always true. And the choice of ASCII is more human factors, rather than technical or security issues.
In most cases, this is just for ease of programming. The programmer never knows that all the software, libraries, utilities on the website will be broken or not with some characters. Why is website development risked while ASCII works well? In addition, some packaged web programs will prevent Unicode from being used in the username. This makes many websites only support usernames in ASCII.
Theoretically, all current software can handle 8-bit data well. There are currently no issues with storage or transfer. Even if some protocols do not exist, they can translate to UTF-7 or with other conversion schemes.
Unicode has some problems. It is more on the data processing side. This can be display, fonts, readiness of program and program libraries for characters other than BMP, sorting, comparison, input methods, recording directions. Administrators may not be knowledgeable enough to handle them. Depending on the nature of the website, this may be a problem, but basically it is not.
For administrator purposes, itโs not easy to type some exotic characters. This makes it difficult for the administrator to search for users. It is also difficult for an administrator to store abusive usernames in foreign languages โโfrom a website.
However, it is not uncommon that Chinese usernames are used on a Chinese site. This may not always be in ASCII. Other cultures and languages โโdo the same. In some global projects, almost all kinds of Unicode characters. Wikipedia is an example.
OmniBus Aug 13 '10 at 10:28 2010-08-13 10:28
source share