If it's a Python 2.x str, just take its len. If it's a Python 3.x str (or a Python 2.x unicode), first encode it to bytes (or a str, respectively) using your preferred encoding ('utf-8' is a good choice) and then take the len of the encoded bytes/str object.
For example, ASCII characters use 1 byte each:
>>> len("hello".encode("utf8"))
5
while these Chinese characters use 3 bytes each:
>>> len("你好".encode("utf8"))
6
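If you need this in more than one place, the encode-then-len step can be wrapped in a small helper. This is only a sketch: the name `byte_length` and its `encoding` parameter are made up for illustration and are not part of the standard library. It assumes Python 3, where a text string is `str` and a byte string is `bytes`.

```python
def byte_length(text, encoding="utf-8"):
    """Return how many bytes `text` occupies once encoded.

    `byte_length` is a hypothetical helper, not a built-in.
    """
    if isinstance(text, bytes):
        # Already a byte string, so len() is the byte count.
        return len(text)
    # Text string: encode first, then count the resulting bytes.
    return len(text.encode(encoding))


print(byte_length("hello"))              # 5
print(byte_length("你好"))                # 6
print(byte_length("你好", "utf-16-le"))   # 4
```

The last call shows that the result depends on the encoding you pick: the same two characters take 6 bytes in UTF-8 but 4 bytes in UTF-16 (little-endian, no byte order mark).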
tzot, Oct 25 '10 at 9:48