zzz, as defined in your post, is a plain str object, not a unicode object, so there is no way to indicate that it is really meant to be Unicode text. You can, however, convert the str object to a unicode object by specifying the encoding:
s.find(zzz.decode("utf-8"))
Replace utf-8 with whatever encoding the string is actually encoded in.
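As a minimal sketch of that conversion (the sample values of s and zzz below are made up for illustration and are not taken from your post), the byte string is decoded to text before it is handed to find. The b and u prefixes are only there so the snippet behaves the same on Python 2.6+ and Python 3.3+:

```python
# Hypothetical sample data; the strings in the question may differ.
s = u"abc\u0101def"   # text containing U+0101
zzz = b"\xc4\x81"     # UTF-8 encoded bytes of U+0101 (a plain str on Python 2)

# Decode the bytes to text first, then search for them.
index = s.find(zzz.decode("utf-8"))
print(index)  # 3
```

If the bytes were produced with a different encoding, decoding them as utf-8 may raise UnicodeDecodeError or silently match the wrong characters.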
Please note that in your example
zzz = '\u0101'
zzz is a plain string of length 6, because \u escapes are not interpreted in a str literal. There is no easy way to repair this faulty string literal after the fact, except for hacks along the lines of
ast.literal_eval("u'" + zzz + "'")
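To see what that hack does, here is a small sketch. It assumes zzz holds the six literal characters backslash, u, 0, 1, 0, 1, which is what the Python 2 literal '\u0101' produces. The backslash is doubled below so the snippet gives the same result on Python 3.3+, and the name fixed is my own:

```python
import ast

# Six characters: a backslash followed by "u0101".
zzz = '\\u0101'
print(len(zzz))  # 6

# Wrap the text in a unicode literal and let Python parse the escape.
fixed = ast.literal_eval("u'" + zzz + "'")
print(len(fixed))          # 1
print(fixed == u'\u0101')  # True
```

This only holds up when zzz contains no quote characters or stray backslashes, which is why it is a hack rather than a general fix.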