I want to use a word boundary in a regular expression to match some text in Unicode. Unicode letters are defined as the word boundary in a Python regular expression, as here:
>>> re.search(r"\by\b","üyü") <_sre.SRE_Match object at 0x02819E58> >>> re.search(r"\by\b","ğyğ") <_sre.SRE_Match object at 0x028250C8> >>> re.search(r"\by\b","uyu") >>>
What should I do so that the character of a word character does not match Unicode letters?
source share