Since you specifically request Unicode, \p{L} is a shortcut for a Unicode letter. However, not all regular expression flavors support this syntax..NET, Perl, Java, and the JGSoft-regex engine, for example, Python will not.
So, for example, \b\p{L}+\b will match a whole word of Unicode characters.
Tim Pietzcker Mar 06 '10 at 12:11 2010-03-06 12:11
source share