See UTF-8 vs. utf8 vs. UTF8 Encode section .
To summarize, Perl has two different UTF-8 encodings. Its native encoding is called utf8
and basically allows any code, regardless of what the Unicode standard says about this code.
Another encoding is called utf-8
(aka utf-8-strict
). This allows you to use only code pages assigned by the Unicode standard.
\x{FFFF}
Unicode. Perl utf8
.
encode
, , (. ). utf-8
U + FFFD ( ), UTF-8 11101111 10111111 10111101 ().