stri_unescape_unicode {stringi} | R Documentation |
Unescapes all known escape sequences
stri_unescape_unicode(str)
str |
character vector |
Uses ICU facilities to unescape Unicode character sequences.
The following ASCII standard escapes are recognized:
\a
, \b
, \t
, \n
, \v
, \?
,
\e
,
\f
, \r
, \"
, \'
, \\
.
Moreover, the function understands the following ones:
\uXXXX
(4 hex digits),
\UXXXXXXXX
(8 hex digits),
\xXX
(1-2 hex digits),
\ooo
(1-3 octal digits),
\cX
(control-X; X is masked with 0x1F).
For \xXX
and \ooo
beware of non-valid UTF8 byte sequences.
Note that some versions of R on Windows cannot handle characters defined with \UXXXXXXXX. We are working on that.
Returns a character vector.
If an escape sequence is ill-formed,
result will be NA
and a warning will be given.
Other escape: stri_escape_unicode
stri_unescape_unicode("a\\u0105!\\u0032\\n")