native2unicode
Convert numeric bytes to Unicode character representation
Syntax
unicodestr = native2unicode(bytes)
unicodestr = native2unicode(bytes,
encoding)
Description
unicodestr = native2unicode(bytes) converts a
numeric vector, bytes, from the user default encoding to a
Unicode® character representation. native2unicode treats
bytes as a vector of 8-bit bytes, and each value must be in the
range [0,255]. The output argument unicodestr is a character vector
having the same general array shape as bytes.
unicodestr = native2unicode(bytes,
encoding)
converts bytes to a Unicode representation with the assumption that bytes is in the
character encoding scheme specified by encoding. The input argument
encoding must have no characters ('') or it
must be a name or alias for an encoding scheme. Some examples are
'UTF-8', 'latin1',
'US-ASCII', and 'Shift_JIS'. If
encoding is unspecified or has no characters
(''), the default encoding scheme is used.
encoding can be a character vector or a string scalar.
Note
If bytes is a character vector or a string scalar, it is
returned unchanged.
Examples
This example begins with a vector of bytes in an unknown character
encoding scheme. The user-written function detect_encoding determines
the encoding scheme. If successful, it returns the encoding scheme
name or alias as a character vector. If unsuccessful, it throws an
error represented by an MException object, ME.
The example calls native2unicode to convert the
bytes to Unicode representation:
try
enc = detect_encoding(bytes);
str = native2unicode(bytes, enc);
disp(str);
catch ME
rethrow(ME);
endNote that the computer must be configured to display text in
a language represented by the detected encoding scheme for the output
of disp(str) to be correct.
Extended Capabilities
Version History
Introduced before R2006a