home / documentation / v16 / string_decoder

String decoder

目录

自 v0.10.0 版本开始新增

稳定级别:2 - Stable

The node:string_decoder module provides an API for decoding Buffer objects into strings in a manner that preserves encoded multi-byte UTF-8 and UTF-16 characters. It can be accessed using:

JS

The following example shows the basic use of the StringDecoder class.

JS

When a Buffer instance is written to the StringDecoder instance, an internal buffer is used to ensure that the decoded string does not contain any incomplete multibyte characters. These are held in the buffer until the next call to stringDecoder.write() or until stringDecoder.end() is called.

In the following example, the three UTF-8 encoded bytes of the European Euro symbol () are written over three separate operations:

JS

C StringDecoder

M new StringDecoder([encoding])

自 v0.1.99 版本开始新增

  • encoding string The character encoding the StringDecoder will use. Default: 'utf8'.

Creates a new StringDecoder instance.

M stringDecoder.end([buffer])

自 v0.9.3 版本开始新增

Returns any remaining input stored in the internal buffer as a string. Bytes representing incomplete UTF-8 and UTF-16 characters will be replaced with substitution characters appropriate for the character encoding.

If the buffer argument is provided, one final call to stringDecoder.write() is performed before returning the remaining input. After end() is called, the stringDecoder object can be reused for new input.

M stringDecoder.write(buffer)

历史
版本历史变更
v8.0.0Each invalid character is now replaced by a single replacement character instead of one for each individual byte.
v0.1.99自 v0.1.99 版本开始新增

Returns a decoded string, ensuring that any incomplete multibyte characters at the end of the Buffer, or TypedArray, or DataView are omitted from the returned string and stored in an internal buffer for the next call to stringDecoder.write() or stringDecoder.end().