Skip to content

Add support for processing surrogate pairs in sequenced \uD800\uDC00 items #12

@harningt

Description

@harningt

Surrogate pairs as described in the title will be decomposed into a pair of 3-byte utf-8 encoded chunks that are technically invalid. These should be decomposed into a 4-byte utf-8 encoded value for the represented codepoint.

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions