Characters whose Unicode scalar values are greater than U+FFFF (such as some rare Chinese/Japanese/Korean/Vietnamese characters and some emoji) are stored in UTF-16 as two surrogate code units each, known as a surrogate pair. For example, a string containing the single character U+1F600 "grinning face" emoji (
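
The surrogate-pair behaviour can be observed directly in JavaScript, whose strings are sequences of UTF-16 code units. The following is a minimal sketch, assuming a runtime that supports ES2015 code point escapes and `codePointAt`; the variable names are illustrative and do not come from the original page.

```javascript
// U+1F600 written with a code point escape; it lies above U+FFFF.
const face = "\u{1F600}";

// `length` counts UTF-16 code units, so this single character reports 2.
const codeUnits = face.length;

// String iteration walks code points, so spreading yields one element.
const codePoints = [...face].length;

// The two code units are a high (lead) and a low (trail) surrogate.
const high = face.charCodeAt(0).toString(16); // "d83d"
const low = face.charCodeAt(1).toString(16);  // "de00"

// codePointAt(0) recombines the pair into the original scalar value.
const scalar = face.codePointAt(0).toString(16); // "1f600"

console.log(codeUnits, codePoints, high, low, scalar);
```

Code that needs to count or index user-visible characters would therefore iterate by code point rather than rely on `length` or `charCodeAt` alone.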
