Character encoding

Character encoding defines a mapping between bytes and text. The same sequence of bytes can be read as different text depending on which encoding is assumed. By specifying a particular encoding (such as UTF-8), we specify how the sequence of bytes is to be interpreted.
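As a rough illustration of how one byte sequence can yield different text, the following Python sketch decodes the same five bytes twice. The word "café" and the choice of Latin-1 as the second encoding are assumptions made for this example only.

```python
# Five bytes: the UTF-8 encoding of "café" (the "é" takes two bytes, C3 A9).
data = bytes([0x63, 0x61, 0x66, 0xC3, 0xA9])

# Interpreted as UTF-8, the last two bytes form one character.
as_utf8 = data.decode("utf-8")

# Interpreted as Latin-1 (ISO-8859-1), every byte is its own character.
as_latin1 = data.decode("latin-1")

print(as_utf8)    # café
print(as_latin1)  # cafÃ©
```

The bytes never change; only the declared encoding does, which is why a wrong or missing declaration tends to produce garbled text.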

For example, in HTML we normally declare a character encoding of UTF-8, using the following line:

html

<meta charset="utf-8" />

This tells the browser to decode the document as UTF-8, so you can use characters from just about any human language in your HTML document and they will display reliably, provided the file itself is saved as UTF-8.
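The breadth of UTF-8 can be checked with a small Python sketch. The sample string is a hypothetical example, and ASCII is used only as a contrasting, narrower encoding.

```python
# Sample text mixing Latin, Cyrillic, Japanese, and Arabic scripts.
text = "Hello, Привет, こんにちは, مرحبا"

# UTF-8 can encode every character, and decoding restores the original text.
encoded = text.encode("utf-8")
round_trip = encoded.decode("utf-8")

# ASCII covers only 128 characters, so encoding the same text fails.
try:
    text.encode("ascii")
    ascii_ok = True
except UnicodeEncodeError:
    ascii_ok = False

print(round_trip == text)  # True
print(ascii_ok)            # False
```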

This page was last modified on Jul 11, 2025 by MDN contributors.
