Intl.Segmenter() constructor
Baseline
2024
Newly available
Since April 2024, this feature works across the latest devices and browser versions. This feature might not work in older devices or browsers.
The Intl.Segmenter() constructor creates Intl.Segmenter objects.
Try it
const segmenterFr = new Intl.Segmenter("fr", { granularity: "word" });
const string = "Que ma joie demeure";
const iterator = segmenterFr.segment(string)[Symbol.iterator]();
console.log(iterator.next().value.segment);
// Expected output: 'Que'
console.log(iterator.next().value.segment);
// Expected output: ' '
Syntax
new Intl.Segmenter()
new Intl.Segmenter(locales)
new Intl.Segmenter(locales, options)
Parameters
localesOptional-
A string with a BCP 47 language tag or an
Intl.Localeinstance, or an array of such locale identifiers. The runtime's default locale is used whenundefinedis passed or when none of the specified locale identifiers is supported. For the general form and interpretation of thelocalesargument, see the parameter description on theIntlmain page. optionsOptional-
An object containing the following properties, in the order they are retrieved (all of them are optional):
localeMatcher-
The locale matching algorithm to use. Possible values are
"lookup"and"best fit"; the default is"best fit". For information about this option, see Locale identification and negotiation. granularity-
How granularly should the input be split. Possible values are:
"grapheme"(default)-
Split the input into segments at grapheme cluster (user-perceived character) boundaries, as determined by the locale.
"word"-
Split the input into segments at word boundaries, as determined by the locale.
"sentence"-
Split the input into segments at sentence boundaries, as determined by the locale.
Return value
A new Intl.Segmenter instance.
Exceptions
RangeError-
Thrown if
localesoroptionscontain invalid values.
Examples
>Basic usage
The following example shows how to count words in a string using the Japanese language (where splitting the string using String methods would have given an incorrect result).
const text = "吾輩は猫である。名前はたぬき。";
const japaneseSegmenter = new Intl.Segmenter("ja-JP", { granularity: "word" });
console.log(
[...japaneseSegmenter.segment(text)].filter((segment) => segment.isWordLike)
.length,
);
// 8, as the text is segmented as '吾輩'|'は'|'猫'|'で'|'ある'|'。'|'名前'|'は'|'たぬき'|'。'
Specifications
| Specification |
|---|
| ECMAScript® 2026 Internationalization API Specification> # sec-intl-segmenter-constructor> |
Browser compatibility
Loading…