8291660: Grapheme support in BreakIterator

Reviewed-by: smarks
Naoto Sato 2022-09-09 17:13:51 +00:00
parent a14c3a493a
commit b8598b0297
15 changed files with 245 additions and 149 deletions
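
For orientation, here is a minimal sketch of how the character boundary analysis touched by this commit is typically consumed. It assumes only the public `java.text.BreakIterator` API; the class name `GraphemeSplit` and the helper `split` are hypothetical and not part of the change. The sample input is a base letter followed by a combining accent, which should come back as one user-perceived character. Results for more complex input, such as emoji sequences, may differ between JDK builds with and without this change.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; not part of the commit.
class GraphemeSplit {
    // Splits text at the boundaries reported by the character BreakIterator.
    static List<String> split(String text) {
        BreakIterator it = BreakIterator.getCharacterInstance();
        it.setText(text);
        List<String> clusters = new ArrayList<>();
        int start = it.first();
        // next() returns BreakIterator.DONE once the end of the text has been passed.
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            clusters.add(text.substring(start, end));
        }
        return clusters;
    }

    public static void main(String[] args) {
        // "e" followed by a combining acute accent (U+0301), then "a".
        List<String> clusters = split("e\u0301a");
        System.out.println(clusters.size() + " user-perceived characters");
    }
}
```

Counting with `String.length()` on the same input would report three `char` values, which is the gap the character iterator is meant to close.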

@@ -1,5 +1,5 @@
/*
- * Copyright (c) 1996, 2021, Oracle and/or its affiliates. All rights reserved.
+ * Copyright (c) 1996, 2022, Oracle and/or its affiliates. All rights reserved.
* DO NOT ALTER OR REMOVE COPYRIGHT NOTICES OR THIS FILE HEADER.
*
* This code is free software; you can redistribute it and/or modify it
@@ -99,6 +99,12 @@ import sun.util.locale.provider.LocaleServiceProviderPool;
* and a diacritical mark. What users consider to be a character can
* differ between languages.
*
+ * @implSpec The default implementation of the character boundary analysis
+ * conforms to the Unicode Consortium's Extended Grapheme Cluster breaks.
+ * For more detail, refer to
+ * <a href="https://www.unicode.org/reports/tr29/#Grapheme_Cluster_Boundaries">
+ * Grapheme Cluster Boundaries</a> section in the Unicode Standard Annex #29.
+ *
* <p>
* The {@code BreakIterator} instances returned by the factory methods
* of this class are intended for use with natural languages only, not for