mirror of
https://github.com/unicode-org/icu.git
synced 2025-04-14 09:21:03 +00:00
The home of the ICU project source code.
Change the implementation of grapheme cluster matching in regex to use an ICU break iterator instead of a little one-off state machine. The old implementation had fallen behind the Unicode UAX-29 specification for graphem clusters, and could not be easily updated. The implementation follows the same general pattern that is used for finding word boundaries with an ICU break iterator. In reviewing that code, a few improvements to the handling of ICU error codes were also made. Also note that this change adds a new dependency on Break Iteration. Regex patterns that previously would work with ICU builds that were configured with no break iteration will now fail. But only if they include \X for matching grapheme cluster boundaries. |
||
---|---|---|
.ci-builds | ||
.github | ||
docs | ||
icu4c | ||
icu4j | ||
tools | ||
vendor/double-conversion | ||
.appveyor.yml | ||
.cpyskip.txt | ||
.gitattributes | ||
.gitignore | ||
.travis.yml | ||
KEYS | ||
README.md |
International Components for Unicode
This is the repository for the International Components for Unicode. The ICU project is under the stewardship of The Unicode Consortium.
Build Status (master branch)
Build | Status |
---|---|
TravisCI | |
Azure Pipelines | |
Azure Pipelines (Exhaustive Tests) | |
AppVeyor | |
Fuzzing |
Subdirectories and Information
icu4c/
ICU for C/C++icu4j/
ICU for Javatools/
Toolsvendor/
Vendor dependencies
License
Please see ./icu4c/LICENSE (C and J are under an identical license file.)
Copyright © 2016 and later Unicode, Inc. and others. All Rights Reserved. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. Terms of Use and License