icu/icu4c/source/data/brkitr/rules
2024-12-20 03:54:59 +01:00
..
char.txt ICU-22956 Use InCB for grapheme cluster segmentation 2024-11-12 10:45:16 +01:00
line.txt ICU-22986 GL takes CM 2024-12-20 03:54:59 +01:00
line_cj.txt ICU-22986 GL takes CM 2024-12-20 03:54:59 +01:00
line_loose.txt ICU-22986 GL takes CM 2024-12-20 03:54:59 +01:00
line_loose_cj.txt ICU-22986 GL takes CM 2024-12-20 03:54:59 +01:00
line_loose_phrase_cj.txt ICU-22986 GL takes CM 2024-12-20 03:54:59 +01:00
line_normal.txt ICU-22986 GL takes CM 2024-12-20 03:54:59 +01:00
line_normal_cj.txt ICU-22986 GL takes CM 2024-12-20 03:54:59 +01:00
line_normal_phrase_cj.txt ICU-22986 GL takes CM 2024-12-20 03:54:59 +01:00
line_phrase_cj.txt ICU-22986 GL takes CM 2024-12-20 03:54:59 +01:00
README.md ICU-21257 remove #License fragment from license URLs 2020-09-04 10:02:17 -07:00
sent.txt ICU-20401 rbbi break rules, update comments to match current UAX versions. 2019-02-08 12:53:58 -08:00
sent_el.txt ICU-20401 rbbi break rules, update comments to match current UAX versions. 2019-02-08 12:53:58 -08:00
title.txt ICU-13194 RBBI auto reverse tables: size reduction, and remove hand written rules. 2018-03-28 01:20:13 +00:00
word.txt ICU-22941 Revert "ICU-22112 word break updates for @,colon; colon tailorings for fi,sv" 2024-11-05 22:59:24 +01:00
word_POSIX.txt ICU-22941 Revert "ICU-22112 word break updates for @,colon; colon tailorings for fi,sv" 2024-11-05 22:59:24 +01:00

Break Iterator Rule Source Data

This directory contains rule based break iterator rule files, one set per file.

The set of rules to be included for each locale is defined in the parent directory, icu/icu4c/source/data/brkitr. Most locales fall back to root rules, which are from char.txt, word.txt, line.txt and sent.txt for, respectively, grapheme cluster, word, line and sentence breaks.