Commit graph

270 commits

Author SHA1 Message Date
Peter Edberg
3333fd557c ICU-21480 integrate CLDR release 39 alpha0 to ICU trunk 2021-02-10 11:58:26 -08:00
Peter Edberg
43aa7ddda2 ICU-21249 integrate CLDR release-38-beta to ICU trunk 2020-09-25 09:33:13 -07:00
Peter Edberg
7888b23e87 ICU-21249 integrate CLDR release-38-alpha2 to ICU trunk 2020-09-16 15:10:05 -07:00
Markus Scherer
9971c663ff ICU-21257 remove #License fragment from license URLs 2020-09-04 10:02:17 -07:00
Peter Edberg
e618a1cc2d ICU-21249 integrate CLDR release 38 alpha0 to ICU trunk 2020-09-02 10:23:14 -07:00
Peter Edberg
63e480dedc ICU-20987 integrate CLDR release-37-alpha1 to master (using new tooling) 2020-02-27 16:27:50 -08:00
Markus Scherer
af9ef2650b ICU-20893 Unicode 13 data 2020feb19 2020-02-19 22:02:35 -08:00
Andy Heninger
197e0239ab ICU-20893 Line break tailorings updated to Unicode 13. 2019-11-26 15:25:06 -08:00
Shane Carr
017c8b762e ICU-20890 Change locale_dependencies.py into LOCALE_DEPS.json files
- Refactors Python to make I/O operations more abstract
- Adds stable sample data for Python test
2019-11-22 20:23:30 -08:00
Peter Edberg
04c8616f93 ICU-20857 integrate CLDR release-36-1-preview to maint-66 2019-11-22 19:01:36 -08:00
Markus Scherer
a7e378d587 ICU-20893 Unicode 13 beta
See PR #915, see changes.txt
- Unicode 13 beta data as of 2019-nov-21
- uprops.icu format version 7.7 with more bits for Script/Script_Extensions
- more bits in spoof checker ScriptSet
- root line break rules adjusted for UAX 14 changes, from Andy
- line break tailorings not yet in sync with root
2019-11-21 17:35:53 -08:00
Peter Edberg
7814980f51 ICU-20749 add ອາແມນ to laodict.txt 2019-09-11 14:46:40 -07:00
Peter Edberg
e25796f6e5 ICU-20801 integrate CLDR release-36-alpha2, update MeasureUnits (#809) 2019-09-06 14:07:36 -07:00
Andy Heninger
fa240d49cc ICU-13637 Break Iterator Rule Updates for Indic Grapheme Clusters. 2019-06-27 17:17:26 -07:00
Andy Heninger
5c23416308 ICU-13637 Documentation for doing break iterator updates. 2019-06-21 10:31:40 -07:00
Andy Heninger
84d56bc91b ICU-20404 Simplify word break rules. 2019-06-18 14:14:22 -07:00
yumaoka
163793824f ICU-20535 Added Japanese new era Reiwa to cjdict.txt to support the sequence in word segmentation. 2019-04-03 18:46:12 -04:00
Shane Carr
56ffae8a0b ICU-20489 Removing resfiles.mk files.
Builds res_index.txt based on directory glob minus aliases read from deprecates XML file.

In ICU 64, please use the ICU Data Build Tool instead of reslocal.mk for locale filtering.
2019-03-14 01:51:52 -07:00
Peter Edberg
84e392e426 ICU-20439 64 BRS, integrate CLDR release-35-alpha4 2019-03-13 09:12:40 -07:00
Peter Edberg
30d2034597 ICU-20438 64rc BRS, integrate CLDR alpha2, update MeasureUnit APIs [& resolve conflicts] (#485) 2019-02-24 22:28:51 -08:00
Andy Heninger
64f4dd64e2 ICU-12017 Improve line break around numbers. 2019-02-08 13:54:14 -08:00
Andy Heninger
b50f97a58a ICU-20401 rbbi break rules, update comments to match current UAX versions. 2019-02-08 12:53:58 -08:00
Andy Heninger
58786f5cbe ICU-12079 Full width digits become numeric for word break. 2018-12-18 16:36:06 -08:00
Dongyuan Liu
46a888be87 ICU-13441 For zh/ja, tailor linebreak classes for quotations such as “ 201C and ” 201D 2018-11-14 19:53:12 -08:00
Peter Edberg
b6074fe044
ICU-20119 63rc BRS, integrate cldr 34-alpha2, part 1 icu4c 2018-09-27 14:27:41 -07:00
Andy Heninger
740b24118f
ICU-8151 Simplify Finnish Line Break Tailoring, move to root. (#99) 2018-09-27 14:27:39 -07:00
Andy Heninger
fa5ae3dc45
ICU-7270 Line Break rule LB8 updated for UAX-14 conformance. (#41)
Includes all line break tailorings.
Corresponding updates to monkey test rules.
State table builder, fix missed table optimization, uncovered by new rule.
2018-09-27 14:27:38 -07:00
Peter Edberg
4e6358240c ICU-13792 Integrate data from cldr/tags/release-33-1-d02 to ICU trunk for ICU62rc
X-SVN-Rev: 41479
2018-05-29 22:33:10 +00:00
Andy Heninger
50fc9ce878 ICU-13770 Line Break Rules for Unicode 11, tailorings updated.
X-SVN-Rev: 41410
2018-05-18 17:47:03 +00:00
Andy Heninger
6e40252eb3 ICU-13770 RBBI Line break rules update, work in progress.
X-SVN-Rev: 41404
2018-05-17 18:07:43 +00:00
Andy Heninger
14cfaac98c ICU-13770 RBBI Line Break rules for Unicode 11. Root working; tailorings remain to do.
X-SVN-Rev: 41390
2018-05-16 23:26:58 +00:00
Andy Heninger
0b8b2750e3 ICU-13194 RBBI safe rule synthesis, merge to trunk.
X-SVN-Rev: 41210
2018-04-07 00:03:36 +00:00
Andy Heninger
b58e06a604 ICU-13630 WSegSpace, use word property value in rules definitions.
X-SVN-Rev: 41205
2018-04-06 17:31:18 +00:00
Andy Heninger
4e097dc49f ICU-13630 Update RBBI for Unicode 11 data.
X-SVN-Rev: 41194
2018-04-04 23:37:43 +00:00
Markus Scherer
03303a6cb6 ICU-13630 Unicode 11 beta data apr02 (security apr03), fix ICU4C tests except RBBI
X-SVN-Rev: 41191
2018-04-03 23:09:49 +00:00
Andy Heninger
aead9fb553 ICU-13194 RBBI auto reverse tables: size reduction, and remove hand written rules.
X-SVN-Rev: 41163
2018-03-28 01:20:13 +00:00
Markus Scherer
af6a771267 ICU-13630 implement, test, use emoji property Extended_Pictographic
X-SVN-Rev: 41094
2018-03-12 05:53:02 +00:00
Markus Scherer
b3aec18a3c ICU-13630 ucase.icu formatVersion 4: more compressible exceptions, and more room for future exceptions growth
X-SVN-Rev: 41093
2018-03-12 00:15:40 +00:00
Peter Edberg
c033aaaf41 ICU-13603 Integrate data from CLDR release-33-alpha for ICU 61rc
X-SVN-Rev: 40980
2018-02-25 01:19:24 +00:00
Peter Edberg
6d58ccae2a ICU-13474 Integrate cldr/tags/release-32-0-1.beta1 into ICU, update tests
X-SVN-Rev: 40685
2017-12-02 00:02:56 +00:00
Peter Edberg
dbc623d0d8 ICU-13359 Integrate updated CLDR 32 @13761 for 60rc. The previous issues #13366,#13368,#13374,#13390,#13391 are fixed.
X-SVN-Rev: 40582
2017-10-06 00:46:48 +00:00
Peter Edberg
3bfe134a86 ICU-13359 Integrate CLDR 32 alpha data into ICU for ICU 60rc; known issues #13366,#13368
X-SVN-Rev: 40463
2017-09-26 21:20:01 +00:00
Andy Heninger
e220fe9dd6 ICU-9954 rbbi forward iteration performance improvement. Ensure rules always make some progress; keep engine out of the fail-safe fall back path for rules that do not advance.
X-SVN-Rev: 40461
2017-09-26 20:16:03 +00:00
Andy Heninger
4e1c4096a6 ICU-9954 Break Iteration, remove reverse rules, add boundary caching.
X-SVN-Rev: 40433
2017-09-19 18:17:22 +00:00
Andy Heninger
6d7c1bc487 ICU-13261 remove gratuitous difference between line_normal_cj.txt and line_loose_cj.txt.
X-SVN-Rev: 40326
2017-08-11 20:54:22 +00:00
Peter Edberg
c70ebe3ce1 ICU-13159 Integrate CLDR trunk@13513 (cldr/tags/icu-milestone-60-0-1)
X-SVN-Rev: 40193
2017-06-22 23:05:41 +00:00
Peter Edberg
1d8d98451d ICU-13099 BRS, integrate data from CLDR release-31-0-1.RC2; ICU4J test failure due to #13101
X-SVN-Rev: 39983
2017-04-05 00:51:27 +00:00
Andy Heninger
77416a6e7c ICU-13058 Break iteration, propagate emoji rule updates into remaining unfixed rule tailorings.
X-SVN-Rev: 39925
2017-03-24 05:54:13 +00:00
Andy Heninger
af55f69558 ICU-13058 Break iteration tests & rules update for new prop data. Tests are now passing, but changes are not yet propagated into all rule tailorings.
X-SVN-Rev: 39922
2017-03-24 01:31:00 +00:00
Andy Heninger
51e21b5242 ICU-13058 Add RBBI test of Unicode emoji-test.txt file, and partial update of break rules recent emoji changes.
X-SVN-Rev: 39909
2017-03-23 00:20:20 +00:00