Commit graph

991 commits

Author SHA1 Message Date
Peter Edberg
5bfeae38ee ICU-22583 CLDR release-44-1 to ICU maint/maint-74 part 3 (ICU sources: lib, tools, tests) 2023-12-07 10:35:01 -08:00
Peter Edberg
2f7bfd87cb ICU-22326 CLDR release-44-beta5 to ICU main part 3 (ICU sources: lib, tools, tests) 2023-10-26 10:59:18 -07:00
DraganBesevic
72099ee64c ICU-22325 CLDR 44 beta2 integration to ICU part two, source files generated or copied from CLDR 2023-10-04 15:18:56 -07:00
Peter Edberg
e1af930c6a ICU-22325 BRS 74rc move cldr testdata to consistent place, adjust test & tools to match 2023-10-03 10:24:27 -07:00
Markus Scherer
c670bbd5b0 ICU-22420 GB18030 change 3 mappings for GBK/web compat 2023-09-27 08:37:24 -07:00
Peter Edberg
7ff2fff2b8 ICU-22325 CLDR release-44-alpha3 to main part 2 (data generated or copied from CLDR) 2023-09-15 14:02:20 -07:00
DraganBesevic
bb7352990e ICU-22325 CLDR 44 alpha2 integration to ICU part three, source files changes 2023-09-13 11:06:53 -07:00
DraganBesevic
6b08bb601c ICU-22325 CLDR 44 alpha2 integration to ICU part two, source files generated or copied from CLDR 2023-09-13 11:06:53 -07:00
Mihai Nita
5fb2a6ad06 ICU-22324 Mavenization, updating the cldr-to-icu scripts and instructions 2023-09-05 10:24:23 -07:00
Peter Edberg
2270c174a5 ICU-22325 CLDR release-44-alpha1 to main:
- binaries, binary-as-source, CLDR data sources;
  - CLDR test data & dtd, ICU lib/tool/test source updates.
2023-08-22 14:40:51 -07:00
Frank Tang
ffc449de62 ICU-20777 Merge the likelySubtags implemention
Change testdata/likelySubtags.txt to consider FAIL line

ICU-20777 Fix Java Tests

ICU-20777 Fix all issues

ICU-20777 Incase timeout

ICU-20777

ICU-20777 Skip Data Driven test
2023-08-18 09:35:54 -07:00
Markus Scherer
81a6edb287 ICU-22404 Unicode 15.1 data 20230811 plus UTS46 fix 2023-08-16 14:25:22 -07:00
Rich Gillam
56850c9a42 ICU-22402 Add support in ICU and in the CLDR-to-ICU tool for the new nativeSpaceReplacement and parameterDefault
resources for PersonNameFormatter in CLDR. Regenerated the ICU4J data resources as well as the ICU4C resources
to include the new resources.
2023-08-08 14:42:02 -07:00
DraganBesevic
1f07d2b29f ICU-22325 Integrate CLDR 44.1 to ICU, add personName testdata, fix RBBITestMonkey 2023-07-28 16:53:50 -07:00
Elango Cheran
2e45e6ec0e ICU-22404 Unicode 15.1 beta data files & API constants
See #2492

Co-authored-by: Andy Heninger <andy.heninger@gmail.com>
Co-authored-by: Robin Leroy <egg.robin.leroy@gmail.com>
2023-07-13 19:26:14 -07:00
Frank Tang
f00ff4f5e3 ICU-22406 Add LIBRARY_DATA_DIR 2023-06-13 22:01:59 -07:00
Frank Tang
ea7ed9a9db ICU-22406 passing -i to genrb
Passing -i to genrb to include ucadata.icu data in
--disable-shared build
2023-06-13 22:01:59 -07:00
Peter Edberg
7f5d679a98 ICU-22357 Update gb18030 mappings for the -2022 version
See #2430
2023-05-18 08:51:47 -07:00
Peter Edberg
5618203821 ICU-22360 revert portions of #2159 which included @ in ALetter for wordbreak, update tests 2023-05-06 21:36:46 -07:00
Markus Scherer
f4687fc25a ICU-22221 update root collation again from CLDR 43 2023-04-06 08:20:03 -07:00
Peter Edberg
3db74e8ae7 ICU-22220 CLDR release-43-beta2 to ICU main 2023-03-15 20:52:34 -07:00
Shane F. Carr
2a9d0ccdb2 ICU-22283 Add additional ERoundingMode variants
See #2329
2023-03-14 00:51:42 -07:00
Peter Edberg
18f6a3a6e2 ICU-22220 CLDR release-43-alpha2 to ICU main 2023-02-27 11:09:02 -08:00
Markus Scherer
d86b1cebe1 ICU-22220 update root collation from CLDR 43 2023-02-22 17:13:13 -08:00
Peter Edberg
8d411e9b6a ICU-22220 integrate CLDR release-43-m0 to ICU main for 73, update maven-build files 2023-01-10 11:32:24 -08:00
allenwtsu
80fb309c8a ICU-22100 Remove unicode blocks from Japanese ML phrase breaking
See #2278
2023-01-09 17:38:51 -08:00
Shuhei Iitsuka
b6b7b045e9 ICU-22100 Incorporate BudouX into ICU (C++) 2022-12-02 10:11:06 -08:00
Jungshik Shin
05dc2ac924 ICU-22119 Add lw=phrase for Korean using line_*_phrase_cj
brkitr/ko.txt is created to use line_*_.cj.txt for both
lw=phrase and lw != phrase cases for Korean. This is the simplest
way to fix ICU-22119 taking advantage of the fact that ICU
does not have a Korean dictionary so we don't have to worry about
adding the list of Korean particles to keep them attached to the
preceeding word.

The downside is that it only works when the locale is ko or ja while
it should work in any locale. Another is it makes ICU deviate from
CSS3 by using the same CJ (conditonal Japanese) rules for Korean as
well. However, CSS3 spec is wrong on that point and should be changed.
See https://unicode-org.atlassian.net/browse/CLDR-4931 .
2022-11-07 22:30:49 +00:00
Peter Edberg
49b08b414d ICU-21958 integrate CLDR release-42-beta2 to ICU main for 72 2022-09-29 10:12:36 -07:00
Peter Edberg
1de1e36d6f ICU-21957 integrate CLDR release-42-alpha3 to ICU main for 72 2022-09-08 18:19:10 -07:00
Fredrik Roubert
030fa1a479 ICU-21148 Consistently use standard lowercase true/false everywhere.
This is the normal standard way in C, C++ as well as Java and there's no
longer any reason for ICU to be different. The various internal macros
providing custom boolean constants can all be deleted and code as well
as documentation can be updated to use lowercase true/false everywhere.
2022-09-07 20:56:33 +02:00
Markus Scherer
8050af5484 ICU-21980 Unicode 15 update 2022aug30 2022-08-31 16:15:42 -07:00
Peter Edberg
49d192fefe ICU-22112 word break updates for @,colon; colon tailorings for fi,sv
See #2159
2022-08-23 12:45:55 -07:00
allenwtsu
8c669a7c2e ICU-22012 Add more Japanese words into the dictionary 2022-08-23 10:18:45 -07:00
Peter Edberg
ca9bdb9780 ICU-21957 integrate CLDR release-42-alpha2 to ICU main for 72 2022-08-22 13:07:59 -07:00
Peter Edberg
0266970e97 ICU-21957 integrate CLDR release-42-alpha1 to ICU main for 72 2022-08-05 09:39:58 -07:00
Peter Edberg
dcd19ae9bc ICU-21957 integrate CLDR release-42-alpha0 (first with Survey Tool data) to ICU main for 72 (#2142) 2022-07-29 15:32:45 -07:00
Peter Edberg
6394a48d06 ICU-21957 integrate CLDR release-42-m2 (mid milestone) to ICU main for 72 2022-07-14 10:56:39 -07:00
allenwtsu
929cf40ecb ICU-22059 Add one Thai word into the Thai dictionary
See #2112
2022-06-27 09:27:56 -07:00
Peter Edberg
64b3548126 ICU-21957 integrate CLDR release-42-m1 (early milestone) to ICU main for 72 (rebased on main) +
FormattedStringBuilderTest::testInsertOverflow infolns,logKnownIssue skip for CI exhaustive crash
2022-05-27 13:50:43 -07:00
Markus Scherer
3859735e3b ICU-21980 Unicode 15 collation data 2022-05-25 18:23:11 +00:00
Markus Scherer
e1be738ccb ICU-21980 Unicode 15 pre-beta data files, new prop values 2022-05-25 18:23:11 +00:00
allenwtsu
bdcec144b9 ICU-22012 Add four Japanese word into the dictionary
See #2072
2022-05-11 08:19:53 -07:00
Markus Scherer
43d082665e ICU-22006 icupkg: %%ALIAS & %%Parent do not need truncation parent 2022-04-29 17:50:11 +00:00
Peter Edberg
571d12abfb ICU-21409 add word for bell to laodict 2022-03-09 15:14:42 -08:00
Andy Heninger
f783a84d2f ICU-21592 Linebreak loose cj rules cleanup
This is a followup to PR #1991, Update cj normal/loose linebreak per CSS

The original change to the line_loose_cj rules involved splitting hyphens out
of the BA (Break After) class, allowing a break when they follow an ID. This
change simplifies the the rules for doing that.

It also fixes a problem with the original change that had altered the behavior
of BAX hyphens that followed Regional Indicators or Unattached Combining Marks.
2022-02-24 21:27:26 -08:00
Peter Edberg
4cfe96c508 ICU-21592 Update cj normal/loose linebreak per CSS 2022-02-22 13:16:09 -08:00
allenwtsu
7d825cb204 ICU-21699 Add some more particles 2022-02-21 08:54:54 -08:00
allenwtsu
a7b2d9dae1 ICU-21699 Add Japanese particle 2022-02-10 18:50:41 -08:00
allenwtsu
2a7c465284 ICU-21699 Add breakpoint between Japanese and Alphabet 2022-02-09 21:12:49 -08:00