Robin Leroy
8d86ca142e
ICU-22941 Revert "ICU-22112 word break updates for @,colon; colon tailorings for fi,sv"
...
This reverts commit 49d192fefe
.
2024-11-05 22:59:24 +01:00
Jungshik Shin
05dc2ac924
ICU-22119 Add lw=phrase for Korean using line_*_phrase_cj
...
brkitr/ko.txt is created to use line_*_.cj.txt for both
lw=phrase and lw != phrase cases for Korean. This is the simplest
way to fix ICU-22119 taking advantage of the fact that ICU
does not have a Korean dictionary so we don't have to worry about
adding the list of Korean particles to keep them attached to the
preceeding word.
The downside is that it only works when the locale is ko or ja while
it should work in any locale. Another is it makes ICU deviate from
CSS3 by using the same CJ (conditonal Japanese) rules for Korean as
well. However, CSS3 spec is wrong on that point and should be changed.
See https://unicode-org.atlassian.net/browse/CLDR-4931 .
2022-11-07 22:30:49 +00:00
Peter Edberg
49d192fefe
ICU-22112 word break updates for @,colon; colon tailorings for fi,sv
...
See #2159
2022-08-23 12:45:55 -07:00
allenwtsu
7d825cb204
ICU-21699 Add some more particles
2022-02-21 08:54:54 -08:00
Peter Edberg
398489b915
ICU-21900 integrate CLDR release-41-alpha2 to ICU main for 71 front-load
2022-02-15 21:51:09 -08:00
Peter Edberg
008fddfaac
ICU-21900 integrate CLDR release-41-alpha1 to ICU main for 71 front-load
2022-02-14 12:09:15 -08:00
Peter Edberg
2f8749a026
ICU-21900 integrate CLDR release-41-alpha0 to ICU main for 71 front-load
2022-02-07 22:02:36 -08:00
allenwtsu
d0290c03db
ICU-21699 Phrase based breaking(C++)
...
See #1936
2022-01-13 20:22:05 -08:00
Markus Scherer
75ac80bd68
ICU-21580 change site.icu-project.org to icu.unicode.org etc
2021-10-21 15:54:42 -07:00
Peter Edberg
49dda34fb1
ICU-21581 integrate CLDR 40a0 to ICU trunk
2021-08-18 23:59:19 -07:00
Peter Edberg
31182a99b4
ICU-21480 integrate CLDR release-39-alpha4 to ICU trunk
2021-02-25 10:19:57 -08:00
Victor Chang
d56f291178
ICU-20659 Fix DTD link in XML data files
...
- http://www.unicode.org/repos/cldr/trunk/common/dtd/ldml.dtd returns
HTTP 302 error and redirects to an html page, not a dtd content
apparently.
- Clone the dtd files from CLDR release-35-1
https://raw.githubusercontent.com/unicode-org/cldr/release-35-1/common/dtd/ldml.dtd
2019-07-11 09:19:29 -07:00
Dongyuan Liu
46a888be87
ICU-13441 For zh/ja, tailor linebreak classes for quotations such as “ 201C and ” 201D
2018-11-14 19:53:12 -08:00
Peter Edberg
b6074fe044
ICU-20119 63rc BRS, integrate cldr 34-alpha2, part 1 icu4c
2018-09-27 14:27:41 -07:00
Andy Heninger
254e5f9580
ICU-13420 svn properties check tool fix, and prop update of files match autoprops settings.
...
X-SVN-Rev: 40674
2017-11-29 19:32:58 +00:00
Andy Heninger
309364fee5
ICU-13049 svn utf-8 & other property fixes.
...
X-SVN-Rev: 39844
2017-03-17 00:37:59 +00:00
Michael Ow
61607c2773
ICU-12564 Update copyright notice in trunk
...
X-SVN-Rev: 38848
2016-06-15 18:58:17 +00:00
Yoshito Umaoka
00ca13e126
ICU-12564 Reverted r38761 and r38762, because we want to prepend the Unicode copyright for existing source files, instead of replacing copyright comments.
...
X-SVN-Rev: 38776
2016-05-31 21:45:07 +00:00
Michael Ow
c9f199a30f
ICU-12564 Update copyright notice in ICU4C
...
X-SVN-Rev: 38761
2016-05-26 22:32:17 +00:00
George Rhoten
d7e92f2c9a
ICU-9503 Undo removal of lenient parse data. Only English wasn't moved to CLDR.
...
X-SVN-Rev: 38461
2016-03-02 08:16:29 +00:00
John Emmons
75ed4ce808
ICU-11728 First cut CLDR 28 data integration
...
X-SVN-Rev: 37524
2015-06-10 18:38:06 +00:00
Andy Heninger
6a4799e345
ICU-11608 remove lines with $ svn keywords
...
X-SVN-Rev: 37367
2015-04-20 20:43:56 +00:00
John Emmons
26a401e17a
ICU-10750 Remove obsolete files from source/data/xml
...
X-SVN-Rev: 37187
2015-03-07 16:06:51 +00:00
John Emmons
368eb4bb16
ICU-11555 Integrate CLDR 27 data
...
X-SVN-Rev: 37169
2015-03-06 22:58:33 +00:00
Peter Edberg
3565561b05
ICU-9379 Draft new linebreak files & related generated data per cldrbug 4931
...
X-SVN-Rev: 37037
2015-02-18 08:37:16 +00:00
Peter Edberg
d87c86274c
ICU-10326 Add dictionary-based word/line break for Burmese/Myanmar
...
X-SVN-Rev: 36397
2014-09-08 22:16:21 +00:00
Peter Edberg
1b8eb15e1a
ICU-11173 CLDR tags/release-26-d01 into ICU4C trunk with related test & lib code updates
...
X-SVN-Rev: 36313
2014-09-02 23:18:20 +00:00
John Emmons
7525392d15
ICU-10745 Merge CLDR25 data into trunk
...
X-SVN-Rev: 35429
2014-03-12 04:34:00 +00:00
Steven R. Loomis
4c308228b1
ICU-10286 check in stub xml files so that the .txt gets generated. See ICU-10750 for removal of such stubs.
...
X-SVN-Rev: 35349
2014-03-05 23:25:48 +00:00
John Emmons
a869b0d483
ICU-10335 Merge completed CLDR24 branch into trunk.
...
X-SVN-Rev: 34238
2013-09-07 20:46:42 +00:00
Peter Edberg
bf4126616b
ICU-7647 Add/use LaoBreakEngine and laodict.txt; more useful messages in gendict
...
X-SVN-Rev: 34229
2013-09-06 23:43:13 +00:00
Jennifer Chye
22ffd50c07
ICU-10246 Remove extra lenient-parse rules from source/data/xml (again).
...
X-SVN-Rev: 34023
2013-08-08 22:56:45 +00:00
Jennifer Chye
7f28ebe179
ICU-10246 Revert changeset 33903 for until 52m1 is complete.
...
X-SVN-Rev: 33909
2013-07-11 17:04:13 +00:00
Jennifer Chye
0530fbf243
ICU-10246 Remove duplicate lenient-parse rules in data/xml/rbnf.
...
X-SVN-Rev: 33903
2013-07-10 20:55:11 +00:00
John Emmons
28932c056e
ICU-9890 Merge CLDR23 data
...
X-SVN-Rev: 33355
2013-03-03 21:34:51 +00:00
Peter Edberg
6478ab13ae
ICU-9876 Add data/xml/main/ms_Latn.xml
...
X-SVN-Rev: 33082
2013-01-26 07:49:38 +00:00
John Emmons
276a244c9b
ICU-9251 First cut merge of CLDR 22 data
...
X-SVN-Rev: 32275
2012-08-28 21:56:06 +00:00
Maxime Serrano
43a4d7c0d4
ICU-9353 remove last mentions of word_ja.txt
...
X-SVN-Rev: 32196
2012-08-17 21:58:20 +00:00
Maxime Serrano
c64c0299d7
ICU-9353 merge dbbi-tries work into the trunk
...
X-SVN-Rev: 32184
2012-08-16 23:01:49 +00:00
Peter Edberg
42eb71a706
ICU-9034 Delete obsolete <icu:isLeapMonth> in data/xml/main/root.xml
...
X-SVN-Rev: 31210
2012-01-16 07:12:30 +00:00
Peter Edberg
cec4d76254
ICU-8978 Integrate CLDR 21m2 data. Update dtfmttst.cpp for timezone name cleanup.
...
Update transrt.cpp to exclude 0970 from roundtrip tests; it was now included because
Unicode 6.1 moved it from Common to Devanagari, but it has no mapping from InterIndic
to anything else.
X-SVN-Rev: 31074
2011-12-09 08:39:46 +00:00
John Emmons
7a0d90c14e
ICU-8556 Merge CLDR release-2-0-d02 data
...
X-SVN-Rev: 30101
2011-05-12 02:11:29 +00:00
Peter Edberg
6f1601400e
ICU-8539 Add ja linebreak tailoring to match CSS normal; break before small kana and prolonged mark
...
X-SVN-Rev: 30061
2011-05-09 08:16:34 +00:00
John Emmons
2d4a2ae78f
ICU-8489 Merge CLDR release-2-0-d01 into ICU
...
X-SVN-Rev: 30037
2011-05-05 18:12:27 +00:00
Peter Edberg
7aaca9b950
ICU-8329 Roll in Khmer dictionary word break code from George, data from Nathan/sbbic.org
...
X-SVN-Rev: 30019
2011-05-04 13:25:37 +00:00
Peter Edberg
33120b6943
ICU-8046 CLDR 1.9 integration, fix he,fi brkitr
...
X-SVN-Rev: 28914
2010-10-26 07:38:20 +00:00
Peter Edberg
b52138122f
ICU-8046 CLDR 1.9 integration pass 3 (CLDR r5147, still minus 14 transliterators)
...
X-SVN-Rev: 28906
2010-10-25 22:37:20 +00:00
Peter Edberg
8c3fb82cd5
ICU-8046 CLDR 1.9 integration pass 1, hand-edited files in data/
...
X-SVN-Rev: 28846
2010-10-18 05:15:17 +00:00
Peter Edberg
8cab1047c4
ICU-7965 th uses legacy clusters with 0E33 0EB3 added to Extend; remove test timebomb
...
X-SVN-Rev: 28663
2010-09-21 03:38:50 +00:00
Peter Edberg
f3316f90bc
ICU-7928 Update files that control building of cldr data for ICU
...
X-SVN-Rev: 28605
2010-09-13 19:08:21 +00:00