Robin Leroy
8d86ca142e
ICU-22941 Revert "ICU-22112 word break updates for @,colon; colon tailorings for fi,sv"
...
This reverts commit 49d192fefe
.
2024-11-05 22:59:24 +01:00
Jungshik Shin
05dc2ac924
ICU-22119 Add lw=phrase for Korean using line_*_phrase_cj
...
brkitr/ko.txt is created to use line_*_.cj.txt for both
lw=phrase and lw != phrase cases for Korean. This is the simplest
way to fix ICU-22119 taking advantage of the fact that ICU
does not have a Korean dictionary so we don't have to worry about
adding the list of Korean particles to keep them attached to the
preceeding word.
The downside is that it only works when the locale is ko or ja while
it should work in any locale. Another is it makes ICU deviate from
CSS3 by using the same CJ (conditonal Japanese) rules for Korean as
well. However, CSS3 spec is wrong on that point and should be changed.
See https://unicode-org.atlassian.net/browse/CLDR-4931 .
2022-11-07 22:30:49 +00:00
Peter Edberg
49d192fefe
ICU-22112 word break updates for @,colon; colon tailorings for fi,sv
...
See #2159
2022-08-23 12:45:55 -07:00
allenwtsu
7d825cb204
ICU-21699 Add some more particles
2022-02-21 08:54:54 -08:00
Peter Edberg
008fddfaac
ICU-21900 integrate CLDR release-41-alpha1 to ICU main for 71 front-load
2022-02-14 12:09:15 -08:00
Peter Edberg
2f8749a026
ICU-21900 integrate CLDR release-41-alpha0 to ICU main for 71 front-load
2022-02-07 22:02:36 -08:00
allenwtsu
d0290c03db
ICU-21699 Phrase based breaking(C++)
...
See #1936
2022-01-13 20:22:05 -08:00
Markus Scherer
75ac80bd68
ICU-21580 change site.icu-project.org to icu.unicode.org etc
2021-10-21 15:54:42 -07:00
Peter Edberg
49dda34fb1
ICU-21581 integrate CLDR 40a0 to ICU trunk
2021-08-18 23:59:19 -07:00
Victor Chang
d56f291178
ICU-20659 Fix DTD link in XML data files
...
- http://www.unicode.org/repos/cldr/trunk/common/dtd/ldml.dtd returns
HTTP 302 error and redirects to an html page, not a dtd content
apparently.
- Clone the dtd files from CLDR release-35-1
https://raw.githubusercontent.com/unicode-org/cldr/release-35-1/common/dtd/ldml.dtd
2019-07-11 09:19:29 -07:00
Dongyuan Liu
46a888be87
ICU-13441 For zh/ja, tailor linebreak classes for quotations such as “ 201C and ” 201D
2018-11-14 19:53:12 -08:00
Peter Edberg
b6074fe044
ICU-20119 63rc BRS, integrate cldr 34-alpha2, part 1 icu4c
2018-09-27 14:27:41 -07:00
Andy Heninger
254e5f9580
ICU-13420 svn properties check tool fix, and prop update of files match autoprops settings.
...
X-SVN-Rev: 40674
2017-11-29 19:32:58 +00:00
Andy Heninger
309364fee5
ICU-13049 svn utf-8 & other property fixes.
...
X-SVN-Rev: 39844
2017-03-17 00:37:59 +00:00
Michael Ow
61607c2773
ICU-12564 Update copyright notice in trunk
...
X-SVN-Rev: 38848
2016-06-15 18:58:17 +00:00
Yoshito Umaoka
00ca13e126
ICU-12564 Reverted r38761 and r38762, because we want to prepend the Unicode copyright for existing source files, instead of replacing copyright comments.
...
X-SVN-Rev: 38776
2016-05-31 21:45:07 +00:00
Michael Ow
c9f199a30f
ICU-12564 Update copyright notice in ICU4C
...
X-SVN-Rev: 38761
2016-05-26 22:32:17 +00:00
John Emmons
75ed4ce808
ICU-11728 First cut CLDR 28 data integration
...
X-SVN-Rev: 37524
2015-06-10 18:38:06 +00:00
Andy Heninger
6a4799e345
ICU-11608 remove lines with $ svn keywords
...
X-SVN-Rev: 37367
2015-04-20 20:43:56 +00:00
John Emmons
368eb4bb16
ICU-11555 Integrate CLDR 27 data
...
X-SVN-Rev: 37169
2015-03-06 22:58:33 +00:00
Peter Edberg
3565561b05
ICU-9379 Draft new linebreak files & related generated data per cldrbug 4931
...
X-SVN-Rev: 37037
2015-02-18 08:37:16 +00:00
Peter Edberg
d87c86274c
ICU-10326 Add dictionary-based word/line break for Burmese/Myanmar
...
X-SVN-Rev: 36397
2014-09-08 22:16:21 +00:00
Steven R. Loomis
4c308228b1
ICU-10286 check in stub xml files so that the .txt gets generated. See ICU-10750 for removal of such stubs.
...
X-SVN-Rev: 35349
2014-03-05 23:25:48 +00:00
Peter Edberg
bf4126616b
ICU-7647 Add/use LaoBreakEngine and laodict.txt; more useful messages in gendict
...
X-SVN-Rev: 34229
2013-09-06 23:43:13 +00:00
Maxime Serrano
43a4d7c0d4
ICU-9353 remove last mentions of word_ja.txt
...
X-SVN-Rev: 32196
2012-08-17 21:58:20 +00:00
Maxime Serrano
c64c0299d7
ICU-9353 merge dbbi-tries work into the trunk
...
X-SVN-Rev: 32184
2012-08-16 23:01:49 +00:00
Peter Edberg
cec4d76254
ICU-8978 Integrate CLDR 21m2 data. Update dtfmttst.cpp for timezone name cleanup.
...
Update transrt.cpp to exclude 0970 from roundtrip tests; it was now included because
Unicode 6.1 moved it from Common to Devanagari, but it has no mapping from InterIndic
to anything else.
X-SVN-Rev: 31074
2011-12-09 08:39:46 +00:00
Peter Edberg
6f1601400e
ICU-8539 Add ja linebreak tailoring to match CSS normal; break before small kana and prolonged mark
...
X-SVN-Rev: 30061
2011-05-09 08:16:34 +00:00
Peter Edberg
7aaca9b950
ICU-8329 Roll in Khmer dictionary word break code from George, data from Nathan/sbbic.org
...
X-SVN-Rev: 30019
2011-05-04 13:25:37 +00:00
Peter Edberg
33120b6943
ICU-8046 CLDR 1.9 integration, fix he,fi brkitr
...
X-SVN-Rev: 28914
2010-10-26 07:38:20 +00:00
Peter Edberg
b52138122f
ICU-8046 CLDR 1.9 integration pass 3 (CLDR r5147, still minus 14 transliterators)
...
X-SVN-Rev: 28906
2010-10-25 22:37:20 +00:00
Peter Edberg
8c3fb82cd5
ICU-8046 CLDR 1.9 integration pass 1, hand-edited files in data/
...
X-SVN-Rev: 28846
2010-10-18 05:15:17 +00:00
Peter Edberg
8cab1047c4
ICU-7965 th uses legacy clusters with 0E33 0EB3 added to Extend; remove test timebomb
...
X-SVN-Rev: 28663
2010-09-21 03:38:50 +00:00
John Emmons
ae306b3797
ICU-7429 Change ICU specials to point to current DTD
...
X-SVN-Rev: 27726
2010-03-01 23:13:18 +00:00
John Emmons
dfccc29d1e
ICU-7173 Merge CLDR 1.8p1 snapshot data
...
X-SVN-Rev: 26733
2009-10-02 21:06:55 +00:00
George Rhoten
6a7d96026a
ICU-5766 Remove Extended Grapheme Cluster from Break Iteration.
...
X-SVN-Rev: 22560
2007-08-29 06:10:43 +00:00
John Emmons
687be68872
ICU-5766 Use cldr 1.5 dtd
...
X-SVN-Rev: 22066
2007-07-20 00:11:37 +00:00
Andy Heninger
ca5d005978
ICU-5766 Extended Grapheme Clusters for ICU4C
...
X-SVN-Rev: 21933
2007-07-10 01:25:26 +00:00
Ram Viswanadha
cb17e6f035
ICU-5117 generate data files from LDML2ICUConverter
...
X-SVN-Rev: 19569
2006-04-21 00:55:24 +00:00