Elango Cheran
06c077bd35
ICU-22503 add property Indic_Conjunct_Break
2024-07-26 14:47:39 -07:00
Markus Scherer
876816b0a1
ICU-22707 new short aliases ID_Status, ID_Type
2024-07-18 23:56:34 +00:00
Markus Scherer
560e4bbf41
ICU-22707 preparseucd.py
2024-04-29 17:00:55 -07:00
Markus Scherer
cce162bf4d
ICU-11396 new properties Identifier_Status & Identifier_Type
...
See #2879
2024-03-20 13:20:14 -07:00
Markus Scherer
d8659b476d
ICU-22404 new properties IDS_Unary_Operator, ID_Compat_Math_*, NFKC_SCF
2023-09-16 14:41:51 -07:00
Markus Scherer
c5d0fff5a0
ICU-21980 parse multiple @missing
lines
2022-06-02 21:29:24 +00:00
Markus Scherer
e1be738ccb
ICU-21980 Unicode 15 pre-beta data files, new prop values
2022-05-25 18:23:11 +00:00
Markus Scherer
f9beb616a8
ICU-21652 add emoji properties of strings
...
- 7 new properties: API constants & property names
- u_stringHasBinaryProperty(s, property) & UCharacter.hasBinaryProperty(s, property)
- two additional source data files
- new genprops part for writing new binary data file uemoji.icu
- data for existing emoji properties moved from uprops.icu (hardcoded in C++) to uemoji.icu (always loaded)
- new EmojiProps implementation
2021-09-08 12:15:50 -07:00
Markus Scherer
41aa7159ea
ICU-21635 Unicode 14 data files 20210820, line break LB30b.2
...
See #1807
2021-08-23 22:11:49 +00:00
Markus Scherer
d4c92ebcfc
ICU-21635 Unicode 14 beta
2021-06-21 22:26:15 +00:00
gnrunge
d0096a84e7
ICU-21243 Migrates preparseucd.py script to Python 3. Python 3 changes
...
the order of elements in an iterator from Python 2 with the result
that the generated data in ppucd.txt changes with respect to the selection
of a property value used to compact the output when there is a
property with equal count of the two most frequent values. This
change doesn't change the validity of the generated ppucd.txt file.
While at it, also migrated script parsescriptmetadata.py to Python 3.
2020-12-01 13:02:52 -08:00
Markus Scherer
a7e378d587
ICU-20893 Unicode 13 beta
...
See PR #915 , see changes.txt
- Unicode 13 beta data as of 2019-nov-21
- uprops.icu format version 7.7 with more bits for Script/Script_Extensions
- more bits in spoof checker ScriptSet
- root line break rules adjusted for UAX 14 changes, from Andy
- line break tailorings not yet in sync with root
2019-11-21 17:35:53 -08:00
Markus Scherer
0565894534
ICU-20497 Unicode 12.1
2019-04-04 10:23:24 -07:00
Markus Scherer
ea7c030961
ICU-20203 update ICU to Unicode 12 beta
...
- data as of 2018-nov-26
- API constants for new blocks & scripts
- sync RBBIMonkeyTest.java test data with C++
2018-11-28 23:13:07 +01:00
Markus Scherer
d2ec8987a7
ICU-8966 ICU-12850 add API/data/code for text layout properties InPC, InSC, vo ( #92 )
...
ICU-8966: Indic_Positional_Category & Indic_Syllabic_Category
ICU-12850: Vertical_Orientation
2018-09-27 14:27:39 -07:00
Fredrik Roubert
12e2a72747
ICU-20062 Set the Python -B flag to inhibit the writing of .pyc files.
...
This will prevent littering the source tree with spurious .pyc files.
The potential faster execution when re-running a script that has an
up-to-date .pyc file is negligible.
2018-09-27 14:27:38 -07:00
Markus Scherer
a4e66ded6d
ICU-13630 switch from IdnaTest.txt to IdnaTestV2.txt new in Unicode 11 see Unicode PRI 375
...
X-SVN-Rev: 41294
2018-04-30 03:17:11 +00:00
Markus Scherer
1752b5c8c9
ICU-13630 Unicode 11 beta data mar06, API constants for new property values
...
X-SVN-Rev: 41092
2018-03-09 23:53:02 +00:00
Yoshito Umaoka
1870215131
ICU-13358 Fixed cpyscan problems. Enhanced cpyscan.pl to use online version of cpyskip.txt by default. Added the new Unicode copyright comment in many tools files.
...
X-SVN-Rev: 40527
2017-10-03 02:32:50 +00:00
Markus Scherer
acf2b4cc82
ICU-13186 stop prepending UTF-8 BOM to some Unicode files
...
X-SVN-Rev: 40149
2017-06-02 22:52:19 +00:00
Markus Scherer
b2ead3e2e1
ICU-8130 UTS 46 conformance test using Unicode IdnaTest.txt
...
X-SVN-Rev: 40130
2017-05-23 04:44:58 +00:00
Markus Scherer
20bee936b1
ICU-12985 ppucd.txt more readable unassigned ranges; block compaction by size savings not value plurality reduces clutter
...
X-SVN-Rev: 40096
2017-05-02 22:53:28 +00:00
Markus Scherer
761c994436
ICU-12985 pre-parse VerticalOrientation.txt
...
X-SVN-Rev: 40086
2017-04-28 20:29:22 +00:00
Markus Scherer
6ce7f348a3
ICU-12985 implement the binary Emoji_Component property for emoji 5
...
X-SVN-Rev: 40082
2017-04-26 23:58:36 +00:00
Markus Scherer
edce2be62c
ICU-12985 Unicode 10 data 20170418, new property values, adjust tools & tests
...
X-SVN-Rev: 40079
2017-04-26 21:17:13 +00:00
Markus Scherer
1982037316
ICU-12900 change ppucd.txt for copyright scanner patterns
...
X-SVN-Rev: 39921
2017-03-23 17:30:41 +00:00
Markus Scherer
466a569c58
ICU-12900 mostly still Unicode 9.0 but Unicode 10 beta (20170322) segmentation & bidi data and draft emoji 5.0 (also 20170322)
...
X-SVN-Rev: 39915
2017-03-23 02:14:00 +00:00
Markus Scherer
8d3a176d4f
ICU-12526 ignore inline comments in script metadata
...
X-SVN-Rev: 38709
2016-05-05 23:53:32 +00:00
Markus Scherer
3e5578f3bf
ICU-12526 uprops.icu formatVersion 7.3: support new fraction numeric values like 3/80; ppucd.txt mostly no block compression for String/Misc properties; minor bug fixes
...
X-SVN-Rev: 38706
2016-05-05 22:51:18 +00:00
Markus Scherer
dbebd188e7
ICU-12526 initial Unicode 9 data
...
X-SVN-Rev: 38698
2016-05-04 23:54:37 +00:00
Markus Scherer
0390f4c86c
ICU-11802 add 4 Emoji properties from emoji-data.txt 2.0
...
X-SVN-Rev: 38182
2016-01-21 04:34:33 +00:00
Markus Scherer
99c4dfa565
ICU-11574 Unicode 8 updates
...
X-SVN-Rev: 37353
2015-04-16 23:42:50 +00:00
Markus Scherer
2436998dd3
ICU-10821 ppucd.txt: find & write current-year copyright, escape non-ASCII in heading comments
...
X-SVN-Rev: 35600
2014-04-04 18:01:48 +00:00
Markus Scherer
f440aa17d9
ICU-10821 initial tools update for Unicode 7.0
...
X-SVN-Rev: 35596
2014-04-03 22:43:00 +00:00
Markus Scherer
c9dc52d608
ICU-10128 remove version suffixes from UCD files, so that they are easy to compare as a tree of files
...
X-SVN-Rev: 33565
2013-04-30 16:27:15 +00:00
Markus Scherer
f452c2eff4
ICU-10128 add 2 new script codes from ISO 15924: Aghb & Mahj
...
X-SVN-Rev: 33563
2013-04-29 22:39:38 +00:00
Markus Scherer
dabb8350c7
ICU-10128 encode new properties bpt & bpb in ubidi.icu format version 2.1
...
X-SVN-Rev: 33557
2013-04-26 23:45:27 +00:00
Markus Scherer
3db9d2b0f7
ICU-10128 parse the new BidiBrackets.txt
...
X-SVN-Rev: 33554
2013-04-26 00:06:57 +00:00
Markus Scherer
7f3718899a
ICU-9538 parse CLDR scriptMetadata.txt
...
X-SVN-Rev: 33259
2013-02-17 23:16:09 +00:00
Markus Scherer
db9611caa9
ICU-9437 support UCD 6.2
...
X-SVN-Rev: 32062
2012-07-24 21:11:29 +00:00
Markus Scherer
979a273104
ICU-8995 add new ISO script code Hluw=Anatolian Hieroglyphs
...
X-SVN-Rev: 31248
2012-01-23 19:51:22 +00:00
Markus Scherer
4ad12dc318
ICU-8995 merge idna2nrm.py into preparseucd.py
...
X-SVN-Rev: 31229
2012-01-19 18:51:33 +00:00
Markus Scherer
f72bdf2ffb
ICU-9023 reduce norm2/nfkc.txt to a delta over nfc.txt
...
X-SVN-Rev: 31200
2012-01-12 01:02:38 +00:00
Markus Scherer
b2a9c8508e
ICU-8972 generate norm2/nfkc_cf.txt from preparseucd.py
...
X-SVN-Rev: 31197
2012-01-10 22:59:14 +00:00
Markus Scherer
07a5ec42af
ICU-8972 stop copying UCD .txt files into the ICU source tree that are not parsed any more except by preparseucd.py
...
X-SVN-Rev: 31195
2012-01-10 22:07:51 +00:00
Markus Scherer
e8d8222080
ICU-8972 write * Unicode version line to norm2/.txt files
...
X-SVN-Rev: 31192
2012-01-10 20:56:22 +00:00
Markus Scherer
72559ba1cd
ICU-8972 replace gennorm with code in preparseucd.py
...
X-SVN-Rev: 31174
2011-12-28 19:23:13 +00:00
Markus Scherer
162f137de9
ICU-8972 document stable pnames_data.h output
...
X-SVN-Rev: 31171
2011-12-23 04:29:33 +00:00
Markus Scherer
a65dbd9267
ICU-8972 pnames_data.h: remove redundant _COUNT constants, add static
...
X-SVN-Rev: 31168
2011-12-22 06:52:39 +00:00
Markus Scherer
1ec1832428
ICU-8972 bug fixes
...
X-SVN-Rev: 31166
2011-12-22 06:28:59 +00:00