Elango Cheran
3663cc1e0f
ICU-22707 Add support for property Modifier_Combining_Mark
2024-07-30 15:44:32 -07:00
Elango Cheran
06c077bd35
ICU-22503 add property Indic_Conjunct_Break
2024-07-26 14:47:39 -07:00
Markus Scherer
47e9389b8e
ICU-22785 move cptrie bit setter to toolutil; add getCPTrieSize()
2024-06-04 18:51:53 -07:00
Markus Scherer
c439dcdf27
ICU-22785 uprops.icu: coalesce scx+sc bits
2024-06-04 18:51:53 -07:00
Markus Scherer
0d8a3ccd11
ICU-22785 move Block bits from propsvec0 to new trie
2024-06-04 10:49:53 -07:00
Markus Scherer
4d9612bfa2
ICU-22707 Age 6:2, run generate.sh
2024-04-29 17:00:55 -07:00
Markus Scherer
cce162bf4d
ICU-11396 new properties Identifier_Status & Identifier_Type
...
See #2879
2024-03-20 13:20:14 -07:00
Markus Scherer
d8659b476d
ICU-22404 new properties IDS_Unary_Operator, ID_Compat_Math_*, NFKC_SCF
2023-09-16 14:41:51 -07:00
Markus Scherer
79ab90b5f9
ICU-6065 UnicodeSet::closeOver(simple case folding)
...
See #2322
2023-03-02 08:12:57 -08:00
Markus Scherer
f9beb616a8
ICU-21652 add emoji properties of strings
...
- 7 new properties: API constants & property names
- u_stringHasBinaryProperty(s, property) & UCharacter.hasBinaryProperty(s, property)
- two additional source data files
- new genprops part for writing new binary data file uemoji.icu
- data for existing emoji properties moved from uprops.icu (hardcoded in C++) to uemoji.icu (always loaded)
- new EmojiProps implementation
2021-09-08 12:15:50 -07:00
Markus Scherer
5a7a8f015c
ICU-21461 uprops.h: remove unused gc macros
...
and some underused character constants
2021-02-01 18:45:59 -08:00
Markus Scherer
a7e378d587
ICU-20893 Unicode 13 beta
...
See PR #915, see changes.txt
- Unicode 13 beta data as of 2019-nov-21
- uprops.icu format version 7.7 with more bits for Script/Script_Extensions
- more bits in spoof checker ScriptSet
- root line break rules adjusted for UAX 14 changes, from Andy
- line break tailorings not yet in sync with root
2019-11-21 17:35:53 -08:00
Markus Scherer
ea7c030961
ICU-20203 update ICU to Unicode 12 beta
...
- data as of 2018-nov-26
- API constants for new blocks & scripts
- sync RBBIMonkeyTest.java test data with C++
2018-11-28 23:13:07 +01:00
Markus Scherer
98f9170004
ICU-20250 make UnicodeSet(intprop=value) faster
...
- fastpath for UnicodeSet.add(new last range)
- fewer UnicodeSet memory allocations:
  initial internal list array, exponential array growth,
  allocate strings list/set only when first one is added
- faster CodePointTrie.getRange(): fewer calls to filter function
- revert UnicodeSet(intprop=value) from trie ranges to range starts + lookup
- cache per-int-prop range starts: fewer lookups
2018-11-12 16:05:57 -08:00
Markus Scherer
82f0f480d4
ICU-20086 C++ sets & maps for Unicode properties (#93)
...
also create ucpmap.h from renamed parts of ucptrie.h
2018-09-27 14:27:39 -07:00
Markus Scherer
d2ec8987a7
ICU-8966 ICU-12850 add API/data/code for text layout properties InPC, InSC, vo (#92)
...
ICU-8966: Indic_Positional_Category & Indic_Syllabic_Category
ICU-12850: Vertical_Orientation
2018-09-27 14:27:39 -07:00
Markus Scherer
af6a771267
ICU-13630 implement, test, use emoji property Extended_Pictographic
...
X-SVN-Rev: 41094
2018-03-12 05:53:02 +00:00
Markus Scherer
eb57bf7c90
ICU-12985 implement the binary Prepended_Concatenation_Mark property
...
X-SVN-Rev: 40084
2017-04-27 21:11:01 +00:00
Markus Scherer
6ce7f348a3
ICU-12985 implement the binary Emoji_Component property for emoji 5
...
X-SVN-Rev: 40082
2017-04-26 23:58:36 +00:00
Andy Heninger
04448b004f
ICU-12764 UTF-8 source files, update file encoding comments.
...
X-SVN-Rev: 39641
2017-02-03 18:57:23 +00:00
Andy Heninger
242e02c388
ICU-12764 icu4c utf-8 source files, update Copyright notices.
...
X-SVN-Rev: 39583
2017-01-20 00:20:31 +00:00
Michael Ow
61607c2773
ICU-12564 Update copyright notice in trunk
...
X-SVN-Rev: 38848
2016-06-15 18:58:17 +00:00
Yoshito Umaoka
00ca13e126
ICU-12564 Reverted r38761 and r38762, because we want to prepend the Unicode copyright for existing source files, instead of replacing copyright comments.
...
X-SVN-Rev: 38776
2016-05-31 21:45:07 +00:00
Michael Ow
c9f199a30f
ICU-12564 Update copyright notice in ICU4C
...
X-SVN-Rev: 38761
2016-05-26 22:32:17 +00:00
Markus Scherer
5e69db5c2f
ICU-12526 integrate Unicode 9 beta
...
X-SVN-Rev: 38753
2016-05-19 22:48:18 +00:00
Peter Edberg
f5d28f72d2
ICU-11802 Remove comma at end of enum that was breaking one autobuild
...
X-SVN-Rev: 38190
2016-01-22 17:39:30 +00:00
Markus Scherer
c10fdc6e29
ICU-11802 add 4 Emoji properties from emoji-data.txt 2.0
...
X-SVN-Rev: 38183
2016-01-21 04:39:21 +00:00
Markus Scherer
6876123c0e
ICU-9437 update to Unicode 6.2; merge in branches/markus/uni62 r32274
...
X-SVN-Rev: 32282
2012-08-28 23:37:10 +00:00
Markus Scherer
9bcc40cf9e
ICU-8452 un-autoconf platform.h, move user-configurable settings to uconfig.h, and other changes; from svn merge --reintegrate branches/markus/platformh at r30411
...
X-SVN-Rev: 30412
2011-07-23 00:10:30 +00:00
Markus Scherer
b6036a94f9
ICU-8615 implement optional IDNA2008 CONTEXTO check in UTS46
...
X-SVN-Rev: 30268
2011-07-01 22:19:14 +00:00
Markus Scherer
c04082d93c
ICU-8105 hardcode Unicode property names data (formatVersion 2); includes new dictionary-type tries (ByteTrie & UCharTrie see ticket #8167); merge branches/markus/tries -r 29040:29249
...
X-SVN-Rev: 29252
2010-12-31 18:21:36 +00:00
Markus Scherer
b5e1330176
ICU-7264 merge Unicode 6.0 into trunk from branches/markus/uni60 -r 28339:28657
...
X-SVN-Rev: 28661
2010-09-21 00:12:49 +00:00
Markus Scherer
0acda636e4
ICU-7722 build canonical-iterator data from nfc.nrm (port Java code to C++)
...
X-SVN-Rev: 28117
2010-06-01 06:10:26 +00:00
Markus Scherer
8ddbd1394c
ICU-7273 merge in Normalizer2 API & code, and ICU-5785 UnicodeSet::span(UnicodeString) and ICU-7296 tempSubString()/retainBetween(); merge -r 26971:27150 branches/markus/norm2
...
X-SVN-Rev: 27155
2010-01-06 23:50:03 +00:00
Markus Scherer
66b63f9c48
ICU-7084 Unicode 5.2: merge -r 26464:26890 branches/markus/uni52 into trunk, and a little cleanup (C++)
...
X-SVN-Rev: 26898
2009-11-13 19:25:21 +00:00
Markus Scherer
8406b8e2bd
ICU-6765 fix Windows compilation problem: ICU internal #define collisions with Windows STL headers
...
X-SVN-Rev: 25481
2009-02-25 01:22:27 +00:00
Andy Heninger
71bf003171
ICU-5696 Unicode 5.1 Update
...
X-SVN-Rev: 23761
2008-04-04 22:47:43 +00:00
George Rhoten
06c982a592
ICU-5279 Better hiding and testing of internal functions.
...
X-SVN-Rev: 21686
2007-06-10 20:11:28 +00:00
George Rhoten
fad658d66d
ICU-5279 Reduce the set of exported functions.
...
X-SVN-Rev: 21621
2007-06-03 06:08:46 +00:00
George Rhoten
b8328839ca
ICU-4707 Fix some compiler warnings about unused code.
...
X-SVN-Rev: 19351
2006-03-16 04:54:35 +00:00
Markus Scherer
e6a0df52ee
ICU-4199 enum/name API support for C/POSIX character classes, and UnicodeSet support for [:Assigned:]
...
X-SVN-Rev: 17730
2005-05-28 22:54:36 +00:00
Markus Scherer
7f8d06b42b
ICU-4334 Unicode 4.1 update
...
X-SVN-Rev: 17104
2005-01-10 18:02:54 +00:00
Markus Scherer
aa6cd66256
ICU-3858 remove data for properties that were moved to ucase.icu and ubidi.icu, and simplify remaining structure
...
X-SVN-Rev: 17074
2005-01-02 00:22:48 +00:00
Markus Scherer
ca773f99db
ICU-4287 move bidi/shaping properties to ubidi.icu
...
X-SVN-Rev: 17056
2004-12-31 13:36:49 +00:00
Markus Scherer
ca77616509
ICU-3432 move uniset.cpp data for case closure to ucase.icu; have gencase build case closure data; ucase.c use it; UnicodeSet::closeOver() call that
...
X-SVN-Rev: 16902
2004-12-02 04:18:35 +00:00
George Rhoten
e69fca9d5f
ICU-4078 Fix for AIX with Visual Age 5 compiler, and make sure that the
disentanglement is correct by putting all the UnicodeSet virtual functions
in one file. Also move some of the rule whitespace handling into better
locations.
X-SVN-Rev: 16519
2004-10-15 22:56:26 +00:00
Markus Scherer
f7b7183d7a
ICU-4078 enable building a UnicodeSet from a property using/loading only the relevant data file
...
X-SVN-Rev: 16313
2004-09-13 23:33:22 +00:00
Markus Scherer
bd1f26f937
ICU-4078 move property name comparison functions from uprops to propname
...
X-SVN-Rev: 16303
2004-09-11 22:02:10 +00:00
Markus Scherer
b5eebb331e
ICU-4078 move functions to make uchar.c (core properties APIs) independent of uprops.c (universal properties APIs)
...
X-SVN-Rev: 16301
2004-09-11 15:04:40 +00:00
Markus Scherer
8a3a93deed
ICU-4078 use USetAdder interface to remove dependencies of low-level code on the USet/UnicodeSet implementation
...
X-SVN-Rev: 16265
2004-09-07 17:59:53 +00:00