90 commits

Author SHA1 Message Date
Elango Cheran
3663cc1e0f ICU-22707 Add support for property Modifier_Combining_Mark 2024-07-30 15:44:32 -07:00
Elango Cheran
06c077bd35 ICU-22503 add property Indic_Conjunct_Break 2024-07-26 14:47:39 -07:00
Markus Scherer
47e9389b8e ICU-22785 move cptrie bit setter to toolutil; add getCPTrieSize() 2024-06-04 18:51:53 -07:00
Markus Scherer
c439dcdf27 ICU-22785 uprops.icu: coalesce scx+sc bits 2024-06-04 18:51:53 -07:00
Markus Scherer
0d8a3ccd11 ICU-22785 move Block bits from propsvec0 to new trie 2024-06-04 10:49:53 -07:00
Markus Scherer
4d9612bfa2 ICU-22707 Age 6:2, run generate.sh 2024-04-29 17:00:55 -07:00
Markus Scherer
cce162bf4d ICU-11396 new properties Identifier_Status & Identifier_Type
See #2879
2024-03-20 13:20:14 -07:00
Markus Scherer
d8659b476d ICU-22404 new properties IDS_Unary_Operator, ID_Compat_Math_*, NFKC_SCF 2023-09-16 14:41:51 -07:00
Markus Scherer
79ab90b5f9 ICU-6065 UnicodeSet::closeOver(simple case folding)
See #2322
2023-03-02 08:12:57 -08:00
Markus Scherer
f9beb616a8 ICU-21652 add emoji properties of strings
- 7 new properties: API constants & property names
- u_stringHasBinaryProperty(s, property) & UCharacter.hasBinaryProperty(s, property)
- two additional source data files
- new genprops part for writing new binary data file uemoji.icu
- data for existing emoji properties moved from uprops.icu (hardcoded in C++) to uemoji.icu (always loaded)
- new EmojiProps implementation
2021-09-08 12:15:50 -07:00
Markus Scherer
5a7a8f015c ICU-21461 uprops.h: remove unused gc macros
and some underused character constants
2021-02-01 18:45:59 -08:00
Markus Scherer
a7e378d587 ICU-20893 Unicode 13 beta
See PR #915, see changes.txt
- Unicode 13 beta data as of 2019-nov-21
- uprops.icu format version 7.7 with more bits for Script/Script_Extensions
- more bits in spoof checker ScriptSet
- root line break rules adjusted for UAX 14 changes, from Andy
- line break tailorings not yet in sync with root
2019-11-21 17:35:53 -08:00
Markus Scherer
ea7c030961 ICU-20203 update ICU to Unicode 12 beta
- data as of 2018-nov-26
- API constants for new blocks & scripts
- sync RBBIMonkeyTest.java test data with C++
2018-11-28 23:13:07 +01:00
Markus Scherer
98f9170004 ICU-20250 make UnicodeSet(intprop=value) faster
- fastpath for UnicodeSet.add(new last range)
- fewer UnicodeSet memory allocations:
  initial internal list array, exponential array growth,
  allocate strings list/set only when first one is added
- faster CodePointTrie.getRange(): fewer calls to filter function
- revert UnicodeSet(intprop=value) from trie ranges to range starts + lookup
- cache per-int-prop range starts: fewer lookups
2018-11-12 16:05:57 -08:00
Markus Scherer
82f0f480d4 ICU-20086 C++ sets & maps for Unicode properties (#93)
also create ucpmap.h from renamed parts of ucptrie.h
2018-09-27 14:27:39 -07:00
Markus Scherer
d2ec8987a7 ICU-8966 ICU-12850 add API/data/code for text layout properties InPC, InSC, vo (#92)
ICU-8966: Indic_Positional_Category & Indic_Syllabic_Category
ICU-12850: Vertical_Orientation
2018-09-27 14:27:39 -07:00
Markus Scherer
af6a771267 ICU-13630 implement, test, use emoji property Extended_Pictographic
X-SVN-Rev: 41094
2018-03-12 05:53:02 +00:00
Markus Scherer
eb57bf7c90 ICU-12985 implement the binary Prepended_Concatenation_Mark property
X-SVN-Rev: 40084
2017-04-27 21:11:01 +00:00
Markus Scherer
6ce7f348a3 ICU-12985 implement the binary Emoji_Component property for emoji 5
X-SVN-Rev: 40082
2017-04-26 23:58:36 +00:00
Andy Heninger
04448b004f ICU-12764 UTF-8 source files, update file encoding comments.
X-SVN-Rev: 39641
2017-02-03 18:57:23 +00:00
Andy Heninger
242e02c388 ICU-12764 icu4c utf-8 source files, update Copyright notices.
X-SVN-Rev: 39583
2017-01-20 00:20:31 +00:00
Michael Ow
61607c2773 ICU-12564 Update copyright notice in trunk
X-SVN-Rev: 38848
2016-06-15 18:58:17 +00:00
Yoshito Umaoka
00ca13e126 ICU-12564 Reverted r38761 and r38762, because we want to prepend the Unicode copyright for existing source files, instead of replacing copyright comments.
X-SVN-Rev: 38776
2016-05-31 21:45:07 +00:00
Michael Ow
c9f199a30f ICU-12564 Update copyright notice in ICU4C
X-SVN-Rev: 38761
2016-05-26 22:32:17 +00:00
Markus Scherer
5e69db5c2f ICU-12526 integrate Unicode 9 beta
X-SVN-Rev: 38753
2016-05-19 22:48:18 +00:00
Peter Edberg
f5d28f72d2 ICU-11802 Remove comma at end of enum that was breaking one autobuild
X-SVN-Rev: 38190
2016-01-22 17:39:30 +00:00
Markus Scherer
c10fdc6e29 ICU-11802 add 4 Emoji properties from emoji-data.txt 2.0
X-SVN-Rev: 38183
2016-01-21 04:39:21 +00:00
Markus Scherer
6876123c0e ICU-9437 update to Unicode 6.2; merge in branches/markus/uni62 r32274
X-SVN-Rev: 32282
2012-08-28 23:37:10 +00:00
Markus Scherer
9bcc40cf9e ICU-8452 un-autoconf platform.h, move user-configurable settings to uconfig.h, and other changes; from svn merge --reintegrate branches/markus/platformh at r30411
X-SVN-Rev: 30412
2011-07-23 00:10:30 +00:00
Markus Scherer
b6036a94f9 ICU-8615 implement optional IDNA2008 CONTEXTO check in UTS46
X-SVN-Rev: 30268
2011-07-01 22:19:14 +00:00
Markus Scherer
c04082d93c ICU-8105 hardcode Unicode property names data (formatVersion 2); includes new dictionary-type tries (ByteTrie & UCharTrie see ticket #8167); merge branches/markus/tries -r 29040:29249
X-SVN-Rev: 29252
2010-12-31 18:21:36 +00:00
Markus Scherer
b5e1330176 ICU-7264 merge Unicode 6.0 into trunk from branches/markus/uni60 -r 28339:28657
X-SVN-Rev: 28661
2010-09-21 00:12:49 +00:00
Markus Scherer
0acda636e4 ICU-7722 build canonical-iterator data from nfc.nrm (port Java code to C++)
X-SVN-Rev: 28117
2010-06-01 06:10:26 +00:00
Markus Scherer
8ddbd1394c ICU-7273 merge in Normalizer2 API & code, and ICU-5785 UnicodeSet::span(UnicodeString) and ICU-7296 tempSubString()/retainBetween(); merge -r 26971:27150 branches/markus/norm2
X-SVN-Rev: 27155
2010-01-06 23:50:03 +00:00
Markus Scherer
66b63f9c48 ICU-7084 Unicode 5.2: merge -r 26464:26890 branches/markus/uni52 into trunk, and a little cleanup (C++)
X-SVN-Rev: 26898
2009-11-13 19:25:21 +00:00
Markus Scherer
8406b8e2bd ICU-6765 fix Windows compilation problem: ICU internal #define collisions with Windows STL headers
X-SVN-Rev: 25481
2009-02-25 01:22:27 +00:00
Andy Heninger
71bf003171 ICU-5696 Unicode 5.1 Update
X-SVN-Rev: 23761
2008-04-04 22:47:43 +00:00
George Rhoten
06c982a592 ICU-5279 Better hiding and testing of internal functions.
X-SVN-Rev: 21686
2007-06-10 20:11:28 +00:00
George Rhoten
fad658d66d ICU-5279 Reduce the set of exported functions.
X-SVN-Rev: 21621
2007-06-03 06:08:46 +00:00
George Rhoten
b8328839ca ICU-4707 Fix some compiler warnings about unused code.
X-SVN-Rev: 19351
2006-03-16 04:54:35 +00:00
Markus Scherer
e6a0df52ee ICU-4199 enum/name API support for C/POSIX character classes, and UnicodeSet support for [:Assigned:]
X-SVN-Rev: 17730
2005-05-28 22:54:36 +00:00
Markus Scherer
7f8d06b42b ICU-4334 Unicode 4.1 update
X-SVN-Rev: 17104
2005-01-10 18:02:54 +00:00
Markus Scherer
aa6cd66256 ICU-3858 remove data for properties that were moved to ucase.icu and ubidi.icu, and simplify remaining structure
X-SVN-Rev: 17074
2005-01-02 00:22:48 +00:00
Markus Scherer
ca773f99db ICU-4287 move bidi/shaping properties to ubidi.icu
X-SVN-Rev: 17056
2004-12-31 13:36:49 +00:00
Markus Scherer
ca77616509 ICU-3432 move uniset.cpp data for case closure to ucase.icu; have gencase build case closure data; ucase.c use it; UnicodeSet::closeOver() call that
X-SVN-Rev: 16902
2004-12-02 04:18:35 +00:00
George Rhoten
e69fca9d5f ICU-4078 Fix for AIX with Visual Age 5 compiler, and make sure that the disentanglement is correct by putting all the UnicodeSet virtual functions are in one file. Also move some of the rule whitespace handling into better locations.
X-SVN-Rev: 16519
2004-10-15 22:56:26 +00:00
Markus Scherer
f7b7183d7a ICU-4078 enable building a UnicodeSet from a property using/loading only the relevant data file
X-SVN-Rev: 16313
2004-09-13 23:33:22 +00:00
Markus Scherer
bd1f26f937 ICU-4078 move property name comparison functions from uprops to propname
X-SVN-Rev: 16303
2004-09-11 22:02:10 +00:00
Markus Scherer
b5eebb331e ICU-4078 move functions to make uchar.c (core properties APIs) independent of uprops.c (universal properties APIs)
X-SVN-Rev: 16301
2004-09-11 15:04:40 +00:00
Markus Scherer
8a3a93deed ICU-4078 use USetAdder interface to remove dependencies of low-level code on the USet/UnicodeSet implementation
X-SVN-Rev: 16265
2004-09-07 17:59:53 +00:00