47 commits

Author SHA1 Message Date
Markus Scherer
b6dcc95d3c ICU-21833 remove redundant void parameter lists
See #2351
2023-03-02 09:31:57 -08:00
Fredrik Roubert
2de88f9d9c ICU-21833 Replace UChar with char16_t in all C++ code. 2023-02-06 19:27:44 +01:00
Fredrik Roubert
633438f8da ICU-20973 Change all equality operator return types from UBool to bool. 2021-08-17 00:35:00 +02:00
Jeff Genovy
341608f7b6 ICU-21108 Update to use/support VS2019, and add extra CI builds for VS2017 and VS2019.
Change to use the Windows 10 SDK for Win32 (Win7) builds on the VS2019 ADO images.
The Windows 10 SDK is backwards compatible to Windows 7, if WINVER and
_WIN32_WINNT are set before compiling.

Clean up the vcxproj files from VC7.1
2020-09-02 13:01:27 -07:00
Andy Heninger
003b431540 ICU-13590 RBBI, improve handling of concurrent look-ahead rules.
Change the mapping from rule number to boundary position to use a simple array
instead of a linear search lookup map.

Look-ahead rules have a preceding context, a boundary position, and a following context.
In the implementation, when the preceding context matches, the potential boundary
position is saved. Then, if the following context proves to match, the saved boundary is
returned as an actual boundary.

Look-ahead rules are numbered, and the implementation maintains a map from
rule number to the tentative saved boundary position.

In an earlier improvement to the rule builder, the rule numbering was changed from
the original sparse numbering to a contiguous sequence, in anticipation of
changing the mapping from number to position to use a simple array.
2020-07-21 14:39:15 -07:00
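
The array-based mapping described in the commit above can be sketched roughly as follows. This is an illustration only, not ICU's code: the class `LookAheadPositions`, its methods `save` and `accept`, and the sentinel `kNoPosition` are hypothetical names, and the sketch assumes rules are numbered contiguously from 0.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical sentinel meaning "no tentative boundary saved for this rule".
constexpr std::int32_t kNoPosition = -1;

// Hypothetical helper, not an ICU class. Because look-ahead rules are
// numbered contiguously (assumed here to start at 0), the rule number can
// index directly into an array; no search through a lookup map is needed.
class LookAheadPositions {
public:
    explicit LookAheadPositions(std::int32_t ruleCount)
        : positions_(static_cast<std::size_t>(ruleCount), kNoPosition) {}

    // The preceding context of `rule` matched: remember the potential boundary.
    void save(std::int32_t rule, std::int32_t position) {
        assert(inRange(rule));
        positions_[static_cast<std::size_t>(rule)] = position;
    }

    // The following context of `rule` matched: return the saved boundary,
    // or kNoPosition if none was recorded.
    std::int32_t accept(std::int32_t rule) const {
        assert(inRange(rule));
        return positions_[static_cast<std::size_t>(rule)];
    }

private:
    bool inRange(std::int32_t rule) const {
        return rule >= 0 && static_cast<std::size_t>(rule) < positions_.size();
    }

    std::vector<std::int32_t> positions_;
};
```

With this shape, several look-ahead rules can be in progress at once, each holding its own tentative position, and both saving and retrieving are constant-time array accesses.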
Andy Heninger
1eef362329 ICU-13565 Break Iteration, remove the dictionary bit from the implementation.
For identifying text that needs to be handled by a word dictionary for Break Iteration,
change from using a bit in the character category to sorting all dictionary categories
together, and recording the boundary between the non-dictionary and dictionary ranges.

This is internal to the implementation. It does not affect behavior.
It does increase the number of character categories that can be handled using a
compact 8 bit Trie, from 127 to 255.
2020-06-17 12:00:14 -07:00
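
A minimal sketch of the two schemes contrasted in the commit above, under stated assumptions: the names `kDictBit`, `CategoryLayout`, `dictCategoriesStart`, `isDictionaryOld`, and `isDictionaryNew` are hypothetical, the flag is assumed to be the top bit of an 8-bit value, and the dictionary categories are assumed to be sorted after the non-dictionary ones. None of this is taken from the ICU sources.

```cpp
#include <cstdint>

// Old scheme (assumed layout): one bit of each 8-bit category value flags
// "needs dictionary handling", leaving only 7 bits for the category number.
constexpr std::uint8_t kDictBit = 0x80;  // hypothetical bit position

inline bool isDictionaryOld(std::uint8_t category) {
    return (category & kDictBit) != 0;
}

// New scheme: dictionary categories are sorted together (assumed here to be
// placed after all non-dictionary categories), and only the boundary between
// the two ranges is recorded. All 8 bits are then available for numbering.
struct CategoryLayout {
    std::uint8_t dictCategoriesStart;  // first category in the dictionary range
};

inline bool isDictionaryNew(const CategoryLayout& layout, std::uint8_t category) {
    return category >= layout.dictCategoriesStart;
}
```

The check changes from a bit test to a single comparison against the recorded boundary, which is why the change can stay internal to the implementation.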
Andy Heninger
f0ad454691 ICU-13565 RBBI, make all state table row data be unsigned. 2020-06-01 20:05:17 -07:00
Frank Tang
c5ebb80a73 ICU-13565 Reduce size of BreakIterator brk files
See #1100
2020-05-27 14:26:10 -07:00
Frank Tang
94c9ff2089 ICU-20991 Trace BreakIterator/BreakEngine creation
See #1014
2020-03-06 14:18:43 -08:00
Andy Heninger
198a14956e ICU-13194 RBBI, bump data version number. New reverse tables are not compatible.
X-SVN-Rev: 41168
2018-03-28 22:07:24 +00:00
Andy Heninger
aead9fb553 ICU-13194 RBBI auto reverse tables: size reduction, and remove hand written rules.
X-SVN-Rev: 41163
2018-03-28 01:20:13 +00:00
Andy Heninger
b1b0be93ea ICU-13194 RBBI safe tables, all tests passing!
X-SVN-Rev: 41155
2018-03-26 23:01:16 +00:00
Andy Heninger
660d38bc7f ICU-13194 rbbi safe rule synth, work in progress.
X-SVN-Rev: 41118
2018-03-17 00:34:48 +00:00
Andy Heninger
3d4a3fbaa8 ICU-13569 rbbi state table opt, work in progress.
X-SVN-Rev: 40855
2018-02-08 01:42:04 +00:00
Markus Scherer
27f8d70bcd ICU-13503 declare variable-length array at end of struct with length 1 to disable bounds checkers
X-SVN-Rev: 40736
2017-12-14 21:25:46 +00:00
Andy Heninger
4e1c4096a6 ICU-9954 Break Iteration, remove reverse rules, add boundary caching.
X-SVN-Rev: 40433
2017-09-19 18:17:22 +00:00
Andy Heninger
4a5b4e4b44 ICU-12507 RBBI data loading, incorporate Markus's review comments.
X-SVN-Rev: 40268
2017-07-17 21:22:53 +00:00
Andy Heninger
a9cedfb775 ICU-12507 bump rbbi data version for change to UTrie2; consolidate version checking.
X-SVN-Rev: 40183
2017-06-20 22:11:08 +00:00
Andy Heninger
a3a2b57516 ICU-12507 ICU4C RBBI, switch to UTrie2
X-SVN-Rev: 40105
2017-05-03 23:44:14 +00:00
Andy Heninger
04448b004f ICU-12764 UTF-8 source files, update file encoding comments.
X-SVN-Rev: 39641
2017-02-03 18:57:23 +00:00
Andy Heninger
242e02c388 ICU-12764 icu4c utf-8 source files, update Copyright notices.
X-SVN-Rev: 39583
2017-01-20 00:20:31 +00:00
Michael Ow
61607c2773 ICU-12564 Update copyright notice in trunk
X-SVN-Rev: 38848
2016-06-15 18:58:17 +00:00
Yoshito Umaoka
00ca13e126 ICU-12564 Reverted r38761 and r38762, because we want to prepend the Unicode copyright for existing source files, instead of replacing copyright comments.
X-SVN-Rev: 38776
2016-05-31 21:45:07 +00:00
Michael Ow
c9f199a30f ICU-12564 Update copyright notice in ICU4C
X-SVN-Rev: 38761
2016-05-26 22:32:17 +00:00
Andy Heninger
9a2a9bea34 ICU-11036 RBBI, check data header when opening.
X-SVN-Rev: 36366
2014-09-05 21:23:48 +00:00
Andy Heninger
45561757cc ICU-10182 Build Time mutexes, merge from development branch.
X-SVN-Rev: 34139
2013-08-30 01:23:42 +00:00
Andy Heninger
ae87a3acc2 ICU-10051 Mutexes: introduce UInitOnce; remove UMTX_CHECK; replace all uses of UMTX_CHECK. All the directories this time.
X-SVN-Rev: 33788
2013-06-01 03:37:16 +00:00
Yoshito Umaoka
e9503bdade ICU-8909 Fixed various warnings reported by a source code analysis tool.
X-SVN-Rev: 30958
2011-11-14 19:32:51 +00:00
Markus Scherer
9bcc40cf9e ICU-8452 un-autoconf platform.h, move user-configurable settings to uconfig.h, and other changes; from svn merge --reintegrate branches/markus/platformh at r30411
X-SVN-Rev: 30412
2011-07-23 00:10:30 +00:00
Peter Edberg
d08940016b ICU-5858 Add constructors for RuleBasedBreakIterator and RBBIDataWrapper
that take RBBIDataHeader* but do not adopt it.

X-SVN-Rev: 24641
2008-09-25 05:48:27 +00:00
Andy Heninger
f327cc7af2 ICU-4773 add beginning of input test to rbbi rules
X-SVN-Rev: 18589
2005-09-27 00:03:32 +00:00
Eric Mader
5dc3d7c9d4 ICU-4561 Update copyright notices for ICU 3.4
X-SVN-Rev: 17822
2005-06-07 23:38:09 +00:00
Andy Heninger
a992612ff4 ICU-4157 Add compatibility with earlier compiled break rules, in addition to the new format for Unicode 4.1
X-SVN-Rev: 17405
2005-03-28 05:21:50 +00:00
Eric Mader
14fbb48bf9 ICU-3770 Updated copyright notices for ICU 3.0
X-SVN-Rev: 15385
2004-05-18 22:01:41 +00:00
Alan Liu
3f4f09f5a9 ICU-3668 make getters const
X-SVN-Rev: 14787
2004-03-27 07:05:32 +00:00
Andy Heninger
fdb386fb81 ICU-3043 multiple status values implementation
X-SVN-Rev: 14640
2004-03-05 05:04:10 +00:00
Andy Heninger
a57f032f42 ICU-3437 rbbi debug functions, better conditional compilation.
X-SVN-Rev: 14006
2003-12-04 22:44:05 +00:00
Andy Heninger
840affc200 ICU-2924 rbbi builder, better handling of !!lookAheadHardBreak option
X-SVN-Rev: 13994
2003-12-04 02:12:42 +00:00
Vladimir Weinstein
276f4c40a0 ICU-2235 remove C++ style comments in order to allow real including from .c files
X-SVN-Rev: 13841
2003-11-24 19:48:37 +00:00
Markus Scherer
d5f3278e54 ICU-2235 allow header file inclusion from .c for the swap function declaration
X-SVN-Rev: 13751
2003-11-19 01:00:18 +00:00
Syn Wee Quek
558442a420 ICU-2292 line breaks passing on default option
X-SVN-Rev: 13636
2003-11-07 22:49:38 +00:00
Andy Heninger
a9cdcba39e ICU-2924 RBBI rule builder, changes for safe point rules. Work in progress.
X-SVN-Rev: 13578
2003-11-05 02:03:44 +00:00
Andy Heninger
212715d5fb ICU-2235 add RBBI data swapping
X-SVN-Rev: 13231
2003-09-29 17:24:15 +00:00
Markus Scherer
87ab6dcc57 ICU-2244 change internal classes to inherit UMemory, not UObject
X-SVN-Rev: 9960
2002-10-04 01:23:34 +00:00
Markus Scherer
11948b52df ICU-1962 change UObject: RTTI pure virtual, remove other boilerplate for now
X-SVN-Rev: 8977
2002-06-29 00:04:16 +00:00
Markus Scherer
5a81709c73 ICU-1962 derive all ICU C++ classes from common UObject base class
X-SVN-Rev: 8953
2002-06-27 01:19:20 +00:00
Andy Heninger
32c09250b7 ICU-45 new builder for RBBI rules, initial checkin
X-SVN-Rev: 8939
2002-06-25 17:23:07 +00:00