31 commits

Author SHA1 Message Date
Frank Tang
02d5e71903 ICU-22342 Implement ExternalBreakEngineAPI
ICU-22342 Fix comments
2023-08-30 11:43:16 -07:00
Shuhei Iitsuka
b6b7b045e9 ICU-22100 Incorporate BudouX into ICU (C++) 2022-12-02 10:11:06 -08:00
allenwtsu
a7b2d9dae1 ICU-21699 Add Japanese particle 2022-02-10 18:50:41 -08:00
allenwtsu
2a7c465284 ICU-21699 Add breakpoint between Japanese and Alphabet 2022-02-09 21:12:49 -08:00
allenwtsu
d0290c03db ICU-21699 Phrase based breaking(C++)
See #1936
2022-01-13 20:22:05 -08:00
Frank Tang
704415402a ICU-21847 Move UnicodeSet to stack in constructor
See #1941
2021-12-01 22:23:47 -08:00
Peter Varga
1986dcd0d8 ICU-21734 Add missing overrides 2021-09-07 17:42:14 -07:00
Frank Tang
4136fa207f ICU-21569 Propagate LSTM memory allocation issue 2021-05-04 15:09:54 -07:00
Markus Scherer
a18df7ba28 ICU-21267 stop using FALSE & TRUE macros in most library-internal headers 2020-09-10 13:55:46 -07:00
Andy Heninger
d7f2cd98d3 ICU-10688 Break Iterator type logic removal, a few minor cleanups.
X-SVN-Rev: 40689
2017-12-05 00:07:30 +00:00
Andy Heninger
ca7b62180e ICU-10688 branch, work in progress.
X-SVN-Rev: 40686
2017-12-03 00:36:54 +00:00
Andy Heninger
4e1c4096a6 ICU-9954 Break Iteration, remove reverse rules, add boundary caching.
X-SVN-Rev: 40433
2017-09-19 18:17:22 +00:00
Andy Heninger
242e02c388 ICU-12764 icu4c utf-8 source files, update Copyright notices.
X-SVN-Rev: 39583
2017-01-20 00:20:31 +00:00
Michael Ow
61607c2773 ICU-12564 Update copyright notice in trunk
X-SVN-Rev: 38848
2016-06-15 18:58:17 +00:00
Yoshito Umaoka
00ca13e126 ICU-12564 Reverted r38761 and r38762, because we want to prepend the Unicode copyright for existing source files, instead of replacing copyright comments.
X-SVN-Rev: 38776
2016-05-31 21:45:07 +00:00
Michael Ow
c9f199a30f ICU-12564 Update copyright notice in ICU4C
X-SVN-Rev: 38761
2016-05-26 22:32:17 +00:00
Peter Edberg
d87c86274c ICU-10326 Add dictionary-based word/line break for Burmese/Myanmar
X-SVN-Rev: 36397
2014-09-08 22:16:21 +00:00
Andy Heninger
f71b9053d2 ICU-8550 Dictionary Break Iterator, fixes to work with UTF-8 text.
X-SVN-Rev: 35724
2014-05-17 00:44:39 +00:00
Peter Edberg
bf4126616b ICU-7647 Add/use LaoBreakEngine and laodict.txt; more useful messages in gendict
X-SVN-Rev: 34229
2013-09-06 23:43:13 +00:00
Steven R. Loomis
1361355eb5 ICU-9551 fix uconfig failures that were in m2, disable CJKBreak without normalization. Merges from r32354 and r32355 by dsengup.
X-SVN-Rev: 32367
2012-09-11 04:02:08 +00:00
Markus Scherer
6da17ae2f5 ICU-9353 fix dictionary class name in comments
X-SVN-Rev: 32330
2012-09-04 20:05:20 +00:00
Maxime Serrano
c64c0299d7 ICU-9353 merge dbbi-tries work into the trunk
X-SVN-Rev: 32184
2012-08-16 23:01:49 +00:00
Steven R. Loomis
ff960d1f97 ICU-8714 - reimplement r30379 - now passes tests
X-SVN-Rev: 30383
2011-07-18 21:54:42 +00:00
Steven R. Loomis
fd3246217f ICU-8714 backing out r30379 - broke the build
X-SVN-Rev: 30380
2011-07-18 19:51:16 +00:00
Steven R. Loomis
ba3834755c ICU-8714 patches from George
X-SVN-Rev: 30379
2011-07-18 19:13:31 +00:00
Peter Edberg
7aaca9b950 ICU-8329 Roll in Khmer dictionary word break code from George, data from Nathan/sbbic.org
X-SVN-Rev: 30019
2011-05-04 13:25:37 +00:00
George Rhoten
c34f0ecd83 ICU-5426 Compact cached UnicodeSets, and make setCharacters take a const UnicodeSet.
X-SVN-Rev: 20460
2006-09-30 00:37:45 +00:00
George Rhoten
6e035da1d5 ICU-5032 Comment out unused @internal code.
X-SVN-Rev: 19656
2006-05-26 00:57:09 +00:00
Andy Heninger
9f85d5dd08 ICU-5170 move RBBI from CharacterIterator to UText
X-SVN-Rev: 19579
2006-04-22 05:29:27 +00:00
George Rhoten
7bc8795cf7 ICU-5151 Hide internal functions.
X-SVN-Rev: 19524
2006-04-08 08:34:52 +00:00
Deborah Goldsmith
490cb834fa ICU-5117 Thai break should work in all locales
X-SVN-Rev: 19408
2006-03-23 00:54:12 +00:00