Commit graph

4881 commits

Author SHA1 Message Date
Andy Heninger
e6892996b1 ICU-22584 Fix RBBI rule builder stack overflow.
The problem was found by fuzz testing.

A rule consisting of a long literal string produces a large, unbalanced parse tree,
one node per string element. Deleting the tree was recursive, once per node, resulting
in deep recursion.

This PR changes node deletion to use an iterative (non-recursive) approach.

This change only affects rule building. There is no change to the RBBI run time
using pre-built rules.
2023-12-08 12:49:26 -08:00
Rich Gillam
bcae6f2a43 ICU-22575 Change AvailableFormatsSink to allow locales to inherit availableFormats items from the root locale. 2023-12-04 12:47:50 -08:00
Fredrik Roubert
d2a5638336 ICU-22522 Make operator==() const to resolve C++20 ambiguity. 2023-12-04 16:17:20 +01:00
Frank Tang
d93c7b60fc ICU-22568 return TimeZomeFormat::createInstance for bogus locale 2023-12-01 15:10:41 -08:00
Fredrik Roubert
f99f8c678b ICU-22522 Delete unused variables.
Clang 16 is more thorough in finding unused variables, so these must be
removed to be able to compile this code using Clang 16 and -Werror.
2023-11-30 15:34:36 +01:00
Frank Tang
276d3dc865 ICU-22493 Implement First Day Override in Calendar 2023-11-29 11:55:51 -08:00
Frank Tang
757d2cd90a ICU-22555 Fix infinity loop in RuleBasedCollator constructor
Fix C++ and Java code.
Add unit tests for both C++ and Java.
2023-11-29 11:31:27 -08:00
Frank Tang
fa6a4661ba ICU-22545 Fix addLikelySubtags for pseudo Locales 2023-10-31 15:00:34 -07:00
Frank Tang
1bb711ad20 ICU-22540 Add new CLDR units ronto, ronna, quecto, and quetta 2023-10-31 14:58:52 -07:00
Shane F. Carr
b070c932ad
ICU-22560 Merge ICU 74 maintenance branch to main (#2689) 2023-10-30 13:50:52 -07:00
Frank Tang
92eeb45811 ICU-22547 fix addLikelySubtags for 4 chars script code
Also fix ICU-22546 to correct the comments in the API doc
and add additional unit tests
2023-10-27 17:29:05 -07:00
Frank Tang
cdab88ff4e ICU-22513 Return error if days is too large in IslamicUmalquraCalendar
If the year is too large it may overflow the int32_t variable and cause
slow or infinity loop, return error if the year is too large that the
conversion to day may overflow int32_t. Limit the value to max value of
int32_t divide by 400.
2023-10-26 17:09:41 -07:00
Peter Edberg
2f7bfd87cb ICU-22326 CLDR release-44-beta5 to ICU main part 3 (ICU sources: lib, tools, tests) 2023-10-26 10:59:18 -07:00
DraganBesevic
597e3110a5 ICU-22325 CLDR 44 beta2 integration to ICU part three, source files changes 2023-10-04 15:18:56 -07:00
Frank Tang
d82ad9975d ICU-22526 Allow GMT-23:59 time zone
ECMA402 now allow offset timezone from -23:59 to 23:59.
We need to lower the minimum ZONE_OFFSET value to -23:59
2023-10-04 14:43:36 -07:00
Fredrik Roubert
c587069895 ICU-22527 Add -Wambiguous-reversed-operator test code. 2023-10-04 00:05:31 +02:00
Peter Edberg
e1af930c6a ICU-22325 BRS 74rc move cldr testdata to consistent place, adjust test & tools to match 2023-10-03 10:24:27 -07:00
Frank Tang
05b0e7abaf ICU-22517 Limit the closure expansion loop and return error
To avoid very slow return from the constructor, we return
error while the Collation rule expand too big.
Add a soft limit to limit to the number of loop needed for 8 Hanguls
  Necessary number of loop: H(0)=0; H(i)=3H(i-1)+2.
  Where i is the length of Hangul in the rule.
  H(1) = 2, H(2) = 8, H(3)=26, H(4)=80, H(5) = 242 ...
2023-10-02 19:06:38 -07:00
Markus Scherer
f6d09d514d ICU-22527 add Measure::operator!=() 2023-10-02 17:03:28 -07:00
Frank Tang
07137b64e4 ICU-22521 Return U_INTERNAL_PROGRAM_ERROR instead of % 0
When the gap is 0, return status as U_INTERNAL_PROGRAM_ERROR
and avoid the operation of "% gap"
2023-09-28 14:23:07 -07:00
Younies Mahmoud
c4689841c0 ICU-22463 Fix the conversion from gasoline-equivalent units to kilograms-per-meter-squared-per-second
See #2616
2023-09-23 06:48:25 +02:00
Frank Tang
4fcf8d22b9 ICU-22507 Fix stack overflow in ChineseCalendar::isLeapMonthBetween
Rewrite the recursive call to while loop to avoid stack overflow
when the two values have big gap.
Include tests to verify the problem in unit test.
2023-09-20 23:56:15 -07:00
Rich Gillam
1b980e5999 ICU-21877 Fixed it so that getAllChildrenWithFallback() correctly calls its sink with every possible resource the
one the user requested might be inheriting elements from.
2023-09-20 14:09:21 -07:00
Frank Tang
f3b869cbb0 ICU-22512 Fix broken TestHebrewCalendarInTemporalLeapYear
Fix broken test mistakenly landed in
https://github.com/unicode-org/icu/pull/2274

Some important steps were missed in the last landing.
2023-09-19 09:47:03 -07:00
Markus Scherer
d8659b476d ICU-22404 new properties IDS_Unary_Operator, ID_Compat_Math_*, NFKC_SCF 2023-09-16 14:41:51 -07:00
Rich Gillam
14ca2b0e6d ICU-22313 Various fixes for duration formatting:
- Changed the C++ and Java interfaces to that the URBNF_DURATION ruleset is marked deprecated.
- Fixed a bug in RuleBasedNumberFormat in both Java and C++ that caused the existing duration-formatting rules to produce bogus results when used on a non-integral value.

(Earlier versions of this PR added code to use a MeasureFormat under the covers when a caller used
unum_open(UNUM_DURATION).  I took that out because of backward compatibility concerns, so we're still using RBNF
in the C API.  I'm hoping to add a "real" duration formatter in ICU 75.)
2023-09-15 16:27:58 -07:00
Peter Edberg
28572ab67e ICU-22325 CLDR release-44-alpha3 to main part 3 (ICU sources: lib, tools, tests) 2023-09-15 14:02:20 -07:00
Frank Tang
386e9a10db ICU-22504 Fix buffer overflow write error 2023-09-15 11:01:37 -07:00
Frank Tang
9fb9bd4950 ICU-22342 Rename fillBreak to fillBreaks 2023-09-14 10:04:57 -07:00
DraganBesevic
bb7352990e ICU-22325 CLDR 44 alpha2 integration to ICU part three, source files changes 2023-09-13 11:06:53 -07:00
Frank Tang
68a61daa95 ICU-22497 Fix buffer-overflow READ for toLanguateTag 2023-09-12 15:21:26 -07:00
Rich Gillam
21f74b3698 ICU-22455 Implemented algorithm in CLDR-16981 to preserve regional unit overrides when they don't conflict
with the ms subtag.
2023-09-05 16:52:18 -04:00
Mihai Nita
5fb2a6ad06 ICU-22324 Mavenization, updating the cldr-to-icu scripts and instructions 2023-09-05 10:24:23 -07:00
Frank Tang
02d5e71903 ICU-22342 Implement ExternalBreakEngineAPI
ICU-22342 Fix comments
2023-08-30 11:43:16 -07:00
Rich Gillam
8817c25c1e ICU-22449 Fixed SimpleDateFormat (in C++ and Java) to correctly honor the rg and hc subtags in the locale when choosing the hour cycle. 2023-08-28 13:30:45 -04:00
Robin Leroy
f79fe9347a ICU-22332 bidiSkeleton and LTR- and RTL-confusabilities 2023-08-23 15:56:02 +02:00
Peter Edberg
2270c174a5 ICU-22325 CLDR release-44-alpha1 to main:
- binaries, binary-as-source, CLDR data sources;
  - CLDR test data & dtd, ICU lib/tool/test source updates.
2023-08-22 14:40:51 -07:00
Frank Tang
667ee72b7c ICU-22457 count() of getUnicodeKeywords is incorrect
The count() is incorrect if the Locale contains extension which is not -u-
for example -x-, -t-.

Currently, this PR only contains tests to show the problem.

ICU-22457 Fix the enum_count
2023-08-21 14:44:10 -07:00
Frank Tang
ffc449de62 ICU-20777 Merge the likelySubtags implemention
Change testdata/likelySubtags.txt to consider FAIL line

ICU-20777 Fix Java Tests

ICU-20777 Fix all issues

ICU-20777 Incase timeout

ICU-20777

ICU-20777 Skip Data Driven test
2023-08-18 09:35:54 -07:00
Markus Scherer
81a6edb287 ICU-22404 Unicode 15.1 data 20230811 plus UTS46 fix 2023-08-16 14:25:22 -07:00
yumaoka
6338b704ff ICU-22452 API for resolving preferred IANA zone ID from a zone ID. 2023-08-11 17:00:05 -04:00
James Choi
b70b2d0433 ICU-22363 Add support for QNX 7.1
Signed-off-by: James Choi <chachoi@blackberry.com>
2023-08-10 17:46:01 -07:00
Robin Leroy
a6fc915e05 ICU-22404 Strip default ignorable code points in the skeleton for confusable detection 2023-08-10 17:55:14 +02:00
Robin Leroy
47e47ec439 ICU-22433 uppercase decomposed greek to decomposed greek and precomposed greek to precomposed greek. 2023-08-08 22:20:20 +02:00
DraganBesevic
1f07d2b29f ICU-22325 Integrate CLDR 44.1 to ICU, add personName testdata, fix RBBITestMonkey 2023-07-28 16:53:50 -07:00
Frank Tang
19bac42f98 ICU-22446 Fix incorrect Hebrew ORDINAL MONTH bug 2023-07-27 22:18:00 -07:00
Frank Tang
03e98c04f3 ICU-22442 Sync the spelling of NEHASSA to ICU4J NEHASSE
ICU4J has that as part of public API but ++ is private
so we should sync to the ICU4J one.
2023-07-27 09:56:31 -07:00
Elango Cheran
2e45e6ec0e ICU-22404 Unicode 15.1 beta data files & API constants
See #2492

Co-authored-by: Andy Heninger <andy.heninger@gmail.com>
Co-authored-by: Robin Leroy <egg.robin.leroy@gmail.com>
2023-07-13 19:26:14 -07:00
Frank Tang
5826bf7ed7 ICU-22434 Not calling setFirstDayOfWeek(MONDAY) if the locale has fw
The Calendar constructor already take care of the fw override.
We should not set the first day of week for iso8601 to Monday if
we have a fw keyword/type in the locale.

ICU-22434 Fix incorrect calendar keyword extraction
2023-07-13 09:49:32 -07:00
Peter Edberg
70d308731a ICU-22325 BRS 74 front-load update version to 74.0.1 2023-07-10 16:45:41 -07:00