Commit graph

32889 commits

Author SHA1 Message Date
DraganBesevic
d04154833c ICU-22723 Integrate CLDR 46 release alpha0, part 3, source files 2024-08-14 17:09:35 -07:00
DraganBesevic
b865f26876 ICU-22723 Integrate CLDR 46 release alpha0, part 2, data files 2024-08-14 17:09:35 -07:00
DraganBesevic
24966d7a21 ICU-22723 Integrate CLDR 46 release alpha0, part 1, binary files 2024-08-14 17:09:35 -07:00
Fredrik Roubert
37b2bc6999 ICU-22721 Use correct initializer list syntax.
This will make the code ever so slightly simpler but more importantly
make it possible to compile also when using -D_GLIBCXX_DEBUG.
2024-08-13 21:33:53 -07:00
Fredrik Roubert
bae2aa65d8 ICU-22696 Avoid unnecessary copies of already NUL terminated strings. 2024-08-13 21:15:26 +02:00
Markus Scherer
72206495de ICU-22843 UnicodeString <-> std::u16string_view / wstring_view via templates 2024-08-13 09:10:01 -07:00
Fredrik Roubert
7ffbe77e12 ICU-22696 Update ulocimp_setKeywordValue() to use std::string_view. 2024-08-13 14:03:18 +02:00
Frank Tang
0bd2b4d10c ICU-22849 Fix memLeak in RBBIRuleBuilder by using LocalMemory 2024-08-09 16:19:06 -07:00
Mihai Nita
b5b3e16afa ICU-22845 Better iterations for the ICU4J UnicodeSet 2024-08-09 08:56:25 -07:00
StefanStojanovic
66ba09973a ICU-22787 Fix ClangCL compilation on Windows 2024-08-09 10:54:21 +05:30
Tim Chevalier
57ed0a2a53 ICU-22794 MF2: Move .json files for tests into top-level testdata/ directory
Modify ICU4C and ICU4J test readers to handle all tests

Add `ignoreJava` and `ignoreCpp` properties to tests where needed

Includes parser bug fixes:

ICU4J: require a complex-body after declarations

ICU4J: Correctly parse the complex body after an unsupported statement

ICU4J: Handle date params in tests and remove default params for tests

ICU4J: Handle decimal params in tests

ICU4J: Require whitespace before variable/literal in reserved annotation

ICU4J: Require whitespace between options

ICU4J: Require a variable-expression in an .input declaration

ICU4J: don't require space between last key and pattern in variant

ICU4J: don't require space between selectors

ICU4J: allow whitespace after '=' in option

ICU4J: parse escape sequences in quoted literals according to grammar

ICU4J: allow whitespace within markup after attributes list
2024-08-08 09:14:44 -07:00
Mihai Nita
366bb463b1 ICU-22722 Update the ICU Vice-Chair for Maven publishing 2024-08-07 07:36:23 -07:00
Fredrik Roubert
8a6d59ec80 ICU-22696 Update ulocimp_to*{Key,Type}() to use std::string_view. 2024-08-07 14:14:23 +02:00
Frank Tang
1eb0ed2fad ICU-22818 Fix memory leak during error in messageformat2_data_model.cpp
See #3077
2024-08-06 15:39:20 -07:00
Frank Tang
a22dc93e3a ICU-22825 Fix memLeak during error in tznames_impl.cpp
Rewrite the TextTrieMap::put() which should delete the value
during error instead of deleting key.
Rewrite to simplified the error handling.
2024-08-06 09:58:15 -07:00
Frank Tang
6de4472db0 ICU-22820 Fix memLeak during error in genrb
Fix also ICU-22821 ICU-22822
2024-08-06 18:00:27 +02:00
Frank Tang
09ae31427e ICU-22830 Fix memLeak in numrange_fluent.cpp
Maybe related to the leak in ICU-22800
2024-08-05 19:30:54 -07:00
Frank Tang
575bb781cf ICU-22831 Fix memLeak in number_longnames.cpp by using LocalPointer
Maybe fix issues in ICU-22800
2024-08-05 19:28:34 -07:00
Frank Tang
6c8c6aa430 ICU-22827 Fix memLeak by using LocalUResourceBundlePointer 2024-08-05 16:34:16 -07:00
Frank Tang
2f5a46ffd5 ICU-22819 Fix memory leak during error 2024-08-05 11:41:06 -07:00
Frank Tang
49867907f9 ICU-22824 Fix mem Leak during error in uidna.cpp 2024-08-05 11:31:14 -07:00
Frank Tang
bca5fa5004 ICU-22826 Fix memory leak 2024-08-01 13:46:58 -07:00
Fredrik Roubert
dd65ee3f0b ICU-22696 Update ulocimp_getKeywordValue() to use std::string_view. 2024-07-31 15:39:15 +02:00
Elango Cheran
3663cc1e0f ICU-22707 Add support for property Modifier_Combining_Mark 2024-07-30 15:44:32 -07:00
Fredrik Roubert
10fe2a6110 ICU-22696 Add uhash support for std::string_view. 2024-07-30 06:45:43 +02:00
Fredrik Roubert
8891c070bd ICU-22696 Add implicit conversion from StringPiece to std::string_view.
This will allow ICU4C to seamlessly use std::string_view internally
while continuing to use StringPiece in the public API.
2024-07-30 06:45:33 +02:00
Fredrik Roubert
5d7cbdbc02 ICU-22696 Delete unused code.
These optional output parameters weren't used when these function were
originally added so they were most likely included just in case someone
would want to use them in the future, but that was 10 years ago now and
they still haven't been used yet, so it's unlikely that they'll be used
in the foreseeable future and call sites as well as the implementation
can instead be simplified by removing them.
2024-07-29 22:03:10 +02:00
Elango Cheran
06c077bd35 ICU-22503 add property Indic_Conjunct_Break 2024-07-26 14:47:39 -07:00
Peter Edberg
2cbfd134ef ICU-22741 Update CLDR-ICU instructions to mention some hardcoded lists that may need updating
Co-authored-by: Steven R. Loomis <srl295@gmail.com>
2024-07-26 12:17:00 -07:00
Frank Yung-Fong Tang
23d9628f88 ICU-22801 Try to add LEAKSANITIZER
See #3041
2024-07-23 09:34:04 -07:00
Frank Yung-Fong Tang
40b2ec3c37 ICU-22814 Add CIFuzz to ICU
See #3059
2024-07-19 15:53:12 -07:00
Carlos Alberto Lopez Perez
8ca6bc7545 ICU-22813 Rise the size of the buffers used for the command strings at pkgdata
The tool pkgdata uses snprintf() to build the strings of the commands that
will execute later during the install process. But the maximum size of this
buffers is not enough when there is a long path.

This has caused issues on some CI systems that use very long paths, causing
the install process to produce a wrong result.

The maximum path on Linux is 4096 (defined as PATH_MAX at <linux/limits.h>)
So the size of SMALL_BUFFER_MAX_SIZE should be 4096 to avoid errors related
to truncated paths.
2024-07-19 18:02:53 +00:00
Robin Leroy
4acb4724cf ICU-22707 Copy new monkey rules to ICU4J 2024-07-18 23:56:34 +00:00
Robin Leroy
d1490896ed ICU-22707 Fix an ancient bug in moveIndex32 2024-07-18 23:56:34 +00:00
Robin Leroy
b3ec8d1abd ICU-22707 Port the old monkey rule changes to ICU4J 2024-07-18 23:56:34 +00:00
Robin Leroy
7a52b0602a ICU-22707 Port the line monkey partition to ICU4J 2024-07-18 23:56:34 +00:00
Robin Leroy
80c3389bcd ICU-22707 Copy data-driven test file to ICU4J 2024-07-18 23:56:34 +00:00
Markus Scherer
558b3c02a6 ICU-22707 generate ICU4J .brk files 2024-07-18 23:56:34 +00:00
Robin Leroy
a53a8bdc73 ICU-22707 UTC-179-C28 Improved expectation 2024-07-18 23:56:34 +00:00
Robin Leroy
7d14e8db45 ICU-22707 Patch tailored new monkeys, manually for the last hunk on line_(loose|normal)_cj 2024-07-18 23:56:34 +00:00
Robin Leroy
3a004d400f ICU-22707 Patch tailored rules (manually for hunks 1 and 6 on loose(_phrase)?_cj) 2024-07-18 23:56:34 +00:00
Robin Leroy
782d5cc339 ICU-22707 UTC-179-A102 Consider using a macro throughout the rules for [\p{ea=F}\p{ea=W}\p{ea=H}]. 2024-07-18 23:56:34 +00:00
Robin Leroy
20fdebcb35 ICU-22707 UTC-180? Give up on 16.0β rules, amend LB10 and LB21a instead. 2024-07-18 23:56:34 +00:00
Robin Leroy
cc64ec7c94 ICU-22707 feed more bits to the starving monkeys: ranlux48 rather than a 32-bit LCG 2024-07-18 23:56:34 +00:00
Robin Leroy
47a8ea4065 ICU-22707 smarter old monkeys: refine the partition on interesting sets 2024-07-18 23:56:34 +00:00
Robin Leroy
600011eb7d ICU-22707 LB19 rules that seem to actually work with ea=W BA and CM. Update default new monkeys. 2024-07-18 23:56:34 +00:00
Robin Leroy
6d3b988fd0 ICU-22707 UTC-179-C30 [\p{lb=IS} & [\p{ea=F}\p{ea=W}\p{ea=H}]] has been emptied 2024-07-18 23:56:34 +00:00
Robin Leroy
5d7df44d2f ICU-22707 UTC-179-C28 examples from https://github.com/unicode-org/properties/issues/197 = L2/24-064 §5.5 (also in LineBreakTest.txt but less legible). 2024-07-18 23:56:34 +00:00
Robin Leroy
79f4745494 ICU-22707 UTC-179-C25 Limit LB21a to the Hebrew-hyphen-non-Hebrew case 2024-07-18 23:56:34 +00:00
Robin Leroy
83f3334b96 ICU-22707 UTC-179-C32 Upstream and improve the old Finnish tailoring LB20a from CLDR-3029 and ICU-8151 2024-07-18 23:56:34 +00:00