Commit graph

31607 commits

Author SHA1 Message Date
Frank Tang
9a2177c575 ICU-21569 Add GA to test LSTM configuration
1. Add GA to test BreakIterator under LSTM configuration (remove Thai
and Burmese dictionary and include Thai and Burmese LSTM)
2. Add LSTMDataName for the purpose of testing.
3. Add file base test code to test BreakIterator match results from test
file generated by pythong code in
https://github.com/unicode-org/lstm_word_segmentation/blob/master/segment_text.py
4. Fix a LSTMBreakEngine::divideUpDictionaryRange bug when the return value
should only contains the number of words found when the passed in foundBreaks
already contains some data.
5. Change the cintltest TestSwapData from testing thaidict to laodict so
it will not break while we filter out thaidict under the LSTM
configuration.
2021-04-30 20:02:09 -07:00
Frank Tang
253c54ab81 ICU-21569 Add BOM to the beginning of the lstm files (minor fix of Part 2 C++)
See #1702
2021-04-29 12:30:46 -07:00
Victor Chang
fa4c12cdc2 ICU-21567 Avoid using regex in ULocale.getName()
No behavior change is expected
2021-04-29 12:46:38 +02:00
Frank Tang
b13be666cd ICU-21597 Fix Null-deref W in canonicalizeLocale 2021-04-28 15:37:08 -07:00
Frank Tang
19c52a4fe3 ICU-21569 LSTM Part 2- include C++ implementation
See #1689
2021-04-28 00:51:11 -07:00
Andy Heninger
7577899ff3 ICU-21591 Release lock in SimpleDateFormat::tzFormat in case of failure
Also remove the use of the unsafe double-checked lock idiom in the same
function, SimpleDateFormat::tzFormat(). Synchronization now always uses a
mutex, which is slower, but in the context of format or parse operations,
shouldn't be significant.

Added synchronization to one more unsafe direct reference to a const
SimpleDateFormat::fTimeZoneFormat. In the assignment operator.
2021-04-27 18:12:31 -07:00
Frank Yung-Fong Tang
f3f24f1423 ICU-21569 LSTM Part 1- data file and build tool
See #1688
2021-04-23 17:22:45 -07:00
Frank Tang
4689706386 ICU-21593 Merge ReadAndConvertFile 2021-04-23 14:33:10 -07:00
Elango Cheran
581c427557 ICU-21523 Declare default impl for copy operator for GCC warning 2021-04-14 12:32:52 -07:00
Jeff Genovy
060687b1a2 ICU-21535 Remove AppVeyor from top-level Readme
The Cygwin build was moved from AppVeyor to Azure CI in PR #1694
https://github.com/unicode-org/icu/pull/1694

We should also remove it from the top-level Readme file as well.
2021-04-14 10:44:50 -07:00
Frank Tang
2dc5bea906 ICU-21587 Fix memory bug w/ baseName
Edge cases not fixed in assign and move assign operator
while the locale is long and call setKeywordValue with incorrect
keyword/values.
2021-04-13 16:33:06 -07:00
Frank Tang
45b893c1f9 ICU-21539 Fix getUnicodeKeywords to skip non keys
See #1683
2021-04-08 22:02:15 -07:00
Daniel Ju
7b971a2b23 ICU-21535 Port Cygwin CI build from AppVeyor to Azure Pipelines 2021-04-08 17:04:07 -07:00
Shane F. Carr
2ea794d35b
ICU-21578 Merge maint/maint-69 into main 2021-04-08 11:54:56 -05:00
Jeff Genovy
898238b110 ICU-21576 Bump guava from 27.1-jre to 30.0-jre 2021-04-07 17:24:17 -07:00
Shane Carr
bc22e421bf ICU-21578 Merging maint/maint-69 into main (conflict: icudata.jar) 2021-04-07 15:15:05 -07:00
Long Nguyen
0e7b442886 ICU-21560 mingw: Remove version numbers from link flags 2021-04-07 13:47:42 -05:00
Mojca Miklavec
ff702ad727 ICU-21513 check if TARGET_OS_SIMULATOR has been defined
clang 11 throws an error when using undefined variables starting with
TARGET_OS_, and TARGET_OS_SIMULATOR is missing in earlier versions
of macOS, so the code would not compile without an additional guard.

See also ICU-13440.
2021-04-07 13:47:42 -05:00
Frank Tang
b926f52688 ICU-21550 Add zzzz to subdivision if len==2
See #1662
2021-04-07 13:47:42 -05:00
Rich Gillam
866a640b70 ICU-21552 Changed to only display "Encountered empty table" warning in verbose mode. 2021-04-07 13:47:42 -05:00
Markus Scherer
152867f7ab ICU-21459 properly guard BytesTrie.Result.getValue()
and clone/copy objects so that objects shared among instances are not modified
and use an atomic int for the C++ refcount
2021-04-06 17:10:46 -07:00
Shane F. Carr
7fd2844d10 ICU-21540 Fix FormattedList test and behavior regression 2021-04-06 16:55:30 -05:00
Markus Scherer
928febc705 ICU-21561 rename StringSegment.equals() to contentEquals()
and remove hashCode() because of now-missing equals() and because
StringSegment is mutable and documented as not suitable for HashMaps
2021-04-05 17:00:31 -07:00
Jeff Genovy
c5406692bb ICU-21546 Fix warnings from running the samples with MSVC. 2021-04-05 13:59:43 -07:00
Elango Cheran
72de651291 ICU-21565 Remove Gemfile.lock from config files in repo for docs 2021-04-05 10:08:30 -07:00
Steven R. Loomis
2dce62892b ICU-21568 Pre-initialize CLDRConfig and SupplementalDataInfo
- avoid static init problem from CLDR-14636
2021-04-01 20:25:33 -07:00
Shane F. Carr
33ba23402d ICU-21545 Dump UCPTrie test data to golden files
See #1660
2021-04-01 20:29:49 -05:00
Erik Torres
7afcbdb551 ICU-21546 BRS69GA Update version numbers for 69GA
See #1677
2021-04-01 18:21:00 -07:00
Elango Cheran
75c58211c0 ICU-21310 replacement User Guide main index page
...and nav_order and link fixes from Elango. See #1680
2021-04-01 17:24:44 -07:00
Peter Edberg
f5d8b57d01 ICU-21413 Update laodict.txt per https://github.com/rober42539/lao-dictionary; add tests 2021-04-01 10:27:53 -07:00
Peter Edberg
544d097fa3 ICU-21546 brs 69, add note about genren process 2021-03-31 16:23:18 -07:00
Long Nguyen
fcd8c34f5e ICU-21560 mingw: Remove version numbers from link flags 2021-03-31 13:02:35 -07:00
Frank Tang
0ba723ded2 ICU-21563 Fix delete-incomplete on windows
See #1671
2021-03-31 12:02:58 -07:00
Frank Tang
bdb4c7025a ICU-21526 Change return of UElementComparator to int32_t
See #1621
2021-03-31 12:02:08 -07:00
Mojca Miklavec
05b20c1ccb ICU-21513 check if TARGET_OS_SIMULATOR has been defined
clang 11 throws an error when using undefined variables starting with
TARGET_OS_, and TARGET_OS_SIMULATOR is missing in earlier versions
of macOS, so the code would not compile without an additional guard.

See also ICU-13440.
2021-03-31 10:49:30 -07:00
Frank Tang
78c3e1ed6d ICU-21562 Add include to fix std:abs build problem 2021-03-30 21:33:22 -07:00
Markus Scherer
46720c4120 ICU-21559 more guards for self-assignment 2021-03-30 12:19:11 -07:00
Frank Tang
15d3f9f65d ICU-21550 Add zzzz to subdivision if len==2
See #1662
2021-03-29 19:04:53 -07:00
Peter Edberg
e7db5754ba ICU-21546 integrate CLDR release-39-beta2 to ICU maint-69, just affects cldr-icu pom 2021-03-25 17:02:23 -07:00
Elango Cheran
35fe8534f2 ICU-21310 Consolidate ICU4C and ICU4J Readmes via the User Guide 2021-03-24 19:03:53 -07:00
Hugo van der Merwe
0ad4614a04 ICU-21508 Rename 'master' to 'main' in docs, maven files, scripts, etc
See #1664
2021-03-24 18:56:50 +01:00
Elango Cheran
227c729b0e ICU-21117 Use Bazel to automate generation of Unicode data files 2021-03-24 10:39:38 -07:00
Rich Gillam
29d2e85041 ICU-21552 Changed to only display "Encountered empty table" warning in verbose mode. 2021-03-22 17:03:25 -07:00
Steven R. Loomis
2b3a3724c7 ICU-20233 tool: improvements to commit checker for CLDR
- improve checks for CLDR resolutions
- update batch fetch from 50 to 100
- do not show no-commit tickets under illegal-state
- add navigation
- use numeric IDs for status, resolution, types with constants
 (S_, R_, I_)

fix bugs from CLDR-14224
- do not show not-in-query issues as open
- cache single-issue queries
- give information on each problem as to what it is
- put 'review' in a separate section (not counted in Problems)
- more detailed fix info under each issue

- fix section headings, always show component, add nocopyright option
- sort open issues by componnent
- sort the 'review' list

ICU-20444:
- exclude (via git-cherry) any commits already cherry picked to the branch.
- used a workaround to https://github.com/gitpython-developers/GitPython/issues/846
that seemed to work well,
- warn about issues with excluded commits

CLDR-14524:
- Exclude "Obsolete" from requiring commits
2021-03-17 23:28:04 -05:00
Shane F. Carr
0bd125eb45 ICU-21480 Update double-conversion 2021-03-17 19:45:13 -05:00
Jeff Genovy
8c9c14e64c ICU-21542 Have GHA output ICU4J test failures if there are any. 2021-03-17 15:19:11 -07:00
Frank Tang
cec7de7a39 ICU-21537 Fix invalid free by long locale name
Do not free baseName if it is pointing to fullNameBuffer.

Better Fix
2021-03-17 10:34:27 -07:00
gnrunge
3d7ba6560e ICU-21520 Fixes typo in name of test data file; removes a regex that
evaluates to the empty string and the result files are missing the
perf test name as a result. E.g. '_Mar_16_155820.html' instead of
'unisetperf_Mar_16_155820.html'.
Not obvious why the regex was even created since it doesn't seem
necessary because the name of the caller doesn't require additional
parsing. The rpaticular code is from 2002, maybe changes in Perl
account for the issue(?)
2021-03-16 20:01:16 -07:00
Elango Cheran
2ae2f31c2f ICU-21480 Change ICU4C/J Readme update date after version change 2021-03-16 13:34:06 -07:00
Craig Cornelius
bc7e2e230c ICU-21480 Scrub TODOs for completed issues 21292 and 21236
See #1652
2021-03-16 11:17:14 -07:00