Commit graph

23081 commits

Author SHA1 Message Date
Markus Scherer
f9beb616a8 ICU-21652 add emoji properties of strings
- 7 new properties: API constants & property names
- u_stringHasBinaryProperty(s, property) & UCharacter.hasBinaryProperty(s, property)
- two additional source data files
- new genprops part for writing new binary data file uemoji.icu
- data for existing emoji properties moved from uprops.icu (hardcoded in C++) to uemoji.icu (always loaded)
- new EmojiProps implementation
2021-09-08 12:15:50 -07:00
Peter Edberg
6244d57559 ICU-21685 In list of currencies from ucurr_openISOCurrencies(): add VES, remove EQE 2021-09-08 10:49:14 -07:00
Peter Varga
1986dcd0d8 ICU-21734 Add missing overrides 2021-09-07 17:42:14 -07:00
Peter Varga
90ee0c788f ICU-21734 Add missing overrides to pure virtual methods
These methods are pure virtual in the base class too.
2021-09-07 15:34:48 -07:00
Peter Varga
80d42a13fc ICU-21734 Add missing overrides in tests 2021-09-07 13:56:33 -07:00
Rich Gillam
19b8b076f9 ICU-20992 Added new API for iterating child resources while following aliases, and changed DateTimePatternGenerator
to use it.
2021-09-03 12:47:24 -07:00
Rich Gillam
01e1adc9e4 ICU-21460 Changed the ULocale initializers to allow locale IDs that use BCP47 syntax, but with '_' as a field delimiter.
(APIs that specifically require BCP47 syntax are unaffected-- they still require '-').
2021-09-03 12:47:02 -07:00
yumaoka
369357aaa7 ICU-21530 Added U_HIDE_DEPRECATED_API to deprecated methods. 2021-09-03 09:04:54 -04:00
Andy Heninger
081cf77330 ICU-21662 Improve UVector error handling.
- Add updated versions of UVector::addElement() and ensureCapacity() that respect
  incoming errors.
  Follow on to c26aebe, which renamed the original versions.

- Add UVector::adoptElement() as a replacement for addElement() when the vector
  has a deleter function set, meaning that it adopts ownership of its elements.

  The intent is to make the behavior clearer at the call sites when looking
  at unfamiliar code.

- Make all functions with an incoming failure, as indicated by a UErrorCode parameter,
  leave the vector unchanged.

- Change all functions that store object pointers into the vector such that,
  when the store cannot be completed for any reason _and_ the vector has a deleter function,
  then the incoming object is deleted.

  This change can simplify the error handling code around calls to the affected functions
  (addElement() and insertElementAt(), in particular)

- Add index bounds checking on functions where it was possible - that is, on functions
  with both U_ErrorCode and index parameters.

- Changed to more modern C++ idioms in some parts of the UVector implementation.

- Review & update as required all uses of the UVector functions
  setElementAt(), insertElementAt(), setSize(), sortedInsert()
  these being the functions with changed behavior on error conditions
  (aside from addElement()).

This PR will be followed by more, switching call sites in various ICU services
from UVector::addElementX() (old behavior on errors)
to   UVector::addElement()  (new behavior on errors)
2021-09-02 19:15:36 -07:00
Younies Mahmoud
33e1d87041 ICU-21551 Fixing unit prefix normalization order to match LDML spec
See #1754
2021-09-02 12:23:33 +02:00
Jeff Genovy
08dbf093ce ICU-21747 Upgrade WarningsAsErrors build bot to Clang-13 on Ubuntu 20.04 2021-09-01 17:04:42 -07:00
Peter Edberg
bbd72fb93e ICU-21581 integrate CLDR release-40-alpha2 to ICU trunk 2021-09-01 16:49:45 -07:00
Markus Scherer
e121dde220 ICU-20769 getAliasTargetAsResourceBundle() only needs ResourceData 2021-09-01 13:56:11 -07:00
Jeff Genovy
cb351ee077 ICU-21740 Output source code line when an error occurs when building with MSVC. 2021-09-01 11:48:17 -07:00
Jeff Genovy
b10a467212 ICU-21579 Fix compiler warnings with MSVC. 2021-08-31 14:03:21 -07:00
Younies
f811ae946f ICU-21730 Check unit aliases support.
See #1804
2021-08-31 18:34:12 +02:00
Markus Scherer
a300d8dd02 ICU-20769 refactor UResourceBundle.fResData & init_resb_result()
- replace UResourceBundle.fResData with access via fData
- change ResourceDataValue.resData into a pointer
- move C tests to C++ that stack-allocate now-C++ UResourceBundle
- init_resb_result():
  - remove redundant ResourceData parameter
  - simplify memory management, and various small simplifications
  - /LOCALE/path reuse valid locale, no ures_open()
  - pull alias-fetching code into separate function getAliasTargetAsResourceBundle()
  - `const char *` for keyPath pieces
  - s/noAlias/recursionDepth/
- getAllItemsWithFallback() set correct parentBundle.fValidLocaleDataEntry
2021-08-31 02:26:33 +00:00
Jeff Genovy
5b96c2e847 ICU-21710 Additional clean up after removing BOYER_MOORE code from usearch.cpp
Changes:
- We can completely remove the shift tables and related fields from
  data structs.

- Creation of UStringSearch objects should be faster now,
  as it doesn't waste time computing the unused shift tables.

- The sizeof(UStringSearch) is decreased from 5240 to 3192, so
  this should help to reduce memory for applications that create many string search objects.

Note regarding the comments on initialize(). It actually does not set illegal argument error if pattern is all ignoreables. Added a test case for it.
2021-08-30 15:08:16 -07:00
Erik Torres
2b895a71e1 ICU-21555 Fix typos in repo from letter d to i
Fix typos in repo from letter d to i
2021-08-30 16:45:43 -05:00
Fredrik Roubert
0a1cfa398c ICU-20973 Use standard keywords true & false to initialize type bool.
Now when all equality operators return standard bool (commit 633438f),
it no longer makes any sense to use the ICU4C constants TRUE & FALSE
or local variables of type UBool for their return value.
2021-08-26 18:53:10 +02:00
Jeff Genovy
6d850be783 ICU-21710 Remove BOYER_MOORE dead code from usearch.cpp 2021-08-25 19:48:17 -07:00
Jeff Genovy
9ddda243d7 ICU-21705 ICU-21706 Fix crash if ICU's default locale has BCP47 Unicode Extensions, and fix ures_openDirect crash with NULL locale ID.
Add test case for ures_openDirect with NULL locale ID.
2021-08-25 19:47:15 -07:00
Fredrik Roubert
3534d337c6 ICU-20973 Rewrite polymorphic CacheKeyBase equality operators for C++20.
The existing polymorphic equality operators that use different types for
the `this` and `other` objects are ambiguous with C++20 resolution rules
that require equality for reversed arguments.

In order to resolve that, while also possibly making the implementation
somewhat simpler overall, the implementation classes (LocaleCacheKey
and DateFmtBestPatternKey) now get normal (non-polymorphic) equality
operators that are trivially non-ambiguous (and as a bonus also don't
need any type casts), while the dynamic type checking logic is moved
into protected helper functions, which in the end are invoked
(without any ambiguity) by friend operators in the base class.

This way, all equality testing of cache key objects ends up taking one
of these two possible paths:

1. Both sides of the equality operator are of the same implementation
   type (ie. LocaleCacheKey or DateFmtBestPatternKey):

   The type specific equality operator is called directly, comparing the
   relevant attributes of the two objects directly.

2. The two sides of the equality operator are either of different types
   or of some base class type:

   The friend equality operators of CacheKeyBase call the virtual helper
   function to figure out whether the two objects are actually of the
   same type and if they are and this type is an implementation type
   then does the necessary type cast to get to 1.
2021-08-26 00:12:04 +02:00
Shane F. Carr
c12c5b5a4c ICU-21215 Add default constructor to FormattedNumberRange 2021-08-25 13:34:48 -05:00
Shane F. Carr
8cd025bb9c ICU-21590 Add skeleton syntax for integer-width-trunc 2021-08-25 12:51:51 -05:00
Markus Scherer
486e2d36ac ICU-21648 limit backslash-uhhhh escapes to ASCII hex digits 2021-08-25 17:04:24 +00:00
Markus Scherer
ae3f2ae7ca ICU-21663 UnicodeSetIterator: make protected members private 2021-08-25 00:51:14 +00:00
Shane F. Carr
e2acbdee22 ICU-21693 Fix FormattedNumber::toDecimalNumber on 0 2021-08-23 21:10:07 -05:00
Markus Scherer
41aa7159ea ICU-21635 Unicode 14 data files 20210820, line break LB30b.2
See #1807
2021-08-23 22:11:49 +00:00
Peter Edberg
7e07633997 ICU-21581 lingering exhaustive fail, need to separately skip sd_Deva month name exemplar test 2021-08-19 22:09:48 -07:00
Peter Edberg
f6d47953c0 ICU-21581 integrate CLDR release-40-alpha1 to ICU trunk 2021-08-19 15:27:38 -07:00
Shane F. Carr
4e01fba906 ICU-21358 Use sign position to format approximate numbers
See #1635
2021-08-19 16:18:19 -05:00
Peter Edberg
49dda34fb1 ICU-21581 integrate CLDR 40a0 to ICU trunk 2021-08-18 23:59:19 -07:00
Fredrik Roubert
51c11da869 ICU-20973 Delete unnecessary deprecated TimeUnitFormat::operator!=().
An operator!=() is already defined by a base class so this class doesn't
need to define it again and not doing so avoids a C++20 ambiguity.

This simplifies the code.
2021-08-19 01:22:48 +02:00
Frank Tang
126af02948 ICU-21729 Fix setLanguageTag doc matching Java 2021-08-18 16:16:10 -07:00
Rich Gillam
1ec55bf021 ICU-21202 Fixed error in alias resolution in uresbund.cpp. 2021-08-18 10:12:20 -07:00
Fredrik Roubert
88896d005d ICU-20973 Update configure files from configure.ac using autoreconf. 2021-08-17 00:35:00 +02:00
Fredrik Roubert
dc1b8ca26e ICU-20973 Use the Clang -Wno-ambiguous-reversed-operator flag. 2021-08-17 00:35:00 +02:00
Fredrik Roubert
2166b512fc ICU-20973 Manually resolve C++20 reversed argument order ambiguity. 2021-08-17 00:35:00 +02:00
Fredrik Roubert
633438f8da ICU-20973 Change all equality operator return types from UBool to bool. 2021-08-17 00:35:00 +02:00
Gregorio Litenstein
ee7468a565 ICU-21064 Fix declspec macro conflict on clang. 2021-08-16 11:06:46 -07:00
Jeff Genovy
744ca71663 ICU-21707 Fix LocaleBuilder assumption that the default locale doesn't have any BCP47 extension tags.
Add test case for LocaleBuilder with default locale with extensions.

Use Locale::getRoot().clone() instead of new Locale();

Add CI build bot with LANG that has extension tags
2021-08-12 16:58:12 -07:00
Myles C. Maxfield
e2fafa5840 ICU-21694 ubidi.h sample code would not compile 2021-08-04 11:47:05 -07:00
Rich Gillam
b03b8be741 ICU-21639 Added an internal utility class to streamline preflighting and heap-allocating a char buffer for a locale ID
and changed several internal methods in ULocale to use it, so that they work correctly on locale IDs that are longer
than ULOC_FULLNAME_CAPACITY.
2021-08-02 13:15:29 -07:00
Andy Heninger
c26aebe802 ICU-21662 Rename UVector::addElement().
This is the first step towards improving the error handling and out-of-memory
behavior of UVector::addElement(). A followup PR will add back a new addElement()
with corrected error handling, then additional followups will switch call sites
from the original (renamed) function to the new addElement().

This commit includes no logic or behavior changes; it only renames the existing functions.
2021-07-28 15:36:50 -07:00
Rich Gillam
dcfdaca46c ICU-21647 Added support for narrow quarter names to DateFormatSymbols and SimpleDateFormat. 2021-07-28 11:14:36 -07:00
David Haney
9ff39368b2 ICU-21680 Add -std flag when generating dependency information 2021-07-27 08:40:09 -04:00
Peter Edberg
eed6388dab ICU-21622 Add UYW to list of currencies from ucurr_openISOCurrencies() 2021-07-22 16:51:20 -07:00
Rich Gillam
d5cb201e52 ICU-21624 Fixed it so that a DecimalFormat no longer owns two separate DecimalFormatSymbols objects. 2021-07-22 14:17:19 -07:00
Rich Gillam
9a614752ce ICU-21584 Added code to allow regions of type "grouping" to return their children. 2021-07-22 12:55:50 -07:00