icu/testdata/message2/README.txt
Tim Chevalier 747d5eef3b ICU-22834 Update tests to reflect MF2 schema in conformance repo
This also updates the spec tests from the current version of the MFWG
repository and removes some duplicate tests.
Spec tests now reflect the message-format-wg repo as of
5612f3b050

It also updates both the ICU4C and ICU4J parsers to follow the
current test schema in the conformance repository.

This includes adding code to both parsers to allow `src` to be
either a single string or an array of strings (per
https://github.com/unicode-org/conformance/pull/255 ),
and eliminating `srcs` in tests.

It also includes other changes to make updated spec tests pass:

ICU4C: Allow trailing whitespace for complex messages, due to spec change
ICU4C: Parse number literals correctly in Number::format
ICU4J: Allow trailing whitespace after complex body, per spec change
ICU4C: Fix bug that was assuming an .input variable can't have a reserved annotation
ICU4C: Fix bug where unsupported '.i' was parsed as an '.input'
ICU4C/ICU4J: Handle markup with space after the initial left curly brace
ICU4C: Check for duplicate variant errors
ICU4C/ICU4J: Handle leading whitespace in complex messages
ICU4J: Treat whitespace after .input keyword as optional
ICU4J: Don't format unannotated number literals as numbers
2024-09-18 07:46:29 -07:00

43 lines
1.6 KiB
Text

© 2024 and later: Unicode, Inc. and others.
License & terms of use: http://www.unicode.org/copyright.html
The format of the JSON files in this directory and subdirectories
follow the test schema defined in the Conformance repository:
https://github.com/unicode-org/conformance/blob/main/schema/message_fmt2/testgen_schema.json
(as of https://github.com/unicode-org/conformance/pull/255 or later).
# JSON notes
In the "params" field, a date parameter can be expressed as:
{ "date": n }
where n is a number representing a Unix timestamp.
In the "params" field, a decimal string parameter can be expressed as:
{ "decimal": s }
where s is a string.
Optional fields, "ignoreJava" and "ignoreCpp" can be used
for tests currently expected to fail in the respective language.
The field may have any value; if it's
present, the test is ignored. (The value can be a comment explaining
why it's expected to fail.)
Tests in the `spec/` subdirectory are taken from https://github.com/unicode-org/message-format-wg/blob/main/test .
If the contents change upstream, then the corresponding tests in CLDR
need to be updated (also see https://unicode-org.atlassian.net/browse/ICU-22812 ).
## ICU4J only
The `cleanSrc` fields is used to represent normalized input (ICU4C has its
own function for normalizing input).
## ICU4C only
Additional "char" and "line" fields may be present with integer values,
used for tests expected to trigger a syntax error.
If present, "char" reflects the expected character offset and "line"
reflects the expected line number in the parse error.
The files with "diagnostics" in the name have these fields filled in.