mirror of
https://github.com/unicode-org/icu.git
synced 2025-04-10 15:42:14 +00:00
1. Add GA to test BreakIterator under LSTM configuration (remove Thai and Burmese dictionary and include Thai and Burmese LSTM) 2. Add LSTMDataName for the purpose of testing. 3. Add file base test code to test BreakIterator match results from test file generated by pythong code in https://github.com/unicode-org/lstm_word_segmentation/blob/master/segment_text.py 4. Fix a LSTMBreakEngine::divideUpDictionaryRange bug when the return value should only contains the number of words found when the passed in foundBreaks already contains some data. 5. Change the cintltest TestSwapData from testing thaidict to laodict so it will not break while we filter out thaidict under the LSTM configuration. |
||
---|---|---|
.. | ||
workflows | ||
lstm_for_th_my.json | ||
pull_request_template.md |