Skip to content

Commit

Permalink
CLDR-17830 Migration item (collation)
Browse files Browse the repository at this point in the history
  • Loading branch information
macchiati authored Sep 6, 2024
1 parent 6f15dc1 commit dde7a08
Showing 1 changed file with 23 additions and 16 deletions.
39 changes: 23 additions & 16 deletions docs/site/downloads/cldr-46.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,8 +8,9 @@ title: CLDR 46 Release Note
|:---:|:----------:|:---------:|:------:|:--------:|:------------:|:---:|:----------:|:---------:|:---------:|
| 46 | 2024-10-~~XX~~ | ~~[v46]()~~ | ~~[CLDR46](http://unicode.org/Public/cldr/46/)~~ | [Charts46](http://unicode.org/cldr/charts/dev) | [LDML46](http://www.unicode.org/reports/tr35/proposed.html) | [Δ46](https://unicode-org.atlassian.net/issues/?jql=project+%3D+CLDR+AND+status+%3D+Done+AND+resolution+%3D+Fixed+AND+fixVersion+%3D+%2246%22+ORDER+BY+priority+DESC) | ~~[release-46]()~~ | [ΔDtd46](https://www.unicode.org/cldr/charts/dev/supplemental/dtd_deltas.html) | ~~[46.0.0](https://github.com/unicode-org/cldr-json/releases/tag/46.0.0)~~ |

<span style="color:red; font-weight: bold;">This is an alpha version of the CLDR v46 data.</span>
The data is available at [release-46-alpha3](https://github.com/unicode-org/cldr/releases/tag/release-46-alpha3). Feedback is welcome via [tickets](https://github.com/unicode-org/cldr/blob/main/docs/requesting_changes.md#requesting-updates-to-locale-data-through-a-ticket). (The CLDR site is undergoing a migration to Markdown, so the UI for navigation is temporary.)
<span style="color:red; font-weight: bold;">This is an alpha version of CLDR v46.</span>

It only covers the data, which is available at [release-46-alpha3](https://github.com/unicode-org/cldr/releases/tag/release-46-alpha3). An update targeted at September 25 will include includes specification changes and fix other TBDs. Feedback is welcome via [tickets](https://github.com/unicode-org/cldr/blob/main/docs/requesting_changes.md#requesting-updates-to-locale-data-through-a-ticket). (The CLDR site is undergoing a migration to Markdown, so the UI for navigation is temporary.)

## Overview

Expand All @@ -18,7 +19,12 @@ CLDR data is used by all [major software systems](https://cldr.unicode.org/index
(including all mobile phones) for their software internationalization and localization,
adapting software to the conventions of different languages.

The largest changes in this release were the updates to Unicode 16.0, substantial additions of Emoji search keyword data, and ‘upleveling’ the locale coverage.
The most significant changes in this release were:

- Updates to Unicode 16.0 (including major changes to collation),
- Further revisions to the Message Format 2.0 tech preview,
- Substantial additions and modifications of Emoji search keyword data,
- ‘Upleveling’ the locale coverage.

### Locale Coverage Status
#### Current Levels
Expand Down Expand Up @@ -46,8 +52,6 @@ For a full listing, see [Coverage Levels](https://unicode.org/cldr/charts/46/sup

**TBD: Add the specification changes by Sept 25**

**Note: This will include Message Format 2.0 specification changes. A draft is at https://github.com/unicode-org/message-format-wg/tree/main/spec**

## Data Changes

### DTD Changes
Expand Down Expand Up @@ -79,28 +83,28 @@ For the reason why, see the algorithm in [Likely Subtags](https://cldr-smoke.uni
- Note: A fallback language is used when the user's primary language is unavailable,
and either the user doesn't have any secondaries language in their settings (as on Android or iOS) or those secondary languages are also not available.
As a result of this change, when the primary and secondary languages are not available, the fallback language would be the system default instead of Russian.
- Added the mapping `desired="scn" → supported="it"`.
- Changed the deprecated code `knn` to `gom`.
- Added the mapping `desired="scn" → supported="it"` (Sicilian → Italian).
- Changed the deprecated code Goan Konkani (`gom`) to Konkani (`kok`).
7. Transforms
1. Major update to `Han → Latn`, reflecting new data in Unicode 16.0
2. Fixes for Arabic numbers, and a Farsi vowel
8. Other Unicode 16.0 changes
1. Additional numbering systems
2. Additional scripts and script identifiers
3. ScriptMeta has been expanded for Unicode 16.0
9. Locale identifiers
9. Other updates
1. The subdivision identifiers have been updated to the latest available from ISO
- The removed identifiers have been deprecated
- Missing names have been added (from Wikidata)
2. The language subtags, script subtags, and variant subtags have been updated to the latest from IANA
- Some codes have been deprecated
3. Parent and DefaultContent mappings have been added for kaa and kok; DefaultContent mappings added for `kk`, `lld`, `ltg`, `mhn`, and `zh_Latn_CN`
10. Territory Info has been updated from World Bank and other sources: gdp, population, languages.
11. LanguageGroup info has been updated from Wikidata
12. Plural rules have been added for some new locales
13. Week data
- The first day of the week has been changed for `AE`
- Hour preferences (12 v 24) have been added for `en_H`K, `en_MY`, `en_IL`
- Some codes have been deprecated
3. Parent and defaultContent mappings have been added for Kara-Kalpak (`kaa`) and Konkani (`kok`); defaultContent mappings have bee added for Kazakh (`kk`), Ladin (`lld`), Latgalian (`ltg`), Mócheno (`mhn`), and Chinese (Latin, China) (`zh_Latn_CN`).
4. Territory Info (gdp, population, languages) has been updated from World Bank and other sources.
5. LanguageGroup info has been updated from Wikidata
6. Plural rules have been added for some new locales
7. Week data
- The first day of the week has been changed for `AE`
- Hour preferences (12 v 24) have been added for English as used in Hong Kong, Malaysia, and Israel (`en_HK`, `en_MY`, `en_IL`)

For a full listing, see [¤¤BCP47 Delta](https://unicode.org/cldr/charts/46/delta/bcp47.html) and [¤¤Supplemental Delta](https://unicode.org/cldr/charts/46/delta/supplemental-data.html)

Expand Down Expand Up @@ -183,6 +187,9 @@ There were the following new test files:

## Migration

1. Databases that use collation keys are sensitive to any changes in collation, and will need reindexing.
This can happen with any CLDR release (especially those for a new version of Unicode), but more characters are affected in this release: see above.

**TBD**

## [Known Issues](https://unicode-org.atlassian.net/issues/CLDR-17535?jql=project%20%3D%20cldr%20and%20labels%20%3D%20%22ReleaseKnownIssue%22%20and%20status%20!%3D%20done)
Expand Down

0 comments on commit dde7a08

Please sign in to comment.