Fix ISO Latin 1 Encoding/Decoding issues #1219

jmschonfeld · 2025-03-19T17:14:55Z

As mentioned in #1216, the ISO Latin 1 encoding/decoding implemented natively in swift-foundation does not behave correctly. I accidentally referenced an incorrect code page table when writing the original implementation, which limited the supported characters to a subset of those actually supported. In reality, ISO Latin 1 is just the Unicode code points 0x00 through 0xFF encoded as single bytes. This means that encoding should simply emit each code point's value as one byte (and fail for any code point greater than 0xFF), and decoding can simply extend each byte to 16 bits and parse the result as UTF-16, since every byte is valid by definition.

This updates the implementation and adds some extra characters to the unit test to validate this behavior.
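
For illustration, the behavior described above can be sketched roughly as follows. This is a minimal standalone sketch, not the code in this PR: the names `latin1Encode` and `latin1Decode` are hypothetical helpers, and the actual swift-foundation implementation may be structured differently.

```swift
// Hypothetical helpers for illustration only; not the swift-foundation API.

/// Encodes a string as ISO Latin 1.
/// Returns nil if any Unicode scalar is greater than U+00FF.
func latin1Encode(_ string: String) -> [UInt8]? {
    var bytes: [UInt8] = []
    for scalar in string.unicodeScalars {
        // Only code points 0x00 through 0xFF are representable.
        guard scalar.value <= 0xFF else { return nil }
        bytes.append(UInt8(scalar.value))
    }
    return bytes
}

/// Decodes ISO Latin 1 bytes by widening each byte to a 16-bit
/// code unit and parsing the result as UTF-16. Every byte is valid.
func latin1Decode(_ bytes: [UInt8]) -> String {
    let codeUnits = bytes.map { UInt16($0) }
    return String(decoding: codeUnits, as: UTF16.self)
}

// Usage: U+00E9 fits in one byte, while U+20AC (euro sign) does not.
let encoded = latin1Encode("caf\u{E9}")      // [0x63, 0x61, 0x66, 0xE9]
let rejected = latin1Encode("\u{20AC}")      // nil
let decoded = latin1Decode([0x63, 0x61, 0x66, 0xE9])  // "caf\u{E9}"
```

Because every value from 0x00 through 0xFF is a non-surrogate UTF-16 code unit, decoding in this sketch cannot fail, and encoding the decoded result should return the original bytes.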

jmschonfeld · 2025-03-19T17:15:05Z

@swift-ci please test

Fix ISO Latin 1 Encoding/Decoding issues

a0bf741

jmschonfeld requested review from iCharlesHu, itingliu and parkera March 19, 2025 17:14

parkera approved these changes Mar 19, 2025

jmschonfeld merged commit 9ba455d into swiftlang:main Mar 19, 2025
3 checks passed

jmschonfeld deleted the fix-iso-latin1 branch March 19, 2025 21:24

jmschonfeld added a commit to jmschonfeld/swift-foundation that referenced this pull request Mar 19, 2025

Fix ISO Latin 1 Encoding/Decoding issues (swiftlang#1219)

7704a3a

jmschonfeld mentioned this pull request Mar 19, 2025

[6.1] Fix ISO Latin 1 Encoding/Decoding issues #1221

Merged

parkera pushed a commit that referenced this pull request Mar 21, 2025

Fix ISO Latin 1 Encoding/Decoding issues (#1219) (#1221)

7d4817b
