tr1/text: improve text handling in TR1 #1933

rr- · 2024-11-21T00:50:06Z

Checklist

I have read the coding conventions
I have added a changelog entry about what my pull request accomplishes, or it is an internal change

Description

This pull request extends the named sequences support we have introduced in 4.6 to also support Unicode strings. I have confirmed the following languages to have full coverage:


Basque	Irish
Belarusian	Italian
Bosnian	Latvian
Bulgarian	Lithuanian
Catalan	Macedonian
Croatian	Malay
Czech	Maltese
Danish,	Northern Sami
Dutch	Norwegian
English	Polish,
Estonian	Portuguese
Faroese	Romanian
Finnish	Russian
French	Serbian
Galician	Slovak
German,	Slovenian
Greek	Spanish
Hungarian	Swedish
Icelandic	Turkish
Indonesian	…and possibly more.

Asian and Arabic languages remain unsupported at the moment. While Arabic is still far away due to the RTL rendering order, we're much closer to supporting CJK.
The sprites for the characters come from Arsunt's extended font posted in Tomb Raider Forums.

Pivotal for this feature is a textfile containing manual Unicode codepoint mappings. Although I initially experimented with JSON, YAML, and CSV, I discovered through testing that using a DSL (domain-specific language) designed specifically for this purpose offers the best readability.
The mapping file is used by the tooling in the tools/glyphs/ directory and serves two roles:

Hardcoding Unicode to sprite mapping
It generates C macros that map Unicode code points and escaped sequences to O_ALPHABET's sprite indices, specify glyph dimensions, and instruct how to compose compound characters - all getting hardcoded into the executable.
Guidance for font.bin creation
It directs the injector tool in creating the font.bin file that contains O_ALPHABET sprite bitmaps, along with additional positional information.

Some sprite indices are fixed. This is for compatibility with the original game to retains original text format even if font.bin goes missing.
Creating sprites for all possible accented characters is a challenging and resource-intensive task. Instead, the mapping allows us to combine certain characters so that the game overlays one glyph on another. However, we only support one accent per glyph. Consequently, Vietnamese, despite using the Latin alphabet, is currently unsupported due to its extensive use of diacritics.

As we now have many more glyphs to compare, the time-consuming O(n^2) loop that matched user string characters with all possible glyphs has been replaced by uthash lookups for faster glyph retrieval. This approach requires precise knowledge of glyph sizes, necessitating some additional parsing, but it benefits from eliminating ambiguity in glyph matches. An additional benefit is improved handling of Unicode codepoints without declared mappings: by traversing entire codepoints rather than incrementing the pointer by 1 byte, the process avoids ending up in the middle of an incomplete UTF-8 codepoint, preventing garbled text.

github-actions · 2024-11-21T00:53:17Z

Download the built assets for this pull request:

aredfan · 2024-11-24T18:08:14Z

Overall LGTM. The only issue I found is the TRUB gameflow doesn't point to the font.bin file, which causes this issue.

Resolves #386, #636, #1928 and #1919.

The time-consuming O(n^2) loop that compared user string characters with all possible glyphs has been replaced by `uthash` lookups for improved glyph lookup speed. This requires precise glyph size knowledge, which involves some additional parsing. An added benefit is improved handling of unknown Unicode glyphs: by moving through entire codepoints rather than incrementing the pointer by 1 byte, the process avoids ending in the middle of an incomplete UTF-8 codepoint.

rr- · 2024-11-24T18:26:40Z

@aredfan fixed and made sure that we do not attempt to draw unavailable sprites.

data/tr1/ship/cfg/TR1X_gameflow.json5

lahm86

Looks fantastic, thank you for doing this.
Noticed a small issue with scaling in the details menu, but it's on develop too - will raise a separate ticket.

rr- added Feature New functionality TR1 labels Nov 21, 2024

rr- self-assigned this Nov 21, 2024

rr- force-pushed the glyphs branch 2 times, most recently from c76de3f to 441aa28 Compare November 24, 2024 16:58

rr- changed the title ~~tr1/text: experimental support for injected glyphs~~ tr1/text: improve text handling in TR1 Nov 24, 2024

rr- force-pushed the glyphs branch from 441aa28 to ab3738f Compare November 24, 2024 17:20

rr- marked this pull request as ready for review November 24, 2024 17:26

rr- requested review from a team as code owners November 24, 2024 17:26

rr- requested review from lahm86, walkawayy and aredfan and removed request for a team November 24, 2024 17:26

rr- added 2 commits November 24, 2024 19:25

tr1/text: support Unicode glyphs

599e735

Resolves #386, #636, #1928 and #1919.

rr- force-pushed the glyphs branch from ab3738f to adefb55 Compare November 24, 2024 18:26

walkawayy reviewed Nov 24, 2024

View reviewed changes

data/tr1/ship/cfg/TR1X_gameflow.json5 Show resolved Hide resolved

walkawayy approved these changes Nov 24, 2024

View reviewed changes

aredfan approved these changes Nov 24, 2024

View reviewed changes

lahm86 approved these changes Nov 24, 2024

View reviewed changes

lahm86 mentioned this pull request Nov 24, 2024

Create TR1 font builder LostArtefacts/TRXInjectionTool#10

Merged

This was linked to issues Nov 24, 2024

Some symbols are not displayed in the game #1928

Closed

Accents in Spanish words #1919

Closed

rr- merged commit a8d4af8 into develop Nov 24, 2024
7 checks passed

rr- deleted the glyphs branch November 24, 2024 19:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tr1/text: improve text handling in TR1 #1933

tr1/text: improve text handling in TR1 #1933

rr- commented Nov 21, 2024 •

edited

Loading

github-actions bot commented Nov 21, 2024 •

edited

Loading

aredfan commented Nov 24, 2024

rr- commented Nov 24, 2024

lahm86 left a comment

tr1/text: improve text handling in TR1 #1933

tr1/text: improve text handling in TR1 #1933

Conversation

rr- commented Nov 21, 2024 • edited Loading

Checklist

Description

github-actions bot commented Nov 21, 2024 • edited Loading

aredfan commented Nov 24, 2024

rr- commented Nov 24, 2024

lahm86 left a comment

Choose a reason for hiding this comment

rr- commented Nov 21, 2024 •

edited

Loading

github-actions bot commented Nov 21, 2024 •

edited

Loading