Feature: handle tracef's `%c` as unicode code point #411

zetanumbers · 2022-02-27T23:00:52Z

Previously, the runtime converted the `%c` argument to a UTF-16 char code, so Unicode characters above U+FFFF were truncated. With this change, any Unicode code point can be passed.
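For illustration, here is a minimal sketch of the difference between the two conversions, assuming the old behavior corresponds to JavaScript's `String.fromCharCode` and the new one to `String.fromCodePoint`. The sample code point is arbitrary and is not taken from the PR diff.

```typescript
// U+1F600 lies above the 16-bit range, so it needs two UTF-16 code units.
const codePoint = 0x1f600;

// String.fromCharCode keeps only the low 16 bits of its argument (0xF600 here).
const truncated = String.fromCharCode(codePoint);

// String.fromCodePoint encodes the full code point as a surrogate pair.
const full = String.fromCodePoint(codePoint);

console.log(truncated.charCodeAt(0).toString(16)); // f600
console.log(full.codePointAt(0)?.toString(16)); // 1f600
console.log(full.length); // 2
```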

aduros · 2022-03-01T11:03:45Z

We should probably match the same behavior as C's printf, which I think truncates to 8 bits for %c.

For me this program:

printf("Hello %c\n", 12345678);

Prints Hello N.
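The `N` follows from C converting the `%c` argument to `unsigned char`, which keeps the low 8 bits on typical platforms: 12345678 is 0xBC614E, and 0x4E is `N`. A rough sketch of that truncation, where `formatCharLikeC` is a hypothetical helper and not part of the runtime:

```typescript
// Hypothetical helper: mimic C's conversion of the %c argument to unsigned char
// by keeping only the low 8 bits of the value.
function formatCharLikeC(value: number): string {
    return String.fromCharCode(value & 0xff);
}

console.log((12345678).toString(16)); // bc614e
console.log("Hello " + formatCharLikeC(12345678)); // Hello N
```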

zetanumbers · 2022-03-01T13:16:47Z

> We should probably match the same behavior as C's printf, which I think truncates to 8 bits for %c.

But why? It's not like we are trying to implement libc. With this PR we would be able to pass Rust's `char`, for example.

zetanumbers · 2022-03-01T13:30:55Z

Btw if we truncate, should we truncate to 7 bits for ASCII, or truncate to 8 bits and allow some UTF-16 char codes? Aren't non-ASCII characters for printf OS dependent?

aduros · 2022-03-02T14:02:56Z

Could we truncate to 8 bits? libc printf semantics aren't perfect, but at least they're well-defined and we don't need to document our own special handling of certain features.

For printing unicode characters, isn't it possible to use %s instead of %c? Or just format the string directly in Rust.

zetanumbers · 2022-03-04T08:59:06Z

> Could we truncate to 8 bits? libc printf semantics aren't perfect, but at least they're well-defined and we don't need to document our own special handling of certain features.

Until we truncate to 8 bits, and even after we do, could we at least handle non-ASCII chars as Unicode code points instead of UTF-16 char codes?

zetanumbers · 2022-03-04T09:29:39Z

> For printing unicode characters, isn't it possible to use %s instead of %c? Or just format the string directly in Rust.

The current `%s` implementation only works on null-terminated ASCII strings.

https://github.com/aduros/wasm4/blob/main/runtimes/web/src/runtime.ts#L272
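To show why a byte-per-character reader breaks on non-ASCII text, here is a simplified sketch. `readCString` is a hypothetical stand-in, not the code at the linked line; it assumes the reader maps each byte to one char code until it hits a zero byte.

```typescript
// Hypothetical reader: treats every byte as one character until a 0 byte.
function readCString(memory: Uint8Array, ptr: number): string {
    let out = "";
    for (let i = ptr; i < memory.length && memory[i] !== 0; i++) {
        out += String.fromCharCode(memory[i]);
    }
    return out;
}

const ascii = new Uint8Array([0x48, 0x69, 0x00]); // "Hi" followed by a null byte
const utf8 = new Uint8Array([0xc3, 0xa9, 0x00]); // "é" encoded as UTF-8, then a null byte

console.log(readCString(ascii, 0)); // Hi
console.log(readCString(utf8, 0)); // Ã© (two characters instead of one)
```

ASCII survives because each character is a single byte below 0x80; the two UTF-8 bytes of `é` come out as two separate characters.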

To do the equivalent of tracef manually in Rust, you would:

1. Create an empty String.
2. Gradually write substrings, numbers, etc. to it. Meanwhile the String grows (reallocates), gradually increasing its capacity.
3. Flush the whole string as a single line via traceUtf8.
4. Deallocate the String.

This pulls some runtime code (~7 KiB with all code optimizations enabled) into the binary. It could have been smaller (only ~2 KiB) if there were a way to flush the line in parts, which would require no allocations.

feat: handle tracef %c as unicode code point in web runtime

56229a9

zetanumbers mentioned this pull request Mar 1, 2022

tracef!() zetanumbers/wasm4-rs#4

Open
