Binary, octal, and hexadecimal literals and formatting #1968

clnhlzmn · 2020-09-18T03:07:36Z

Adds support for binary, octal, and hexadecimal literals.

…est error about documentation

josdejong · 2020-09-20T15:51:48Z

Looks really good and straightforward at first sight Colin! Thanks a lot 👍

I'll do some more indepth review soon. Maybe a first feedback: I think we should write a section about these new features in the docs, on the expression parser syntax page and/or maybe also on the numbers page, not sure.

clnhlzmn · 2020-09-20T17:54:21Z

Looks really good and straightforward at first sight Colin! Thanks a lot 👍

Thanks! I am working on some more improvements to this that I will push soon.

clnhlzmn · 2020-09-21T00:53:41Z

In the last commit I modified the way the literals are parsed and formatted. The literals are parsed as 32 bit 2s complement signed integers. This change causes the following comparisons to be true:

math.evaluate('0xffffffff') === -1
math.evaluate('0o37777777777') === -1
math.evaluate('0b11111111111111111111111111111111') === -1

This is contrary to the way these literals are treated in plain JS where they would be treated as unsigned integers, i.e.:

0xffffffff === 4294967296
0o37777777777 === 4294967296
0b11111111111111111111111111111111 === 4294967296

I believe it makes more sense to treat binary/octal/hex literals as signed integers because that's what the bitwise operators in mathjs and plain JS operate on.

Similarly the format functions bin, oct, and hex now will format -1 as '0b11111111111111111111111111111111', '0o37777777777', and '0xffffffff' respectively. Before the last commit the format functions just used Number.toString(base) without any additional modification which would format -1 as '-1'.

This commit also adds checking that a literal doesn't exceed the size of a 32 bit 2s complement signed integer in the parser and checks that the value to be formatted is an integer and is within the size of such integers. For example, math.evaluate('0x100000000') will throw an SyntaxError and math.hex(2**32) will also throw an error where before the last commit they would not.

I am not sure if it makes sense for this addition to behave this way or not.

Please let me know what you think @josdejong.

josdejong · 2020-09-23T13:28:43Z

I did some more thorough testing, and this really works like a charm. Nice job Christopher!

Some feedbacks:

Parsing as signed integers makes a lot of sense to me.
The checks for "overflow" and integer input are very nice! That prevents nasty surprises
I think we should still write a section in the docs of the expresssion about bin/oct/hex input and formatting.
Should we make the number of bits (currently 32) configurable? Or is this just the most common number of bits? 64 bits doesn't work for regular numbers, so it makes sense, though when using BigNumbers it would be possible. however, making this configurable and using BigNumber under the hood would also complicate matters quite a bit.

clnhlzmn · 2020-09-23T14:55:26Z

I did some more thorough testing, and this really works like a charm. Nice job Christopher!

It's Colin, but thanks!

1. Parsing as signed integers makes a lot of sense to me.

Me too.

2. The checks for "overflow" and integer input are very nice! That prevents nasty surprises

Thanks!

3. I think we should still write a section in the docs of the expresssion about bin/oct/hex input and formatting.

I will get on this as soon as I have a chance.

4. Should we make the number of bits (currently 32) configurable? Or is this just the most common number of bits? 64 bits doesn't work for regular numbers, so it makes sense, though when using BigNumbers it would be possible. however, making this configurable and using BigNumber under the hood would also complicate matters quite a bit.

I do think the number of bits should be configurable. I like the idea of using BigNumbers behind the scenes, but I understand that would be quite a larger undertaking (maybe?). I had an idea that if you use a literal, like 0xff, you get a signed 32 bit int (which could be a BigNumber or a regular number) by default. Maybe then there could be a configuration that changes that default behavior for literals. Then there would also be separate functions that would parse strings as different sized integers for example int8('0xff') would result in -1 and int16('0xff') would give you 255. Then the formatting functions would have to be extended with different versions for different sizes also. That doesn't seem like the best way to do it though.

I think ideally (and I'm pretty sure you mentioned this elsewhere) there would be separate types (Int8, Int16, etc.) behind the scenes and a literal like 0xff will give you Int32 by default and if you want something else you use a constructor with a string like int64('0xff'). Then the format functions would work with those types automatically. As far as I can tell, though, that would require an exponential amount of work in all the other math functions to make those sized types play nice.

josdejong · 2020-09-23T15:22:14Z

Sorry for mixing up your name Colin :(

Ok let's think through making number of bits configurable.

It may be quite easy to implement it as BigNumbers, we'll have to see.

I was also thinking: JavaScript does not have types like Int32, but instead of using BigNumber, we could also use BigInt. One issue though is that none of the mathjs functions yet supports BigInt, so we would need to implement something there first. And Internet Explorer doesn't support BigInt, so we would need to fall back to number and limit the max bits to 32 on this browser for example.

josdejong · 2020-09-23T15:24:15Z

Maybe we should support both number and BigNumber and simply respect the config option {number: 'number' | 'BigNumber'}. In case of number, we would have to limit the config option bits to 32.

clnhlzmn · 2020-09-23T15:31:23Z

I was thinking the types Int32 and friends would be mathjs types that would be implemented using BigNumber under the hood.

josdejong · 2020-09-23T15:43:04Z

Ah, I get what you mean. So we could make new custom data types for it like Int32, Int64, etc. This is possible but then we have to implement support for those types in all functions, which is a lot of work. "Just" using BigNumber/number (depending on config) would make it much easier since all functions already support those data types.

clnhlzmn · 2020-09-23T16:21:11Z

I added a small section in the syntax documentation.

josdejong · 2020-09-26T09:14:52Z

Thanks for adding docs, looks good.

I think this PR is ready to go, unless we want to directly implement support for configurable number of bits and integration with an other type like BigNumber or BigInt or anything. I think this first step will not conflict with those extensions so I expect we can do that safely as a separate, second iteration. What do you think?

clnhlzmn · 2020-09-26T14:06:29Z

I think this is a good first step too. I agree the other improvements can come later as a second step.

josdejong · 2020-09-26T15:45:02Z

👍 merging now.

Thanks again!

clnhlzmn · 2020-09-26T15:54:32Z

Thanks!

josdejong · 2020-09-26T17:03:54Z

Published now in v7.3.0 🎉

clnhlzmn added 3 commits September 17, 2020 22:46

allow binary, octal, and hex literals as in JS (0b, 0o, 0x)

730a077

add tests

9477a3a

fix lint issues

9227776

clnhlzmn mentioned this pull request Sep 18, 2020

Custom user literals support #909

Closed

add notation for binary, octal, and hex in formatNumber

f9dcc44

clnhlzmn changed the title ~~Feature/bin oct hex literals~~ Binary, octal, and hexadecimal literals and formatting Sep 18, 2020

clnhlzmn added 3 commits September 18, 2020 15:28

remove the extra format notations

69f1e01

add bin, oct, and hex functions for formatting

c45ae68

move bin, oct, and hex from base.js to their own files, fixed built t…

6ad6ede

…est error about documentation

parse and format treat values as 32 bit signed 2s complement integers

c1a39bd

josdejong mentioned this pull request Sep 23, 2020

Number Base conversions #95

Closed

add section in syntax documentation

3b3543c

typo

c5b8ef0

clnhlzmn marked this pull request as ready for review September 26, 2020 14:06

Merge branch 'develop' into feature/bin-oct-hex-literals

05ce721

josdejong merged commit f5d843b into josdejong:develop Sep 26, 2020

ovk mentioned this pull request Sep 29, 2020

Add BigNumber support for hex/oct/bin literals and functions #1982

Closed

clnhlzmn deleted the feature/bin-oct-hex-literals branch October 12, 2020 12:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Binary, octal, and hexadecimal literals and formatting #1968

Binary, octal, and hexadecimal literals and formatting #1968

clnhlzmn commented Sep 18, 2020

josdejong commented Sep 20, 2020

clnhlzmn commented Sep 20, 2020

clnhlzmn commented Sep 21, 2020 •

edited

Loading

josdejong commented Sep 23, 2020

clnhlzmn commented Sep 23, 2020

josdejong commented Sep 23, 2020

josdejong commented Sep 23, 2020

clnhlzmn commented Sep 23, 2020

josdejong commented Sep 23, 2020

clnhlzmn commented Sep 23, 2020

josdejong commented Sep 26, 2020

clnhlzmn commented Sep 26, 2020

josdejong commented Sep 26, 2020

clnhlzmn commented Sep 26, 2020

josdejong commented Sep 26, 2020

Binary, octal, and hexadecimal literals and formatting #1968

Binary, octal, and hexadecimal literals and formatting #1968

Conversation

clnhlzmn commented Sep 18, 2020

josdejong commented Sep 20, 2020

clnhlzmn commented Sep 20, 2020

clnhlzmn commented Sep 21, 2020 • edited Loading

josdejong commented Sep 23, 2020

clnhlzmn commented Sep 23, 2020

josdejong commented Sep 23, 2020

josdejong commented Sep 23, 2020

clnhlzmn commented Sep 23, 2020

josdejong commented Sep 23, 2020

clnhlzmn commented Sep 23, 2020

josdejong commented Sep 26, 2020

clnhlzmn commented Sep 26, 2020

josdejong commented Sep 26, 2020

clnhlzmn commented Sep 26, 2020

josdejong commented Sep 26, 2020

clnhlzmn commented Sep 21, 2020 •

edited

Loading