Simplify parsing numeric arguments with predefined actions #63

lichray · 2019-11-23T02:40:31Z

Currently, we have no predefined actions at all (other than identity), and this is an obvious use case to improve. Asking users to write lambdas to parse raises the bar for use, and difficult to upgrade to a <charconv> future. For the most common inputs, users should be able to express what do they expect rather than how do they handle.

Python's argparse provides type=int and type=float, which do not take hexadecimal inputs and are lack of support for types of different ranges (int, long, etc.) We need to able to express both for C++ applications.

I propose to add a .scan method to the fluent interface, inspired by scanf. I took the name from scnlib. Usage looks like the following:

program.add_argument("-x")
       .scan<'d', int>();
program.add_argument("scale")
       .scan<'g', double>();

The non-type template argument specifies how the input "looks like" (where I call it shape), and the type template argument specifies what the return value of the predefined action is. The acceptable types are:

floating point: float, double, long double
integral (+ make_unsigned): signed char, short, int, long, long long

and the acceptable shapes are:

floating point:
- 'a': hexadecimal floating-point
- 'f': fixed form only
- 'e': scientific form only
- 'g': general form (either fixed or scientific)
integral:
- 'd': decimal
- 'u': decimal, requires an unsigned type
- 'o': octal, requires an unsigned type
- 'x': hexadecimal, requires an unsigned type
- 'i': anything that from_chars's grammar accepts in base == 0

The exact grammar should follow from_chars's requirement. But our first implementation may still use strtol and strtod. When encounters errors, throw exceptions similar to what stoi and stod do.

FAQ

Can I omit the type? No.
Can I omit the shape? Not for these. from_chars and strto? default to parse anything, but Python's type=int and type=float only parse decimal. I'm not sure whether we can agree on a default. But when extending this for other types in the future, we may.
How to extend this to other common types? Let's keep an eye on how the Text parsing proposal evolves.
Why using non-type template parameters? So that we can select predefined actions at compile-time and go straight assigning one to mAction in each call to scan.
Can auto non-type template parameter help? Sadly no. .scan<int('d')>() is okay but .scan<(unsigned long)'x'> is terrible.
Can we support uppercase shapes like 'X'? What do you expect them to do? I guess letting them behave as same as the lowercase counterparts is the only reasonable answer. If we agree on that, I'm okay with it.

The text was updated successfully, but these errors were encountered:

p-ranav · 2019-11-25T01:52:16Z

Your proposal sounds good to me.

fixes: p-ranav#63

lichray added a commit to lichray/argparse that referenced this issue Nov 26, 2019

Parse floating-point numbers in .scan

00bdf13

fixes: p-ranav#63

lichray added a commit to lichray/argparse that referenced this issue Nov 26, 2019

Parse floating-point numbers in .scan

e8a44d2

fixes: p-ranav#63

lichray mentioned this issue Nov 26, 2019

Simplify parsing numeric arguments with .scan #65

Merged

p-ranav closed this as completed in #65 Nov 26, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplify parsing numeric arguments with predefined actions #63

Simplify parsing numeric arguments with predefined actions #63

lichray commented Nov 23, 2019

p-ranav commented Nov 25, 2019

Simplify parsing numeric arguments with predefined actions #63

Simplify parsing numeric arguments with predefined actions #63

Comments

lichray commented Nov 23, 2019

FAQ

p-ranav commented Nov 25, 2019