Refactor expression lexers and specialise parsers #42

jg-rp · 2022-01-23T09:36:32Z

Both template and expression lexing functions are currently defined in liquid.lex, and parsers for template tags and tag expressions are bundled into liquid.parse. Moreover, all tag expressions are parsed through liquid.parse.ExpressionParser.parse_expression(), which handles liquid identifiers, loops and boolean expressions.

For reasons of easier maintenance and potential improvements in performance, I intend to move and refactor each of the expression lexers into their own package, along with a specialised parser and independent TokenStream (independent from the top-level token stream).

Built-in tags will transition to use these new parsers now, via liquid.Environment.parse_*_expression_value functions. Existing tokenize* functions and the ExpressionParser will be maintained until at least Python Liquid version 2.0, which is quite some time away, for those who use them in custom tags.

Some possible optimisations that can be realised include:

Lexers that yield tuples rather than NamedTuples. Benchmarks show the former to be faster.
Lexers that recognise identifiers with bracketed indexes and string literals. Doing this with regular expressions in the lexer will be much faster than stepping through a token stream in the parser.
Don't do unnecessary infix operator parsing when handling loop or output expressions. They don't have any infix operators.
Don't do unnecessary token precedence look-ups when handling expression that don't have any precedence rules. Only boolean expression use precedence rules.
Remove unnecessary prefix parsing. No built-in tag expression uses prefix operators. Negative numbers can be handled during tokenization.

The text was updated successfully, but these errors were encountered:

jg-rp added the enhancement New feature or request label Jan 23, 2022

jg-rp self-assigned this Jan 23, 2022

jg-rp added a commit that referenced this issue Jan 23, 2022

Refactor expression lexers and specialize parsers. See #42, #39.

c56f703

jg-rp closed this as completed in fbf6b32 Jan 24, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor expression lexers and specialise parsers #42

Refactor expression lexers and specialise parsers #42

jg-rp commented Jan 23, 2022 •

edited

Loading

Refactor expression lexers and specialise parsers #42

Refactor expression lexers and specialise parsers #42

Comments

jg-rp commented Jan 23, 2022 • edited Loading

jg-rp commented Jan 23, 2022 •

edited

Loading