Lexical syntax simplification #90

emberian · 2014-05-24T17:47:38Z

emberian · 2014-05-24T17:48:49Z

Another benefit is that the lexer's output can be just spans and their associated token types, without the lexer having to do any further work.
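To make the "spans plus token type" idea concrete, here is a minimal sketch in Rust. It is not the compiler's lexer: `TokenKind`, `Span`, `Token`, and `lex_spans` are hypothetical names invented for illustration, and the token classes are deliberately tiny. The point is only that the lexer records where each token is and what kind it is, without interpreting the text (for example, it never computes the numeric value of an integer).

```rust
/// Hypothetical token kinds; a real lexer would have many more.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum TokenKind {
    Ident,
    Integer,
    Whitespace,
    Other,
}

/// Half-open byte range `[start, end)` into the source text.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Span {
    pub start: usize,
    pub end: usize,
}

/// A token is only a kind and a span; no value is parsed or stored.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Token {
    pub kind: TokenKind,
    pub span: Span,
}

/// Decide the token kind from the first character alone.
fn classify(c: char) -> TokenKind {
    if c.is_ascii_alphabetic() || c == '_' {
        TokenKind::Ident
    } else if c.is_ascii_digit() {
        TokenKind::Integer
    } else if c.is_whitespace() {
        TokenKind::Whitespace
    } else {
        TokenKind::Other
    }
}

/// Split `src` into tokens, recording only kinds and byte spans.
pub fn lex_spans(src: &str) -> Vec<Token> {
    let mut tokens = Vec::new();
    let mut chars = src.char_indices().peekable();
    while let Some((start, c)) = chars.next() {
        let kind = classify(c);
        let mut end = start + c.len_utf8();
        // Extend the span while the next character continues this token.
        while let Some(&(i, next)) = chars.peek() {
            let continues = match kind {
                TokenKind::Ident => next.is_ascii_alphanumeric() || next == '_',
                TokenKind::Integer => next.is_ascii_digit() || next == '_',
                TokenKind::Whitespace => next.is_whitespace(),
                TokenKind::Other => false,
            };
            if !continues {
                break;
            }
            end = i + next.len_utf8();
            chars.next();
        }
        tokens.push(Token { kind, span: Span { start, end } });
    }
    tokens
}
```

A consumer that needs the text of a token can slice the original source with the span, e.g. `&src[tok.span.start..tok.span.end]`.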

lilyball · 2014-05-24T18:38:37Z

active/0000-lexical-syntax-simplification.md

+
+    LIT_STR_RAW
+      : 'r' LIT_STR_RAW_INNER
+      | 'r' '"' .*? '"'


This can't just be 'r' LIT_STR_RAW_INNER2? (and the inner tokens should probably be swapped).

Indeed it can.

emberian · 2014-05-24T19:44:39Z

This needs to take into account rust-lang/rust#14400 still.

chris-morgan · 2014-05-25T02:53:15Z

active/0000-lexical-syntax-simplification.md

+      ;
+
+    LIT_FLOAT
+      : [0-9][0-9_]* ('.' [0-9][0-9_]*)? ([eE] [-+]? [0-9][0-9]*)? FLOAT_SUFFIX?


The exponent [0-9]* part—should it be [0-9_]*?

Also I think this will be tightening what is accepted; at present, for example, 1. is acceptable (but not 1.f32 for clear reasons), but this change will break that. Is that deliberate? Desirable? &c.

Good catch. That was not deliberate; I didn't mean to change the float literal syntax at all.
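Both points raised above can be checked with a small hand-written matcher for the rule as quoted in the diff. This is a sketch, not the ANTLR grammar itself: `match_float_as_quoted` and `digits` are hypothetical helper names, and `FLOAT_SUFFIX` is left out because its definition is not shown in this excerpt. The function returns how many bytes of the input the quoted rule would consume.

```rust
/// Match `[0-9]` followed by `[0-9]*` (or `[0-9_]*` when underscores are
/// allowed), starting at byte `i`. Returns the index just past the match.
fn digits(b: &[u8], mut i: usize, allow_underscore: bool) -> Option<usize> {
    if i >= b.len() || !b[i].is_ascii_digit() {
        return None;
    }
    i += 1;
    while i < b.len() && (b[i].is_ascii_digit() || (allow_underscore && b[i] == b'_')) {
        i += 1;
    }
    Some(i)
}

/// Length of the prefix of `s` matched by the quoted rule, minus the suffix:
/// `[0-9][0-9_]* ('.' [0-9][0-9_]*)? ([eE] [-+]? [0-9][0-9]*)?`
pub fn match_float_as_quoted(s: &str) -> Option<usize> {
    let b = s.as_bytes();
    let mut end = digits(b, 0, true)?;
    // Optional fraction: the '.' is only consumed if a digit follows it.
    if end < b.len() && b[end] == b'.' {
        if let Some(e) = digits(b, end + 1, true) {
            end = e;
        }
    }
    // Optional exponent: as written, underscores are not allowed here.
    if end < b.len() && (b[end] == b'e' || b[end] == b'E') {
        let mut j = end + 1;
        if j < b.len() && (b[j] == b'+' || b[j] == b'-') {
            j += 1;
        }
        if let Some(e) = digits(b, j, false) {
            end = e;
        }
    }
    Some(end)
}
```

With this matcher, `1.` is consumed only up to the `1`, so the trailing dot is not part of the literal, and `1e1_0` stops before the underscore in the exponent.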

pcwalton · 2014-05-25T18:52:12Z

+1, sounds like an improvement for rustfmt

emberian · 2014-05-25T21:57:30Z

@kballard is the CRLF stuff correct? I extended the places that accept newline to also accept '\r\n', but not '\r', and I've removed '\r' from the whitespace skipping.

lilyball · 2014-05-25T22:17:34Z

@cmr My patch actually allows bare '\r' in whitespace skipping and in non-doc comments. It only rejects it inside of strings and doc comments. That said, I don't know if it's worth trying to be that permissive. It may be better just to go ahead and treat a bare '\r' without a subsequent '\n' as a hard error anywhere in the file.
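The stricter option mentioned at the end, rejecting any `'\r'` that is not immediately followed by `'\n'`, could be sketched as a pre-lexing check like the one below. The function name `find_bare_cr` is hypothetical and this is not the code from the patch under discussion; it only illustrates the rule.

```rust
/// Return the byte offset of the first '\r' that is not followed by '\n',
/// or `None` if every carriage return is part of a "\r\n" pair.
pub fn find_bare_cr(src: &str) -> Option<usize> {
    let bytes = src.as_bytes();
    bytes.iter().enumerate().find_map(|(i, &b)| {
        if b == b'\r' && bytes.get(i + 1) != Some(&b'\n') {
            Some(i)
        } else {
            None
        }
    })
}
```

A lexer taking the "hard error anywhere in the file" approach could run this once up front and report the returned offset, instead of deciding case by case inside strings, comments, and whitespace.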

lilyball · 2014-05-25T22:21:48Z

active/0000-lexical-syntax-simplification.md

+      ;
+
+    LIT_CHAR
+      : '\'' ( '\\' CHAR_ESCAPE | [^'\n\t\r] ) '\''


Shouldn't [^'\n\t\r] be ~['\n\t\r]?

Also the character set needs to include \\ or else invalid character escapes will end up matching anyway.

zwarich · 2014-05-27T08:22:40Z

This grammar is wildly ambiguous. Identifiers, numbers and operators can be tokenized in multiple ways.

huonw · 2014-05-27T08:29:24Z

@zwarich do you have an example of an ambiguous sequence of tokens?

emberian · 2014-05-27T08:30:21Z

"12.12" could be INTEGER(12) DOT INTEGER(12)

emberian · 2014-05-27T08:31:32Z

etc. will be relatively easy to fix.

lilyball · 2014-05-27T08:49:08Z

Doesn't antlr4 pick the longest matching token?

huonw · 2014-05-27T09:22:17Z

(Yeah, isn't the maximal munch principle the standard way to resolve "ambiguities" like this?)

zwarich · 2014-05-27T17:57:39Z

@kballard @huonw Yes, that is the standard way of resolving ambiguities with lexical syntax, and apparently the 'lexer grammar' feature of ANTLR makes it choose this strategy, as opposed to what it uses for normal grammars.
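For readers unfamiliar with maximal munch, the following sketch shows how it resolves the `12.12` case raised earlier. It is a toy with three hypothetical rules (`match_integer`, `match_float`, `match_dot`), not ANTLR's implementation: at each position every rule is tried, and the longest match wins.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Kind {
    Integer,
    Float,
    Dot,
}

/// Number of consecutive ASCII digits in `b` starting at `start`.
fn digit_run(b: &[u8], start: usize) -> usize {
    b[start..].iter().take_while(|c| c.is_ascii_digit()).count()
}

/// Each rule returns the length of its match at the start of `b` (0 = no match).
fn match_integer(b: &[u8]) -> usize {
    digit_run(b, 0)
}

fn match_float(b: &[u8]) -> usize {
    let int_len = digit_run(b, 0);
    if int_len == 0 || b.get(int_len) != Some(&b'.') {
        return 0;
    }
    let frac_len = digit_run(b, int_len + 1);
    if frac_len == 0 { 0 } else { int_len + 1 + frac_len }
}

fn match_dot(b: &[u8]) -> usize {
    if b.first() == Some(&b'.') { 1 } else { 0 }
}

/// Tokenize by maximal munch; returns `None` if no rule matches somewhere.
pub fn tokenize(src: &str) -> Option<Vec<(Kind, &str)>> {
    let mut out = Vec::new();
    let mut pos = 0;
    while pos < src.len() {
        let rest = &src.as_bytes()[pos..];
        let candidates = [
            (Kind::Integer, match_integer(rest)),
            (Kind::Float, match_float(rest)),
            (Kind::Dot, match_dot(rest)),
        ];
        // Keep the longest match; on a tie the earlier rule is kept.
        let mut best: Option<(Kind, usize)> = None;
        for &(kind, len) in candidates.iter() {
            if len > 0 && best.map_or(true, |(_, l)| len > l) {
                best = Some((kind, len));
            }
        }
        let (kind, len) = best?;
        out.push((kind, &src[pos..pos + len]));
        pos += len;
    }
    Some(out)
}
```

Under this strategy `12.12` becomes a single `Float` token, because the float rule matches five bytes while the integer rule matches only two; the `Integer Dot Integer` reading is never chosen.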

anasazi · 2014-05-27T20:26:20Z

I like the idea of keeping comments after lexing so pretty-printers / refactoring tools can use the same lexer as the compiler, but how about we just make comment dropping a micropass between the lexer and parser instead of adding to the parser workload?
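The micropass suggested here might look roughly like the sketch below: the lexer keeps comment tokens, tools such as pretty-printers consume the full stream, and the parser is fed a filtered view. `Tok` and `drop_comments` are hypothetical names for illustration, not types from the compiler.

```rust
/// A deliberately small, hypothetical token type.
#[derive(Debug, Clone, PartialEq, Eq)]
pub enum Tok {
    Comment(String),
    Ident(String),
    Punct(char),
}

/// Micropass between lexer and parser: forward every token except comments.
pub fn drop_comments<I>(tokens: I) -> impl Iterator<Item = Tok>
where
    I: IntoIterator<Item = Tok>,
{
    tokens.into_iter().filter(|t| !matches!(t, Tok::Comment(_)))
}
```

Because the pass is an iterator adapter, it adds no work to the parser itself and can simply be skipped by tools that want the comments.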

emberian · 2014-05-27T20:32:35Z

Sure, whatever.

emberian · 2014-05-27T21:04:25Z

Fixed most things, and verified that it works as I expect.

emberian · 2014-05-29T01:48:19Z

cc @nikomatsakis @pcwalton @brson I've updated this. It behaves as I expect for the code I've run it against, and accepts/rejects everything it should in the compiler/libs/testsuite/servo.

brson · 2014-05-29T02:15:08Z

Accepted as RFC 21 per https://github.com/mozilla/rust/wiki/Meeting-weekly-2014-05-27#lexer-changes

Lexical syntax simplification

26b5257

brson mentioned this pull request May 29, 2014

Tracking issue for RFC 21 - lexical syntax simplification rust-lang/rust#14504

Closed

brson merged commit 26b5257 into rust-lang:master May 29, 2014

withoutboats pushed a commit to withoutboats/rfcs that referenced this pull request Jan 15, 2017

Merge pull request rust-lang#90 from simbiont666/fix-tutorial

9337886

Fix typo in tutorial

Centril added the A-syntax Syntax related proposals & ideas label Nov 23, 2018

wycats pushed a commit to wycats/rust-rfcs that referenced this pull request Mar 5, 2019

Merge pull request rust-lang#90 from ember-cli/addon-tree-caching

ca1fe9a

RFC for caching results of `treeFor` hook
