Use split_parser branch for markdown grammar #3108

MDeiml · 2022-07-19T11:45:14Z

Only a draft because the markdown_inline grammar is not injected properly, but to me this looks like a problem with helix rather than with the language config. After a few experiments I think the grammar is injected, but highlighting is still not applied according to the injected language. I'm not sure, though.

Another weird behavior is with fenced code blocks. They seem to work to some degree, but if you enter something like

```rust
let foo = "bar";
```

then `let` is highlighted correctly while `"bar"` is not highlighted at all.

MDeiml · 2022-07-21T06:42:46Z

Changing this to ready since I found a solution to the problems in #3129.

the-mikedavis · 2022-07-28T11:34:20Z

I tried this branch locally but I don't see any qualitative improvement in performance (we'll need some measurements like those from #940 to be definitive). Specifically, I see the same slowness with incremental updates while editing (for example, typing quickly in insert mode) on a reasonably large and complicated markdown document like the changelog. Is this branch of the grammar(s) more correct than master? This doesn't feel like a regression, so I'm certainly not against merging it, but I'm curious what the improvement from splitting the parsers is.

MDeiml · 2022-07-28T14:21:59Z

I'm no longer maintaining the master branch; I only kept it because some projects seem to depend on it.

It was not really possible to implement a sufficiently correct markdown parser in a single grammar. The spec also recommends parsing markdown in two passes. Now that the parser is split into two grammars, they work closer to that recommendation.
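To illustrate the two-pass idea, here is a rough sketch. It is not the tree-sitter-markdown grammar and it is nowhere near spec-compliant: `Block`, `block_pass` and `inline_pass` are names made up for this example. The first pass only finds block boundaries; the second pass looks for inline syntax (here just `*...*` pairs) inside paragraphs and never inside fenced code.

```rust
use std::ops::Range;

const FENCE: &str = "```";

/// A block found by the first pass. Ranges are byte offsets into the source.
#[derive(Debug, PartialEq)]
enum Block {
    /// The lines between an opening and a closing fence.
    FencedCode(Range<usize>),
    /// Any other run of non-blank lines.
    Paragraph(Range<usize>),
}

/// Pass 1: split the document into blocks without looking at inline syntax.
fn block_pass(src: &str) -> Vec<Block> {
    let mut blocks = Vec::new();
    let mut offset = 0; // byte offset of the current line
    let mut para_start: Option<usize> = None;
    let mut code_start: Option<usize> = None;

    for line in src.split_inclusive('\n') {
        let end = offset + line.len();
        let is_fence = line.trim_start().starts_with(FENCE);
        if let Some(start) = code_start {
            // Inside a fenced block: only a closing fence matters.
            if is_fence {
                blocks.push(Block::FencedCode(start..offset));
                code_start = None;
            }
        } else if is_fence {
            // An opening fence also ends any paragraph in progress.
            if let Some(start) = para_start.take() {
                blocks.push(Block::Paragraph(start..offset));
            }
            code_start = Some(end);
        } else if line.trim().is_empty() {
            if let Some(start) = para_start.take() {
                blocks.push(Block::Paragraph(start..offset));
            }
        } else if para_start.is_none() {
            para_start = Some(offset);
        }
        offset = end;
    }
    // Flush whatever is still open at the end of the document.
    if let Some(start) = para_start {
        blocks.push(Block::Paragraph(start..offset));
    }
    if let Some(start) = code_start {
        blocks.push(Block::FencedCode(start..offset));
    }
    blocks
}

/// Pass 2: find `*...*` spans, but only inside paragraphs.
fn inline_pass(src: &str, blocks: &[Block]) -> Vec<Range<usize>> {
    let mut spans = Vec::new();
    for block in blocks {
        if let Block::Paragraph(range) = block {
            let mut open: Option<usize> = None;
            for (i, ch) in src[range.clone()].char_indices() {
                if ch == '*' {
                    match open.take() {
                        Some(start) => spans.push(range.start + start..range.start + i + 1),
                        None => open = Some(i),
                    }
                }
            }
        }
    }
    spans
}
```

The point of the split is that the inline pass never has to know about fences, and the block pass never has to know about emphasis; in the editor the second pass corresponds to the injected markdown_inline grammar.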

They're more correct now, and with some optimizations to incremental parsing I think they'll also be faster (but that will be a topic for future PRs).

MDeiml · 2022-07-28T14:24:02Z

Specifically, I think tree-sitter reparses all injected ranges if they contain "conflicts", whether or not they changed. That means parsing time is linear in the number of injected ranges. I think we could simply not reparse ranges that did not change at all (parsing should be deterministic with regard to the input anyway).
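As a minimal sketch of the "don't reparse unchanged ranges" idea, assuming parsing really is a pure function of the range's text: results can be cached by that text. `Tree`, `LayerCache` and the word-counting "parser" below are hypothetical stand-ins, not helix or tree-sitter APIs. Real syntax trees also carry positions, so an actual implementation would have to relocate a reused tree rather than just clone it.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for the parse result of one injected range.
#[derive(Clone, Debug, PartialEq)]
struct Tree {
    node_count: usize,
}

/// Caches parse results keyed by the exact text of each injected range.
#[derive(Default)]
struct LayerCache {
    trees: HashMap<String, Tree>,
    /// How many times the (pretend) expensive parse actually ran.
    parses: usize,
}

impl LayerCache {
    /// Parse one range, reusing the cached tree when the text is unchanged.
    fn parse(&mut self, text: &str) -> Tree {
        if let Some(tree) = self.trees.get(text) {
            return tree.clone();
        }
        self.parses += 1;
        // Pretend parse: one "node" per whitespace-separated word.
        let tree = Tree {
            node_count: text.split_whitespace().count(),
        };
        self.trees.insert(text.to_owned(), tree.clone());
        tree
    }

    /// Parse every injected range of a document.
    fn parse_all(&mut self, ranges: &[&str]) -> Vec<Tree> {
        ranges.iter().map(|text| self.parse(text)).collect()
    }
}
```

With this, editing one of N injected ranges costs one parse instead of N, at the price of keeping the old range texts around as cache keys.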

the-mikedavis · 2022-07-28T14:46:27Z

I'm not very familiar with the changes but I think archseer might've covered that when implementing combined injections: 6728e44...7c9ebd0

We don't use the stock tree-sitter-cli highlighting code since it isn't incremental.

MDeiml · 2022-07-28T21:48:00Z

Just looking at the code, that doesn't seem to be the case, but I should investigate further.

MDeiml · 2022-07-28T21:52:06Z

It's related to this TODO:

helix/helix-core/src/syntax.rs, line 724 in 681c0a9:

    // TODO: we should be able to avoid editing & parsing layers with ranges earlier in the document before the edit

The idea is to go one step further and also avoid parsing layers that come after the edit.
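Roughly, the decision per layer could look like the sketch below. This is not the code in syntax.rs: `Edit`, `Action` and `plan` are hypothetical names, and the `Edit` fields only mirror the byte fields of tree-sitter's `InputEdit`. It also ignores the case where the edit changes which ranges are injected in the first place.

```rust
use std::ops::Range;

/// An edit that replaced bytes `start..old_end` with text ending at `new_end`.
struct Edit {
    start: usize,
    old_end: usize,
    new_end: usize,
}

/// What to do with one injection layer after an edit.
#[derive(Debug, PartialEq)]
enum Action {
    /// Layer lies entirely before the edit: nothing to do.
    Keep,
    /// Layer lies entirely after the edit: move its range, do not reparse.
    Shift(isize),
    /// Layer overlaps or touches the edit: it has to be reparsed.
    Reparse,
}

fn plan(layer: &Range<usize>, edit: &Edit) -> Action {
    // Strict comparisons: an edit that merely touches a layer boundary
    // might extend the layer, so it conservatively counts as overlapping.
    if layer.end < edit.start {
        Action::Keep
    } else if layer.start > edit.old_end {
        Action::Shift(edit.new_end as isize - edit.old_end as isize)
    } else {
        Action::Reparse
    }
}
```

For an edit in the middle of a document with many injections, almost every layer then falls into `Keep` or `Shift`, so the work no longer grows with the number of injected ranges.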

the-mikedavis · 2022-07-30T17:48:15Z

This works well for me locally but let's await further review/merging in #3129

archseer · 2022-08-05T02:17:33Z

Can you resolve the conflict? Looks good to merge otherwise 👍🏻

languages.toml

the-mikedavis

Works great! Thanks for working on this plus going the extra mile to add the new predicate :)

archseer requested a review from the-mikedavis July 19, 2022 13:19

MDeiml force-pushed the use_split_markdown_grammar branch from fe19775 to 7a3906c Compare July 19, 2022 14:34

MDeiml mentioned this pull request Jul 21, 2022

Exclude only named children without injection.include-children #3129

Merged

MDeiml marked this pull request as ready for review July 21, 2022 06:42

MDeiml force-pushed the use_split_markdown_grammar branch from cb4e6db to b7ee7af Compare July 26, 2022 15:17

the-mikedavis added the S-waiting-on-pr Status: This is waiting on another PR to be merged first label Jul 30, 2022

the-mikedavis removed the S-waiting-on-pr Status: This is waiting on another PR to be merged first label Aug 3, 2022

archseer reviewed Aug 5, 2022

MDeiml force-pushed the use_split_markdown_grammar branch 2 times, most recently from 0802082 to 840e2d7 Compare August 6, 2022 14:42

MDeiml added 2 commits August 6, 2022 17:01

Use split_parser branch for markdown grammar

3005c68

Add strikethrough

c750fa3

MDeiml force-pushed the use_split_markdown_grammar branch from 840e2d7 to c750fa3 Compare August 6, 2022 15:03

the-mikedavis approved these changes Aug 6, 2022

the-mikedavis merged commit ea04220 into helix-editor:master Aug 6, 2022

MDeiml deleted the use_split_markdown_grammar branch August 6, 2022 16:33

David-Else mentioned this pull request Aug 9, 2022

freeze on attempt to open a markdown document #3375

Closed

thomasskk pushed a commit to thomasskk/helix that referenced this pull request Sep 9, 2022

Use split_parser branch for markdown grammar (helix-editor#3108)

d9f6d8a

This was referenced Oct 7, 2022

Slow Markdown Highlighting #4139

Closed

Improve performance for files with lots of injections #4146

Merged
