Improve document preprocessing by thecrypticace · Pull Request #1530 · tailwindlabs/tailwindcss-intellisense

thecrypticace · 2025-12-31T17:58:28Z

In the language server we do a lot of scanning of documents for:

Embedded languages (e.g. HTML <style> blocks are CSS, <script> tags are JS)
Class lists
Function calls in CSS
Complete or partial at-rules in CSS
etc…

Additionally, the user can provide custom regexes to target arbitrary text as class lists.

We don't want to detect any of these inside comments, so we preprocess documents by replacing comments with spaces. Additionally, in JS, we replace regex literals with spaces, as we don't want something like /<style>/ to be accidentally detected as the start of an embedded language.

This process takes a small amount of time and memory and can be complicated to do correctly. Here I've replaced the existing scanner/parser with a version based on UTF-16 code units (e.g. String#charCodeAt) that is more correct than regexes (JS has no support for recursive patterns), uses less memory, and is up to ~8x faster in my benchmarks.
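To illustrate the general idea, here is a minimal sketch of a code-unit-based pass that blanks CSS block comments. It is not the code from this PR: the function name `blankCssComments` and the constants are hypothetical, and keeping line breaks inside comments (so line and column positions stay valid) is an assumption of this sketch. String handling is simplified and does not treat a newline as ending an unterminated string.

```typescript
// Hypothetical sketch, not the PR's implementation.
const SLASH = 0x2f // '/'
const STAR = 0x2a // '*'
const NEWLINE = 0x0a
const CARRIAGE_RETURN = 0x0d
const DOUBLE_QUOTE = 0x22
const SINGLE_QUOTE = 0x27
const BACKSLASH = 0x5c

function blankCssComments(input: string): string {
  const len = input.length
  let result = ''
  let copiedUpTo = 0 // everything before this index is already in `result`
  let i = 0

  while (i < len) {
    const code = input.charCodeAt(i)

    // Skip string literals so comment-like text inside them is left alone.
    if (code === DOUBLE_QUOTE || code === SINGLE_QUOTE) {
      i++
      while (i < len) {
        const c = input.charCodeAt(i)
        if (c === BACKSLASH) {
          i += 2 // skip the escaped code unit
          continue
        }
        i++
        if (c === code) break // closing quote
      }
      continue
    }

    // Block comment: replace every code unit with a space, keeping line breaks.
    if (code === SLASH && input.charCodeAt(i + 1) === STAR) {
      const close = input.indexOf('*/', i + 2)
      const end = close === -1 ? len : close + 2 // unterminated: blank to EOF

      result += input.slice(copiedUpTo, i)
      for (let j = i; j < end; j++) {
        const c = input.charCodeAt(j)
        result += c === NEWLINE || c === CARRIAGE_RETURN ? input.charAt(j) : ' '
      }

      copiedUpTo = end
      i = end
      continue
    }

    i++
  }

  return result + input.slice(copiedUpTo)
}
```

Because each comment code unit maps to exactly one output code unit, the output has the same length as the input, so offsets found in the preprocessed text can be used directly against the original document.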

The implementation here isn't perfect either but it is a bit better.

This is both faster and uses less memory — especially on large documents

thecrypticace requested a review from RobinMalfait December 31, 2025 17:58

thecrypticace added 2 commits December 31, 2025 20:40

Add test

076be61

Reimplement language preprocessing

fef9459

thecrypticace force-pushed the feat/document-scan-fast branch from c1b2c98 to fef9459 Compare January 1, 2026 01:41

thecrypticace wants to merge 2 commits into feat/document-no-async-find from feat/document-scan-fast
