Motivation
First step in #2461
This PR adds support for semantic token deltas.
Here's how semantic highlighting works before and after this change.
Before
On every keypress, the editor sends a textDocument/semanticTokens/full request and the server returns all tokens for the entire file. In large files, the editor will sometimes send textDocument/semanticTokens/range to compute tokens for a limited range instead of doing it for the entire file.
After
When the file is first opened, the editor sends a textDocument/semanticTokens/full request to compute all tokens for that file. From that point on, the editor only sends textDocument/semanticTokens/full/delta, to receive only the difference in tokens between the previous response and the current state of the file.
Why does this matter
Computing semantic tokens on the server is cheap. In some of my benchmarks, the Ruby LSP can compute ~20k tokens in roughly 10ms. However, applying these tokens in the editor is very expensive. The fewer tokens we return, the faster the editor becomes.
In fact, if we return ~20k tokens on every keypress, the editor becomes backlogged with work and the lag prevents users from working normally.
Does this PR completely fix the problem
No. It mitigates the problem, but I will follow this PR up with an audit of our semantic token implementations. We currently return tokens for pretty much everything, but we need to be a lot more conservative and only return tokens for constructs that are genuinely ambiguous, to avoid performance issues.
Implementation
There are a few important parts to this implementation. First, for semantic token requests to work properly with deltas, we need to keep track of the semantic token result IDs, and we need a separate cache for those tokens that cannot be cleared upon typing. The delta request needs to remember the previous result, so clearing the cache upon typing would make it impossible to implement.
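To illustrate the idea, here is a minimal sketch of such a cache. The class and method names are hypothetical, not the actual Ruby LSP internals; the point is that each stored result gets a monotonically increasing result ID, and the previous tokens stay available per URI so a delta can be computed against them.

```ruby
# Hypothetical sketch of a semantic tokens cache that survives edits.
# Names are illustrative and not taken from the real implementation.
class SemanticTokensCache
  def initialize
    @result_id = 0
    @tokens_by_uri = {} # previous token data per document URI
  end

  # Store freshly computed tokens and hand out a new result ID.
  def store(uri, tokens)
    @result_id += 1
    @tokens_by_uri[uri] = { result_id: @result_id.to_s, tokens: tokens }
  end

  # Fetch the previous result so a delta can be computed against it.
  # Unlike other caches, this one is NOT cleared when the user types.
  def previous(uri)
    @tokens_by_uri[uri]
  end
end
```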
Secondly, running the full semantic tokens request together with the other listeners makes it incredibly difficult to ensure correct result IDs. Accidentally bumping the ID makes the editor miss the delta, and we lose the benefits. Supporting deltas is clearly more impactful from a performance standpoint than running semantic highlighting alongside the other listeners.
Having it separate will also play well with the next step to audit our semantic tokens, because we can convert the implementation to a visitor and have much better control over which tokens are returned.
Finally, the last part is computing the delta itself. The algorithm trims the tokens that match at the beginning and at the end of the previous and current token arrays. Whatever is left will be the single edit necessary to turn the previous tokens into the current ones.
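The trimming described above can be sketched as follows. This is a simplified standalone version, not the actual PR code: `previous` and `current` are the flat integer arrays from the LSP semantic tokens encoding, and the returned hash mirrors the spec's single SemanticTokensEdit (start, delete count, replacement data).

```ruby
# Compute the single edit that turns the previous token array into the
# current one, by trimming the common prefix and common suffix.
def compute_delta(previous, current)
  # Advance past every leading token value that is unchanged
  prefix = 0
  prefix += 1 while prefix < previous.length && prefix < current.length &&
                    previous[prefix] == current[prefix]

  # Walk back past every trailing token value that is unchanged,
  # without overlapping the prefix
  suffix = 0
  suffix += 1 while suffix < previous.length - prefix &&
                    suffix < current.length - prefix &&
                    previous[previous.length - 1 - suffix] ==
                    current[current.length - 1 - suffix]

  # The middle region is the single edit: delete what remains of the
  # previous array and insert what remains of the current one
  {
    start: prefix,
    delete_count: previous.length - prefix - suffix,
    data: current[prefix...current.length - suffix],
  }
end
```

With this shape, identical arrays produce an empty edit (`delete_count: 0`, `data: []`), so an unchanged file costs the editor almost nothing to apply.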
Automated Tests
Added a bunch of tests.