Semantic string tokens vs injected syntax #7111

mitsuhiko · 2020-12-31T22:17:56Z

I'm currently trying to inject some syntax into string literals using the injectTo grammar feature. The idea is to match specially formed strings in macros and give them custom syntax highlighting (mitsuhiko/insta#149).

Configuration-wise, my extension does something like this:

    "grammars": [
      {
        "language": "insta-snapshots",
        "scopeName": "source.insta-snapshots",
        "path": "./syntaxes/insta-snapshots.tmLanguage.json"
      },
      {
        "scopeName": "source.inline-insta-snapshots",
        "injectTo": [
          "source.rust"
        ],
        "path": "./syntaxes/inline-insta-snapshots.tmLanguage.json",
        "embeddedLanguages": {
          "meta.embedded.inline-insta-snapshot": "insta-snapshots"
        }
      }
    ]

Then in the grammar I'm matching on something. This all works, and I end up injecting a custom sub-syntax into strings that look like @r"""...""". The issue is that once RLS runs, a "string" semantic token is put over the entire region I just parsed, which completely removes my custom syntax again.

I was looking into how this is supposed to be solved or how other languages do it, but the only thing I found is that apparently rust-analyzer is the only language server which emits semantic string tokens. At least I don't see similar things in TypeScript and some other languages I tried.

Not sure if filing this here makes sense, but considering there are many things working together, I figured I'd start by filing something here.

The following screenshot shows the issue:

The token under the cursor is correctly determined to be keyword.insta, but the styling from the theme is discarded because the string semantic token takes precedence.

mitsuhiko · 2020-12-31T23:24:08Z

I've now filed this against vscode, since I think it's more likely to be an issue there: microsoft/vscode#113640

mitsuhiko · 2021-01-10T00:56:11Z

The response in the vscode issue is effectively that this would require coordination with RLS:

For this particular case, I think you could try to coordinate with the rust extension / rust semantic tokens provider implementation to give you an option or an API that would prevent the semantic tokens provider from covering those strings. Also, if the semantic tokens provider just creates string tokens, (which would be equal with the tokens created by TM), then they could simply stop creating them altogether without any visible effects.

matklad · 2021-01-18T17:14:25Z

Yeah, I think it makes sense to add a config flag here, to suppress semantic tokens for strings or for all tokens. The config value should be declared here:

https://github.com/rust-analyzer/rust-analyzer/blob/9daba961f236750c3a5d831c9775606271b37eff/crates/rust-analyzer/src/config.rs#L28-L184

Rather than not producing the tokens, we should filter them out in the LSP layer, over here:

https://github.com/rust-analyzer/rust-analyzer/blob/c72d3a7c0989e63a9b063fed445cbbaf3e40a29f/crates/rust-analyzer/src/to_proto.rs#L336-L363

See this parameter to learn how to thread config between two places.
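
For readers following along, the filtering approach described above might look roughly like the sketch below. It is only an illustration under stated assumptions: `TokenKind`, `HighlightedRange`, `HighlightConfig`, and its `strings` field are hypothetical stand-ins invented for this example, not rust-analyzer's actual types or config names. The point is that highlighting still computes every token, and the conversion step drops string tokens when the flag is off.

```rust
// Hypothetical, simplified stand-ins; these are NOT rust-analyzer's real types.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum TokenKind {
    Keyword,
    Function,
    StringLiteral,
}

/// A highlighted span of source text (offsets are illustrative only).
#[derive(Debug, Clone, PartialEq, Eq)]
struct HighlightedRange {
    start: u32,
    end: u32,
    kind: TokenKind,
}

/// Hypothetical config threaded in from the user's settings.
#[derive(Debug, Clone, Copy)]
struct HighlightConfig {
    /// When false, string tokens are not sent to the client.
    strings: bool,
}

/// Converts highlight ranges into the tokens sent to the client.
/// String tokens are filtered out here, in the conversion layer,
/// instead of being suppressed where highlighting is computed.
fn semantic_tokens(
    ranges: &[HighlightedRange],
    config: HighlightConfig,
) -> Vec<HighlightedRange> {
    ranges
        .iter()
        .filter(|r| config.strings || r.kind != TokenKind::StringLiteral)
        .cloned()
        .collect()
}
```

With `strings` set to false, the client receives no semantic token for the string region, so the TextMate scopes from an injected grammar would remain the only styling applied there.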

djrenren · 2021-05-07T20:06:58Z

Hey @matklad, that "this parameter" link seems to be another link to the semantic_tokens function. Did you mean to point to something else?

matklad · 2021-05-07T20:16:18Z

Yeah, I think https://github.com/rust-analyzer/rust-analyzer/blob/c72d3a7c0989e63a9b063fed445cbbaf3e40a29f/crates/rust-analyzer/src/to_proto.rs#L462 this is what I wanted to link here.

lnicola added C-Architecture Big architectural things which we need to figure up-front (or suggestions for rewrites :0) ) E-hard S-unactionable Issue requires feedback, design decisions or is blocked on other work labels Jan 18, 2021

djrenren mentioned this issue May 10, 2021

Allow semantic tokens for strings to be disabled #8795

Merged

bors bot closed this as completed in f9d4a9e May 17, 2021

ian-h-chamberlain mentioned this issue Aug 2, 2022

x/tools/gopls: semantic tokenizing of string escape/format characters golang/go#45753

Closed

DanTup mentioned this issue Oct 11, 2022

Consider supporting embedded language grammars when using semantic tokens microsoft/vscode#163292

Open
