feat(ast,parser): parse jsdoc #168

Boshen · 2023-03-12T03:08:59Z

I see there are some demand for reading jsdoc information, let's do this in two steps:

parse jsdoc into structured data, this should be lazy.
save them into trivias and have them accessible. I haven't figured out the best way to do this, but see AST Trivia handling #6 for disucssion.

Note: This is for parsing jsdoc comments to structured data. This is not about wiring up all the jsdoc comments into a document tree (which is the main purpose for jsdoc).

To parse jsdoc comments, we need to:

add a new routine to parse jsdoc in oxc_parser, this should be a separate routine so we can parse jsdoc lazily
add the jsdoc AST to oxc_ast
setup jsdoc tests (I'm unsure on how to do this properly yet).

References:

https://jsdoc.app/
tests
- https://github.com/jsdoc/jsdoc/tree/main/packages/jsdoc/test
- typescript conformance tests: https://github.com/microsoft/TypeScript/tree/main/tests/cases/conformance/jsdoc

Boshen · 2023-03-19T13:37:31Z

Prioritizing because @thepassle need this. See https://twitter.com/passle_/status/1637446645104164865

Boshen · 2023-03-23T14:35:45Z

Update:

The original intent was to come up with a generic comment attachment algorithm, where we attach comments to some AST node driven by a set of rules. After some research and reading about it in the ESLint and Babel codebases, I conclude that it is incomprehensible for me right now.

So instead, let's reduce the problem space down to "attach jsdoc comments to specific AST node". This will be much more approachable.

We will build the jsdoc data structure inside the semantic analyzer. When a jsdoc targeting AST node such as a Statement or Declaration is visited, we will ask Trivias to get us a leading comment. The jsdoc data structure will save the comment span and mark the Statement (SemanticNode) to indicate it has a jsdoc. A getter will be provided to parse the jsdoc lazily in a OnceCell.

ematipico · 2023-06-20T14:46:24Z

I want to help somehow. What do you need or what can I do? :)

Boshen · 2023-06-20T16:02:02Z

I want to help somehow. What do you need or what can I do? :)

Hi Ema! I've forgotten everything we wrote about jsdocs, but would you be interested in getting a https://github.com/gajus/eslint-plugin-jsdoc rule to work? I can guide you on the missing pieces in the discord channel. I think we had some of the infra working, targeting a lint rule would be the easiest to fill in the gaps.

lukeed · 2024-02-05T01:30:22Z

I'd love to be able to just see & access comment values instead of skipping them outright.

I'd definitely consider it a bonus step for oxc to parse any JSDocs and translate its meaning(s) to the AST, but there are lots of instances where users either invent directives or just use a standard comment to transfer additional information

Quick examples:

/** @table "users" */
type User = { ... }

import(/* webpackChunkName: "my-chunk" */ "foobar");

// comptime
const DAY = ms(1, "day")

leaysgur · 2024-02-22T09:11:25Z

We talked a bit about this in #2437 ...,

There are many use cases for JSDoc, but for now, aim to implement eslint-plugin-jsdoc
To know the details, read the source of eslint-plugin-jsdoc first

So, I spent these days reading through the code.

eslint-plugin-jsdoc itself
jsdoccomment, comment-parser and jsdoc-type-pratt-parser, which are heavily dependent

(There are 3 articles on my blog if you're interested. Sorry it's in Japanese.)

As a result, although IMO, I'm not sure we should aim for 100% compatibility for this.

For a number of reasons,

The 10 year old code was very convoluted and hard to understand...
- Not easy to deduce the design intent either
It depends on a lot of external libraries
- https://github.com/gajus/eslint-plugin-jsdoc/blob/ab893bae6aa5f05228390cb3ce4487485360cba8/package.json#L7-L16
- e.g. https://github.com/gajus/eslint-plugin-jsdoc/blob/main/src/rules/checkValues.js
Especially esquery (+estraverse), as the ability to refine the execution context with their originally invented AST seems particularly hard(but widely integrated)
- https://github.com/gajus/eslint-plugin-jsdoc/blob/main/docs/advanced.md
- Personally, I can't think of a use case, but it might be necessary for those who need it?
- And I have no idea how to write such a dynamic process in Rust... 😢

@Boshen Sorry for the long lines, then I'd like to confirm,

Were you originally going to re-implement eslint-plugin-jsdoc with compatibility, no matter how hard it was?
Do you think it would be worthwhile to just omit the heavy functionality
- and provide rules/oxc_jsdoc like rules/oxc?
- or keep the label eslint-plugin-jsdoc?
(Maybe we haven't seen this yet, but) Should we leave this to external plugins and spend our time on other things?
- e.g. Like TypeScript, put JSDoc in the AST?(although compatibility with ESTree would be an issue)
- e.g. Just provide a generic API like getLeadingComments(node) and leave it at all

What do you think? 👀

Boshen · 2024-02-22T09:44:59Z

@leaysgur Oh wow I didn't expect 3 blog posts on this topic, I thought jsdoc is a solved problem 😰

So in summary, it seems like the hardest part about jsdoc is comment attachment to AST nodes.

We can leave this part out and focus our task on just jsdoc content rules, which is quiet easy as all we need to do is finish the jsdoc parser and run rules against these parsed jsdocs.

then I'd like to confirm

The intention was to pass all the tests, but we don't really need to do it if it's not a fun task.

And if you're sick of jsdoc after looking at it for 3 days ... you may also join me on the eslint-plugin-import task.

leaysgur · 2024-02-22T14:28:30Z

For future reference, let me elaborate.

the hardest part about jsdoc is comment attachment to AST nodes.

This wasn't particularly difficult, at least if we trust the logic of eslint-plugin-jsdoc, or rather, jsdoccomment.

https://github.com/es-joy/jsdoccomment/blob/6aae5ea306015096e3d58cd22257e5222c54e3b4/src/jsdoccomment.js#L283

To put it simply, it was just this:

// findJSDocComment(node, sourceCode): Comment | null;
const beforeTokens = sourceCode.getTokensBefore(node, { includeComments: true });
while (let token = beforeTokens.pop()) {
  if (token.type === 'Block' && token.value.startsWith('*')) return token;
}

return null;

And I believe this behavior was already implemented in #2437 .

The part I found to be the hardest was that some(about half of) rules can:

1️⃣ freely determine which astNode to execute the above logic

if (
  esquery.matches(
    astNode,
    // from rule config
    `MethodDefinition:not([accessibility="public"]):has(JsdocBlock)`
  )
) {
  const jsdocComment = findJSDocComment(astNode, sourceCode);
}

2️⃣ And also based on the Comment obtained, freely determine whether to execute rule handler

const jsdocAstNode = toESTreeLikeAST(jsdocComment);
if (
  esquery.matches(
    jsdocAstNode,
    // from rule config
    `JsdocBlock[postDelimiter=""]:has(JsdocTypeUnion > JsdocTypeName[value="Bar"]:nth-child(1))`
  )
) {
  ruleHandler({ astNode, jsdocAstNode });
}

https://github.com/gajus/eslint-plugin-jsdoc/blob/main/docs/advanced.md

(Actually, they seemed to have a bit more complicated logic...)

About rest half of the rules seem to simply check "only the text of all JSDoc comments in the source".

However, their implementation was like

for (const { astNode, jsdocAstNode } of jsdocNodesWithAttachedNode)
  // Why astNode required...??
  ruleHandler({ astNode, jsdocAstNode });
// ...
for (const { jsdocAstNode } of jsdocNodesWithoutAttachedNode)
  ruleHandler({ astNode: null, jsdocAstNode });

they are called differently for some reason.

https://github.com/gajus/eslint-plugin-jsdoc/blob/e948bee821e964a92fbabc01574eca226e9e1252/src/iterateJsdoc.js#L2279-L2328

And when I tried to replace astNode of the former to null, the tests started to FAIL...! 😇

💡 After additional research, I finally solved this mystery.

check-tag-names
informative-docs
no-undefined-types

It seems that these 3 rules perform extra linting if node exists. Other all rule's tests pass without node!

you may also join me on the eslint-plugin-import task.

That sounds interesting as well.

Either way, I'll think a bit more about how to conclude #2437 .

Partial fix for #168 - [x] Fix general finding behavior for leading comments - [x] Accept multiple jsdoc comments per node - [x] Provide `get_one` and also `get_all` - [x] Add `iter_all()` for non-node related usage - [x] Limit AST node kinds to parse

Boshen · 2024-04-15T02:34:28Z

@leaysgur is carrying this task 👍 . Future issues can be created separately now.

Partial fix for oxc-project#168 - [x] Fix general finding behavior for leading comments - [x] Accept multiple jsdoc comments per node - [x] Provide `get_one` and also `get_all` - [x] Add `iter_all()` for non-node related usage - [x] Limit AST node kinds to parse

Boshen added this to the AST / Lexer / Parser milestone Mar 12, 2023

Boshen added the E-Help Wanted Experience level - For the experienced collaborators label Mar 12, 2023

Boshen self-assigned this Mar 19, 2023

Boshen added the P-high Priority - High label Mar 21, 2023

Boshen assigned shannonrothe Mar 21, 2023

Boshen modified the milestones: AST / Lexer / Parser, Linter Mar 22, 2023

Boshen removed this from the AST / Lexer / Parser milestone Mar 29, 2023

Boshen added the A-parser Area - Parser label Mar 29, 2023

Boshen added this to the 0.0.3 milestone Mar 29, 2023

Boshen added the A-linter Area - Linter label Mar 29, 2023

Boshen removed this from the 0.0.3 milestone Apr 2, 2023

This comment was marked as outdated.

Sign in to view

leaysgur mentioned this issue Jan 19, 2024

fix(semantic): find the nearest valid jsdoc comment #2064

Closed

leaysgur mentioned this issue Feb 19, 2024

fix(semantic): Refactor jsdoc finding #2437

Merged

5 tasks

Boshen assigned leaysgur and unassigned shannonrothe and Boshen Feb 20, 2024

Boshen closed this as completed Apr 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(ast,parser): parse jsdoc #168

feat(ast,parser): parse jsdoc #168

Boshen commented Mar 12, 2023 •

edited

Loading

Boshen commented Mar 19, 2023

Boshen commented Mar 23, 2023 •

edited

Loading

ematipico commented Jun 20, 2023

Boshen commented Jun 20, 2023

This comment was marked as outdated.

lukeed commented Feb 5, 2024

leaysgur commented Feb 22, 2024

Boshen commented Feb 22, 2024

leaysgur commented Feb 22, 2024 •

edited

Loading

Boshen commented Apr 15, 2024

feat(ast,parser): parse jsdoc #168

feat(ast,parser): parse jsdoc #168

Comments

Boshen commented Mar 12, 2023 • edited Loading

Boshen commented Mar 19, 2023

Boshen commented Mar 23, 2023 • edited Loading

ematipico commented Jun 20, 2023

Boshen commented Jun 20, 2023

This comment was marked as outdated.

lukeed commented Feb 5, 2024

leaysgur commented Feb 22, 2024

Boshen commented Feb 22, 2024

leaysgur commented Feb 22, 2024 • edited Loading

Boshen commented Apr 15, 2024

Boshen commented Mar 12, 2023 •

edited

Loading

Boshen commented Mar 23, 2023 •

edited

Loading

leaysgur commented Feb 22, 2024 •

edited

Loading