Why

The token queue adds a level of indirection that makes some issues harder to fix. For example, #292 becomes easy to fix once the token queue is gone. Debugging is also currently complicated, as stack traces end at the token queue.

With the queue gone, stack traces will point at the corresponding line in the tokenizer, and V8 will be able to optimise more aggressively: in my branch combining all of the changes, I see a ~15% performance increase on htmlparser-benchmark.
Game plan
1. Update the tokenizer to produce events. A QueuedTokenizer class will wrap the tokenizer and provide the current queue-based interface for the parser. Opened as refactor(tokenizer): Introduce events #404
2. Invert event processing in the parser. The parser currently checks the insertion mode first and then the token type. By inverting this (checking the token type first, then the insertion mode), we prepare the parser to accept the events from (1). Opened as refactor(parser): Invert event processing #405

(1) and (2) do not depend on one another and can be merged independently.
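To illustrate the direction of (1), here is a minimal sketch of an event-producing tokenizer. All names here (`TokenHandler`, `EventTokenizer`, the handler methods) are hypothetical, not parse5's actual API, and the tokenizer body is greatly simplified — the real one is a full HTML state machine with attributes, comments, entities, and error recovery.

```typescript
// Hypothetical sketch: instead of pushing tokens onto a queue for the parser
// to drain later, the tokenizer invokes handler callbacks directly as tokens
// are produced.

interface TokenHandler {
    onStartTag(name: string): void;
    onEndTag(name: string): void;
    onText(text: string): void;
}

class EventTokenizer {
    constructor(private handler: TokenHandler) {}

    // Greatly simplified: handles only bare tags and text.
    write(html: string): void {
        let i = 0;
        while (i < html.length) {
            if (html[i] === "<") {
                const end = html.indexOf(">", i);
                const inner = html.slice(i + 1, end);
                if (inner.startsWith("/")) {
                    this.handler.onEndTag(inner.slice(1));
                } else {
                    this.handler.onStartTag(inner);
                }
                i = end + 1;
            } else {
                const next = html.indexOf("<", i);
                const stop = next === -1 ? html.length : next;
                // Emitted directly: if a handler throws, the stack trace
                // points into the tokenizer, not into a queue-draining loop.
                this.handler.onText(html.slice(i, stop));
                i = stop;
            }
        }
    }
}

// Record the emitted events for a tiny input.
const events: string[] = [];
new EventTokenizer({
    onStartTag: (name) => events.push(`start:${name}`),
    onEndTag: (name) => events.push(`end:${name}`),
    onText: (text) => events.push(`text:${text}`),
}).write("<p>hi</p>");
// events is now ["start:p", "text:hi", "end:p"]
```

The parser-facing `QueuedTokenizer` wrapper mentioned above would implement such a handler and buffer the callbacks back into a queue, preserving the existing pull-based interface during the transition.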
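For (2), a schematic sketch of the inversion, with hypothetical names and only two insertion modes: the outer dispatch moves from the insertion mode to the token type, so that each tokenizer event from (1) can map directly onto a parser method.

```typescript
// Hypothetical sketch: mode and method names are illustrative, not parse5's
// actual internals.
type InsertionMode = "IN_BODY" | "IN_TABLE";

// Before the inversion, a single entry point switched on this.insertionMode
// first and on the token type inside each case. After the inversion there is
// one entry point per token type (matching the tokenizer events), and the
// insertion-mode switch lives inside each handler:
class Parser {
    insertionMode: InsertionMode = "IN_BODY";

    onStartTag(name: string): string {
        switch (this.insertionMode) {
            case "IN_BODY":
                return `<${name}> inserted in body`;
            case "IN_TABLE":
                return `<${name}> handled by table insertion rules`;
        }
    }

    onText(text: string): string {
        switch (this.insertionMode) {
            case "IN_BODY":
                return `"${text}" inserted as character data`;
            case "IN_TABLE":
                return `"${text}" foster-parented out of the table`;
        }
    }
}

const parser = new Parser();
const inBody = parser.onStartTag("div"); // "<div> inserted in body"
parser.insertionMode = "IN_TABLE";
const inTable = parser.onText("x");
```

With this shape in place, the `QueuedTokenizer` wrapper from (1) can later be deleted and the tokenizer wired straight to these methods.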
cc @wooorm @43081j