Report all syntax errors in Markdown files #389

NathanReb · 2022-08-01T12:13:17Z

This PR improves the MDX workflow by making sure that all parsing errors are reported at once rather than exiting on the first errors.

Obviously that only impacts errors that we can recover from e.g. mis-used or missing labels, syntax error inside a block but not for errors such as invalid markdown.

We used to do all the work in markdown/cram parser. I changed that to extract raw bits of data from the parser and leave their interpretation to a later pass.

While working on this I discovered that the locations we attach to blocks are incorrect due to inconsistencies in eol handling in the lexer code which I took the liberty to fix as part of this PR. I can eventually extract it as a separate PR depending on this one if you'd like but basing it on this work made it significantly simpler.

CHANGES.md

NathanReb · 2022-08-01T12:16:39Z

There is a small side effect which is that errors with labels are reported with the entire block location. I think it's not too bad, especially given how precise the error messages usually are, there should be no ambiguity as to where the error comes from.

We can eventually try to refine error locations even further but since we have no editor integration at the moment I see little incentive for it.

NathanReb · 2022-08-01T12:17:48Z

Finally, this does not impact mli files at the moment. The implementation of the parsing being quite different I decided to open a PR with I already had.

I'll work on the mli side of things next but doing it separately should ease the review!

Signed-off-by: Nathan Rebours <nathan.p.rebours@gmail.com>

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Signed-off-by: Nathan Rebours <nathan.p.rebours@gmail.com>

Leonidas-from-XIV

Thanks for the PR. I think it is fair to leave the fix for the error locations here, untangling it would probably create more work than it is worth, although it would be somewhat simpler to review.

In general I think it is fair to move the parsing out of the lexer, which also makes programming a bit easier in a file that is actual OCaml syntax instead of ocamllex. And well, it is more parsing than lexing anyway.

The code is fine, I don't have anything big to note. Just a few comments on the points that took me longer to understand with suggestions how to improve them.

lib/block.ml

lib/lexer_mdx.mll

lib/block.ml

lib/mdx.ml

lib/util.ml

test/bin/mdx-test/failure/part-unsupported/test-case.md.expected

test/lib/test_block.ml

Signed-off-by: Nathan Rebours <nathan.p.rebours@gmail.com>

Leonidas-from-XIV

Looks good, I think it is ready to be merged.

@NathanReb

CHANGES: #### Added - Report all parsing errors in Markdown files (realworldocaml/mdx#389, @NathanReb) #### Changed - Preserve indentation in multiline OCaml blocks in .mli files (realworldocaml/mdx#395, @panglesd) #### Fixed - Fixed compatibility with Cmdliner 1.1.0 (realworldocaml/mdx#371, @Leonidas-from-XIV) - Report errors and exit codes of toplevel directives (realworldocaml/mdx#382, @talex5, @Leonidas-from-XIV) - Fix block locations in error reporting (realworldocaml/mdx#389, @NathanReb) - Include the content of the line that features the `part-end` MDX directive in the output, before that line would've been dropped (realworldocaml/mdx#374, realworldocaml/mdx#387, @Leonidas-from-XIV) - Handle EINTR signal on waitpid call by restarting the syscall. (realworldocaml/mdx#409, @tmcgilchrist) - Fix parsing of multiline toplevel phrases in .mli files (realworldocaml/mdx#394, realworldocaml/mdx#397, @Leonidas-from-XIV) #### Removed - Removed warning about missing semicolons added in MDX 1.11.0 and the automatic insertion of semicolons in the corrected files introduced in MDX 2.0.0. (realworldocaml/mdx#398, @Leonidas-from-XIV)

NathanReb requested a review from Leonidas-from-XIV August 1, 2022 12:13

github-actions bot reviewed Aug 1, 2022

View reviewed changes

CHANGES.md Outdated Show resolved Hide resolved

github-actions bot reviewed Aug 1, 2022

View reviewed changes

CHANGES.md Outdated Show resolved Hide resolved

NathanReb mentioned this pull request Aug 1, 2022

Introduce new syntax for explicit block type #385

Open

NathanReb force-pushed the report-all-syntax-errors branch from 6da7421 to afa20e1 Compare August 2, 2022 08:06

NathanReb and others added 6 commits August 2, 2022 10:28

Add test for multiple errors reporting

33ba162

Signed-off-by: Nathan Rebours <nathan.p.rebours@gmail.com>

Report all parsing errors in markdown and cram files

30ca156

Add test for block locations

d6b375c

Signed-off-by: Nathan Rebours <nathan.p.rebours@gmail.com>

Fix block locations computation in Markdown and cram files

5515cf0

Signed-off-by: Nathan Rebours <nathan.p.rebours@gmail.com>

Update CHANGES.md

5cab168

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Add concat_map to Util.List for 4.08 compatibility

00dc7d9

Signed-off-by: Nathan Rebours <nathan.p.rebours@gmail.com>

NathanReb force-pushed the report-all-syntax-errors branch from afa20e1 to 00dc7d9 Compare August 2, 2022 08:42

Leonidas-from-XIV reviewed Aug 2, 2022

View reviewed changes

NathanReb added 4 commits August 2, 2022 16:57

Fix unallowed label error message

279c740

Signed-off-by: Nathan Rebours <nathan.p.rebours@gmail.com>

Rename print_loc to pp in Stable_printer

a2fb3c1

Signed-off-by: Nathan Rebours <nathan.p.rebours@gmail.com>

Use Util.Result.List.split instead of filter_map and concat_map

3d369f3

Signed-off-by: Nathan Rebours <nathan.p.rebours@gmail.com>

Rewrite anonymous functions dealing wiht Msg to fun (Msg ...)

8eaedb5

Signed-off-by: Nathan Rebours <nathan.p.rebours@gmail.com>

NathanReb requested a review from Leonidas-from-XIV August 2, 2022 15:56

Leonidas-from-XIV approved these changes Aug 3, 2022

View reviewed changes

NathanReb merged commit d23b06b into realworldocaml:main Aug 3, 2022

Leonidas-from-XIV mentioned this pull request Jan 6, 2023

[new release] mdx (2.2.0) ocaml/opam-repository#22867

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Report all syntax errors in Markdown files #389

Report all syntax errors in Markdown files #389

NathanReb commented Aug 1, 2022

NathanReb commented Aug 1, 2022

NathanReb commented Aug 1, 2022

Leonidas-from-XIV left a comment

Leonidas-from-XIV left a comment

Report all syntax errors in Markdown files #389

Report all syntax errors in Markdown files #389

Conversation

NathanReb commented Aug 1, 2022

NathanReb commented Aug 1, 2022

NathanReb commented Aug 1, 2022

Leonidas-from-XIV left a comment

Choose a reason for hiding this comment

Leonidas-from-XIV left a comment

Choose a reason for hiding this comment