Edit linting: use line mapping to rule out irrelevant errors #3649

li-boxuan · 2024-08-29T06:52:18Z

Short description of the problem this fixes or functionality that this introduces. This may be used for the CHANGELOG

Please read #3412 for more detail. To simply put, previous work uses the diff of linting error messages from pre-edit-lint and post-edit-lint to rule out "irrelevant" linting errors. The biggest flaw is that error messages could be generic - leading to both false positives and false negatives. This PR uses a more robust approach: line numbers.

Give a summary of what the PR does, explaining any non-trivial design decisions

Before edit, we run linting once, and record the error messages from the linter. After edit, we run linting once again, and also record the error messages. We then compare the line numbers from both pre-edit-lint and post-edit-lint. We rule out errors that are irrelevant to the edit (i.e. we remove errors that exist in both pre-edit-lint and post-edit-lint). This is non-trivial because line numbers might change. Luckily, since we only allow edits at a single place at a time, it is not too hard to figure out line mapping.

Link of any specific issues this addresses

#3412

Fix a potential issue that might lead to file corruption when edit linting is enabled #3124 introduces a feature for editing: running linter twice before and after the change and only extract new errors introduced by the agent. This has some potential issues and I am working on #3649 to address them, but I feel like I am not gonna finish it in the next few days, and that PR has become harder and harder to review, thus this PR, which only focuses on a small improvement. So what's the issue? When we run linters on the original file before our edits, we need to copy the original file and use a temporary file to lint, because linting may have side-effect (e.g. modifying the file in-place). I used the word "may" because: Flake8 has no side-effect, so not a problem as of now. We don't enforce this or document this "no side-effect" as a requirement for linter implementation, so side-effect is allowed. Regardless, the "after-edit-linting" uses the same approach: backup the file before linting to avoid data corruption. We should keep our "before-edit-linting" consistent. Why no new unittest that reproduces the issue? Well, as I have mentioned earlier, flake8 has no side-effect, so technically it's not a bug but a flaw. Therefore, there's no way to write a test that reproduces the issue.

tobitege · 2024-09-09T04:17:46Z

Great job on this tricky enhancement!
It looks good to me but I'll defer to @xingyaoww for approval as I suppose this will be sent to swe-bench?

tobitege · 2024-09-16T06:46:23Z

Great work! Since the tests run fine, I'd say LGTM.
Will run several Aider bench tests on this branch here and see if any related issue comes up -or- these instances come up successful now (compared to earlier tries).

xingyaoww · 2024-09-16T12:21:40Z

sounds good! I've finally got some of the eval pipeline working properly! Will start an eval on this PR today

tobitege · 2024-09-16T12:28:37Z

Just fyi, I did 15 different Aider bench instances and don't see new issues from this PR. 👍

xingyaoww · 2024-09-18T16:19:23Z

Weird enough, this PR actually brings a lot of degradation.. The current main is about 79 resolved, but this PR gets ~60

tobitege · 2024-09-18T16:47:01Z

Why do you get those "mv: cannot stat" errors? 🤔

xingyaoww · 2024-09-18T16:50:47Z

@tobitege those are un-related eval scripts things 🤣

tobitege · 2024-10-08T18:26:36Z

@li-boxuan please buzz us again once this is ready

li-boxuan added 2 commits August 18, 2024 20:48

Rewrite tests, including a new test that showcases a bug

43ff8fb

Implement logic

49942b3

li-boxuan mentioned this pull request Sep 1, 2024

file_ops: Use tmp file for original linting #3681

Merged

li-boxuan added 7 commits September 7, 2024 00:10

Merge branch 'main' into 3412/edit-linting

159dd2d

WIP: Fix test

39e819b

Complete unit tests

736e864

Refine TODO comment

68f2900

Move code around

88ec704

Fix tests

e24eb0f

Merge remote-tracking branch 'upstream/main' into 3412/edit-linting

6f5ebb6

li-boxuan marked this pull request as ready for review September 9, 2024 03:50

li-boxuan requested review from tobitege and xingyaoww September 9, 2024 04:02

li-boxuan added the agent framework Strategies for prompting, agent, etc label Sep 9, 2024

Merge branch 'main' into 3412/edit-linting

2a052f4

neubig added the eval-this label Sep 13, 2024

Merge branch 'main' into 3412/edit-linting

a0628be

Merge branch 'main' into 3412/edit-linting

c3b3795

li-boxuan mentioned this pull request Sep 27, 2024

refactor: standardize linter output data structure and interface #4077

Merged

tobitege removed request for tobitege and xingyaoww October 8, 2024 18:25

li-boxuan marked this pull request as draft October 9, 2024 04:19

li-boxuan closed this Oct 12, 2024

li-boxuan mentioned this pull request Oct 15, 2024

[agent] LLM-based editing #3985

Merged

xingyaoww added a commit that referenced this pull request Oct 15, 2024

add two tests from #3649

b6858d0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Edit linting: use line mapping to rule out irrelevant errors #3649

Edit linting: use line mapping to rule out irrelevant errors #3649

li-boxuan commented Aug 29, 2024 •

edited

Loading

tobitege commented Sep 9, 2024

tobitege commented Sep 16, 2024

xingyaoww commented Sep 16, 2024

tobitege commented Sep 16, 2024

xingyaoww commented Sep 18, 2024

tobitege commented Sep 18, 2024

xingyaoww commented Sep 18, 2024

tobitege commented Oct 8, 2024

Edit linting: use line mapping to rule out irrelevant errors #3649

Edit linting: use line mapping to rule out irrelevant errors #3649

Conversation

li-boxuan commented Aug 29, 2024 • edited Loading

tobitege commented Sep 9, 2024

tobitege commented Sep 16, 2024

xingyaoww commented Sep 16, 2024

tobitege commented Sep 16, 2024

xingyaoww commented Sep 18, 2024

tobitege commented Sep 18, 2024

xingyaoww commented Sep 18, 2024

tobitege commented Oct 8, 2024

li-boxuan commented Aug 29, 2024 •

edited

Loading