Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bump swebench version #3216

Merged
merged 1 commit into from
Aug 2, 2024
Merged

Bump swebench version #3216

merged 1 commit into from
Aug 2, 2024

Conversation

xingyaoww
Copy link
Contributor

What is the problem that this fixes or functionality that this introduces? Does it fix any open issues?


Give a summary of what the PR does, explaining any non-trivial design decisions

Bump the SWE-Bench version to the latest commit. It increases our latest CodeAct v1.8 score from 78 to 80 resolved (26.67%)!


Other references

@xingyaoww xingyaoww marked this pull request as ready for review August 1, 2024 21:56
@xingyaoww xingyaoww merged commit 105f0ff into main Aug 2, 2024
2 checks passed
@xingyaoww xingyaoww deleted the xw/bump-swebench branch August 2, 2024 02:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants