Skip to content
Change the repository type filter

All

    Repositories list

    • evalplus

      Public
      Rigourous evaluation of LLM-synthesized code - NeurIPS 2023
      Python
      Apache License 2.0
      1021.2k451Updated Sep 20, 2024Sep 20, 2024
    • repoqa

      Public
      RepoQA: Evaluating Long-Context Code Understanding
      Python
      Apache License 2.0
      39622Updated Sep 16, 2024Sep 16, 2024
    • Apache License 2.0
      0000Updated Aug 6, 2024Aug 6, 2024
    • HTML
      Apache License 2.0
      51000Updated Jul 2, 2024Jul 2, 2024
    • Release repository for HumanEval+ data
      Python
      Apache License 2.0
      0200Updated May 1, 2024May 1, 2024
    • Apache License 2.0
      0000Updated Apr 23, 2024Apr 23, 2024
    • Release repository for MBPP+ data
      Python
      Apache License 2.0
      0000Updated Apr 17, 2024Apr 17, 2024
    • Cirron

      Public
      Cirron measures how many CPU instructions and system calls a piece of Python code executes.
      C
      4000Updated Feb 18, 2024Feb 18, 2024