The LLM Evaluation Framework
Updated Nov 2, 2024 - Python
[ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models
This repo contains a Streamlit application that provides a user-friendly interface for evaluating large language models (LLMs) using the beyondllm package.
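For orientation, here is a minimal sketch of what such a Streamlit evaluation front-end might look like. The beyondllm calls used below (source.fit, retrieve.auto_retriever, generator.Generate, get_rag_triad_evals) follow that package's documented retrieval-and-evaluation workflow, but they are assumptions for illustration, not code taken from this repo.

```python
# Hypothetical sketch of a Streamlit front-end for LLM evaluation.
# The beyondllm calls below are assumptions based on the package's
# documented RAG-triad workflow; they may not match the repo's code.
import streamlit as st
from beyondllm import source, retrieve, generator

st.title("LLM Evaluation")

uploaded = st.file_uploader("Upload a document to ground the evaluation", type=["pdf"])
question = st.text_input("Question to ask the model")

if st.button("Evaluate") and uploaded and question:
    # Persist the upload so the loader can read it from disk.
    with open(uploaded.name, "wb") as f:
        f.write(uploaded.getbuffer())

    # Build a simple retrieval pipeline over the uploaded file (assumed API).
    data = source.fit(path=uploaded.name, dtype="pdf", chunk_size=512, chunk_overlap=50)
    retriever = retrieve.auto_retriever(data, type="normal", top_k=4)

    # Generate an answer and report RAG-triad style metrics
    # (context relevancy, answer relevancy, groundedness).
    pipeline = generator.Generate(question=question, retriever=retriever)
    st.subheader("Response")
    st.write(pipeline.call())
    st.subheader("Evaluation metrics")
    st.text(pipeline.get_rag_triad_evals())
```

Running `streamlit run app.py` (with the appropriate LLM API key configured for beyondllm) would serve this interface locally.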