Beyond User Self-Reported Likert Scale Ratings: A Comparison Model for Automatic Dialog Evaluation (ACL 2020)
-
Updated
Feb 14, 2022 - Python
Beyond User Self-Reported Likert Scale Ratings: A Comparison Model for Automatic Dialog Evaluation (ACL 2020)
HERALD: An Annotation Efficient Method to Train User Engagement Predictors in Dialogs (ACL 2021)
Code and Data for our Findings of ACL 2021 paper titled 'Improving Automated Evaluation of Open Domain Dialog via Diverse Reference Augmentation. Varun Gangal *, Harsh Jhamtani *, Eduard Hovy, Taylor Berg-Kirkpatrick'
Add a description, image, and links to the dialog-evaluation topic page so that developers can more easily learn about it.
To associate your repository with the dialog-evaluation topic, visit your repo's landing page and select "manage topics."