inter-rater-agreement

Here are 11 public repositories matching this topic...

DISCOSUMO / evaluation

Evaluation and agreement scripts for the DISCOSUMO project. Each evaluation script takes both manual annotations and automatic summarization output as input. The formatting of these files is highly project-specific. However, the evaluation functions for precision, recall, ROUGE, Jaccard, Cohen's kappa and Fleiss' kappa may be applicable to other domains too.

evaluation recall precision extractive-summarization forum-threads automatic-summarization evaluation-metrics rouge-n rouge-l jaccard cohens-kappa fleiss-kappa inter-rater-agreement

Updated Feb 10, 2017
Python
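
For readers unfamiliar with the agreement measures named in the description above, here is a minimal sketch of Cohen's kappa for two raters. It is not taken from the DISCOSUMO scripts; the function name `cohens_kappa` and its list-of-labels input format are assumptions made for illustration.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters who labelled the same items.

    Hypothetical helper for illustration; not part of any listed repository.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of items")

    n = len(rater_a)

    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: product of the raters' marginal label frequencies,
    # summed over all labels.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)

    if expected == 1.0:
        # Both raters used one identical label throughout; kappa is formally
        # undefined here, and returning 1.0 is a convention chosen for this sketch.
        return 1.0

    return (observed - expected) / (1.0 - expected)


kappa = cohens_kappa(["yes", "yes", "no", "no"], ["yes", "no", "no", "no"])
print(kappa)
```

In this example the raters agree on three of four items (observed agreement 0.75) while chance agreement is 0.5, so kappa is 0.5.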

eltonyeung / NOAH-Fleiss-kappa-multiraters-inter-rater-agreement

Raters inter-rater agreement

cohens-kappa fleiss-kappa inter-rater-agreement

Updated May 23, 2020
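
The repository name above refers to Fleiss' kappa for multiple raters. As a rough sketch of that measure (not the repository's own code), the function below assumes a hypothetical input format: one row per item, one column per category, each cell holding the number of raters who chose that category, with the same number of raters for every item.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa from an items-by-categories table of rater counts.

    Hypothetical helper for illustration; the input layout is an assumption.
    """
    if not counts:
        raise ValueError("at least one item is required")

    n_items = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2:
        raise ValueError("at least two raters per item are required")
    if any(sum(row) != n_raters for row in counts):
        raise ValueError("every item must be rated by the same number of raters")

    n_categories = len(counts[0])

    # Share of all ratings that fall into each category.
    category_share = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(n_categories)
    ]

    # Per-item agreement: proportion of rater pairs that agree on the item.
    per_item = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]

    mean_agreement = sum(per_item) / n_items
    chance_agreement = sum(p * p for p in category_share)

    if chance_agreement == 1.0:
        # All ratings fall in a single category; kappa is formally undefined,
        # and returning 1.0 is a convention chosen for this sketch.
        return 1.0

    return (mean_agreement - chance_agreement) / (1.0 - chance_agreement)


# Three raters, two items, two categories, with full agreement on each item.
print(fleiss_kappa([[3, 0], [0, 3]]))
```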

vinid / quica

quica is a tool to run inter-coder agreement pipelines in an easy and effective way. Multiple measures are run and the results are collected in a single table that can be easily exported to LaTeX.

python evaluation-metrics inter-rater-agreement evaluation-framework inter-coder-agreement

Updated Nov 9, 2020
Python

theoam / TDArchetypalAnalysis

Replication package for the Archetypal Analysis conducted in the paper "Evaluating the Agreement among Technical Debt Measurement Tools: Building an Empirical Benchmark of Technical Debt Liabilities", accepted at Springer's EMSE Journal.

technical-debt inter-rater-agreement replication-package archetypal-analysis empirical-benchmark

Updated Aug 11, 2020
R

jrrobison1 / rater-metric-calculator

Python tool for calculating inter-rater reliability metrics and generating comprehensive reports for multi-rater datasets. Optionally have an LLM create an interpretation report.

statistics data-analysis inter-rater-agreement markdown-report research-tools metrics-calculator agreement-analysis rater-agreement