File config.json lists the configuration parameters used in llm-ranking.sh:
seed
: Seed used for random sampling.iterations
: number of times each experiment is runhuman_file
: dataset containing pairwise comparisons by humansllm_files
: list of datasets containing pairwise comparisons by strong LLMs (one for each)experiments_base_dir
: folder where the output will be stored.judges
: list of names of the strong LLMs (same order as their corresponding files inllm_files
)n
: Number of comparisons to subsample fromhuman_file
.alpha
: error probability parameterignore_ties
: Default 0. If 1, ignore comparisons where the verdict is a tie.methods
: list of methods to construct rank-sets, amongbaseline
,human only
,llm
,ppr
.models
: list of models to be ranked. If[]
, all models inhuman_file
are ranked.
The configuration parameters for synthetic.sh are inside the script.