vmkhlv

Vladislav Mikhailov vmkhlv

Pinned Loading

EleutherAI/lm-evaluation-harness EleutherAI/lm-evaluation-harness Public

A framework for few-shot evaluation of language models.

Python 10k 2.7k
ltgoslo/noreval ltgoslo/noreval Public

A Norwegian Language Understanding and Generation Evaluation Benchmark

5 1
ai-forever/mgpt ai-forever/mgpt Public

Multilingual Generative Pretrained Model

Jupyter Notebook 207 22
RussianNLP/RussianSuperGLUE RussianNLP/RussianSuperGLUE Public

Russian SuperGLUE benchmark

Jupyter Notebook 109 14
PragmaticsLab/vote_and_rank PragmaticsLab/vote_and_rank Public

Novel aggregation methods for multi-task NLP benchmarking

Jupyter Notebook 8 3
Toloka/beemo Toloka/beemo Public

Benchmark for fine-grained machine-generated text detection. 6.5k texts written by humans, generated by ten open-source instruction-finetuned LLMs and edited by expert annotators.

6 1