Lyra4-Gutenberg-12B

Sao10K/MN-12B-Lyra-v4 finetuned on jondurbin/gutenberg-dpo-v0.1.
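
A minimal usage sketch with the Hugging Face transformers library; the prompt and sampling settings are illustrative, not tuned recommendations:

```python
# Minimal generation sketch (prompt and sampling values are illustrative).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "nbeerbower/Lyra4-Gutenberg-12B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # weights are published in BF16
    device_map="auto",
)

prompt = "Write the opening paragraph of a gothic short story."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256, do_sample=True, temperature=0.8)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```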

Method

Finetuned with ORPO for 3 epochs using an RTX 3090 + RTX 4060 Ti.

Based on Fine-tune Llama 3 with ORPO.
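
A hedged sketch of the ORPO step using the TRL library; apart from the 3 epochs noted above, the hyperparameters are illustrative assumptions, not the exact values used for this model:

```python
# Sketch of ORPO finetuning with TRL (hyperparameters are assumptions).
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

base_id = "Sao10K/MN-12B-Lyra-v4"
tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16)

# gutenberg-dpo-v0.1 provides prompt / chosen / rejected columns,
# which is the format ORPOTrainer consumes directly.
dataset = load_dataset("jondurbin/gutenberg-dpo-v0.1", split="train")

config = ORPOConfig(
    output_dir="lyra4-gutenberg-orpo",
    num_train_epochs=3,              # matches the 3 epochs noted above
    per_device_train_batch_size=1,   # illustrative
    gradient_accumulation_steps=8,   # illustrative
    learning_rate=5e-6,              # illustrative
    beta=0.1,                        # ORPO odds-ratio weight (assumed)
    max_length=2048,
    max_prompt_length=1024,
    bf16=True,
)

trainer = ORPOTrainer(
    model=model,
    args=config,
    train_dataset=dataset,
    tokenizer=tokenizer,  # renamed to processing_class in newer TRL releases
)
trainer.train()
```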

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric                Value
-------------------   -----
Avg.                  19.63
IFEval (0-shot)       22.12
BBH (3-shot)          34.24
MATH Lvl 5 (4-shot)   11.71
GPQA (0-shot)          9.17
MuSR (0-shot)         11.97
MMLU-PRO (5-shot)     28.57
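
A hedged sketch of running the same benchmark suite locally with EleutherAI's lm-evaluation-harness; the "leaderboard" task group name assumes a recent harness release, and the leaderboard reports normalized scores, so raw harness output will not match the table exactly:

```python
# Rough reproduction sketch with lm-evaluation-harness (task group name assumed).
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=nbeerbower/Lyra4-Gutenberg-12B,dtype=bfloat16",
    tasks=["leaderboard"],  # bundles IFEval, BBH, MATH Lvl 5, GPQA, MuSR, MMLU-PRO
    batch_size=1,
)
print(results["results"])
```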

Model size: 12.2B params (BF16, safetensors)