Spaces:

Metric-AI
/

ArmBench-LLM

Running

Bagratuni commited on Mar 12

Commit

be5a444

1 Parent(s): ba8086a

commit

Files changed (1) hide show

app.py CHANGED Viewed

@@ -99,11 +99,11 @@ def main():
                     3. **Important Notes**:
                         - For **mmlu_results**:
                             - The following categories must be included in the mmlu_results for the model to be considered valid:
-                                - "Biology", "Business", "Chemistry", "Computer Science", "Economics", "Engineering", "Health", "History", "Law", "Math", "Other", "Philosophy", "Physics", "Psychology", "Average"
                             - If any of these categories are missing, the model will not be added to the evaluation.
                         - For **unified_exam_results**:
                             - The following categories must be included in the unified_exam_results for the model to be considered valid:
-                                - "Average", "Armenian language and literature", "Armenian history", "Mathematics"
                             - If any of these categories are missing, the model will not be added to the evaluation.
                     4. **Submit your model**:
                         - Add the `Arm-LLM-Bench` tag and the `result.json` file to your model card.

                     3. **Important Notes**:
                         - For **mmlu_results**:
                             - The following categories must be included in the mmlu_results for the model to be considered valid:
+                            - ```["Biology", "Business", "Chemistry", "Computer Science", "Economics", "Engineering", "Health", "History", "Law", "Math", "Other", "Philosophy", "Physics", "Psychology", "Average"] ```
                             - If any of these categories are missing, the model will not be added to the evaluation.
                         - For **unified_exam_results**:
                             - The following categories must be included in the unified_exam_results for the model to be considered valid:
+                            - ```["Average", "Armenian language and literature", "Armenian history", "Mathematics"] ```
                             - If any of these categories are missing, the model will not be added to the evaluation.
                     4. **Submit your model**:
                         - Add the `Arm-LLM-Bench` tag and the `result.json` file to your model card.