app.py
CHANGED
@@ -99,11 +99,11 @@ def main():
 3. **Important Notes**:
 - For **mmlu_results**:
 - The following categories must be included in the mmlu_results for the model to be considered valid:
-
+- ```["Biology", "Business", "Chemistry", "Computer Science", "Economics", "Engineering", "Health", "History", "Law", "Math", "Other", "Philosophy", "Physics", "Psychology", "Average"] ```
 - If any of these categories are missing, the model will not be added to the evaluation.
 - For **unified_exam_results**:
 - The following categories must be included in the unified_exam_results for the model to be considered valid:
-
+- ```["Average", "Armenian language and literature", "Armenian history", "Mathematics"] ```
 - If any of these categories are missing, the model will not be added to the evaluation.
 4. **Submit your model**:
 - Add the `Arm-LLM-Bench` tag and the `result.json` file to your model card.
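The category requirement in the notes can be checked locally before submitting. The following is a minimal sketch, not the leaderboard's actual validation code: it assumes `result.json` is a JSON object whose `mmlu_results` and `unified_exam_results` keys each map category names to scores (the diff does not show the file's layout), and the names `REQUIRED` and `missing_categories` are hypothetical.

```python
# Required categories, copied from the notes in app.py.
REQUIRED = {
    "mmlu_results": [
        "Biology", "Business", "Chemistry", "Computer Science", "Economics",
        "Engineering", "Health", "History", "Law", "Math", "Other",
        "Philosophy", "Physics", "Psychology", "Average",
    ],
    "unified_exam_results": [
        "Average", "Armenian language and literature",
        "Armenian history", "Mathematics",
    ],
}


def missing_categories(result: dict) -> dict:
    """Return {section: [missing category names]} for each incomplete section."""
    missing = {}
    for section, required in REQUIRED.items():
        # ASSUMPTION: each section maps category name -> score.
        # A section that is absent entirely counts as having no categories.
        present = result.get(section) or {}
        absent = [name for name in required if name not in present]
        if absent:
            missing[section] = absent
    return missing


if __name__ == "__main__":
    # In-memory example: "Law" is missing and unified_exam_results is absent.
    example = {
        "mmlu_results": {
            name: 0.0 for name in REQUIRED["mmlu_results"] if name != "Law"
        }
    }
    for section, names in missing_categories(example).items():
        print(f"{section}: missing {names}")
```

An empty return value means every required category is present in both sections. To check a real file, load it with `json.load` and pass the resulting dict to `missing_categories`.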