<html lang="en">
<head>
  <meta charset="UTF-8">
  <title>Model Benchmark: First Grade Math</title>
  <style>
    body {
      font-family: Arial, sans-serif;
      background: #f9f9f9;
      margin: 0;
      padding: 20px;
      text-align: center;
    }
    h1 {
      color: #333;
    }
    img {
      max-width: 90%;
      height: auto;
      border: 1px solid #ccc;
      box-shadow: 0px 0px 8px rgba(0,0,0,0.1);
    }
    .model-info {
      margin-top: 20px;
      font-size: 16px;
    }
    .model-info ul {
      list-style: none;
      padding: 0;
    }
    .model-info li {
      padding: 6px 0;
    }
  </style>
</head>
<body>
  <h1>Model Accuracy on First Grade Math (1000 Questions Each)</h1>
  <img src="download.png" alt="Model Benchmark Chart">
  <div class="model-info">
    <h2>Models Benchmarked</h2>
    <ul>
      <li><strong>FlameF0X/MathGPT2</strong>: 763 correct / 237 incorrect</li>
      <li><strong>FlameF0X/Muffin-2.9b-1C25</strong>: 9 correct / 991 incorrect</li>
      <li><strong>FlameF0X/MuffinFace-2</strong>: 8 correct / 992 incorrect</li>
    </ul>
  </div>
</body>
</html>