John committed on
Commit
4913580
·
verified ·
1 Parent(s): acaaa6a

Update README.md

Files changed (1)
  1. README.md +33 -1
README.md CHANGED
@@ -7,4 +7,36 @@ sdk: static
7
  pinned: false
8
  ---
9
 
10
- Edit this `README.md` markdown file to author your organization card.
10
+ Curating the best open reasoning datasets. A [Bespoke Labs](https://bespokelabs.ai/) and [DataComp](https://www.datacomp.ai/) community effort.
11
+
12
+ Our first goal is to curate a reasoning dataset to train state-of-the-art small reasoning models that surpass [DeepSeek-R1-Distill-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) and [DeepSeek-R1-Distill-7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B) on math and code reasoning benchmarks.
13
+
14
+ ## Latest Results
15
+
16
+ <Table
17
+ data={{
18
+ headers: [
19
+ "Model",
20
+ "AIME24",
21
+ "MATH500",
22
+ "GPQA-D",
23
+ "LCB Easy",
24
+ "LCB Med",
25
+ "LCB Hard"
26
+ ],
27
+ rows: [
28
+ ["Open-Thinker-7B", "26.7", "86", "40.4", "74.2", "31.1", "3.3"],
29
+ ["Bespoke-Stratos-7B", "20.0", "82.0", "37.8", "71.4", "25.2", "1.6"],
30
+ ["DeepSeek-R1-Distill-Qwen-7B", "43.3", "89.4", "44.9", "81.3", "42.2", "2.4"],
31
+ ["gpt-4o-0513", "9.3", "74.6", "49.9", "-", "-", "-"],
32
+ ["o1-mini", "63.6", "90.0", "60.0", "-", "-", "-"]
33
+ ]
34
+ }}
35
+ />
36
+
37
+
38
+ ## About us
39
+
40
+ We are a team of researchers and engineers from Bespoke Labs, Stanford, University of California, Berkeley, University of Washington, Juelich Supercomputing Center (JSC), LAION, UCLA, UNC Chapel Hill, and Toyota Research Institute, united around building the best datasets (and thus the best models). See our previous work at [datacomp.ai](https://www.datacomp.ai/) and [mlfoundations](https://github.com/mlfoundations).
41
+
42
+ Open Thoughts is supported by [Bespoke Labs](https://www.bespokelabs.ai/), [Lambda Labs](https://lambdalabs.com/), [NSF IFML](https://www.ifml.institute/), [Juelich Supercomputing Center](https://www.fz-juelich.de/en/ias/jsc), and [Toyota Research Institute](https://www.tri.global/).