File size: 2,264 Bytes
3da0ba2
 
 
 
 
 
 
 
 
 
 
e17d261
3da0ba2
 
 
 
 
 
 
 
 
 
a31f288
 
e17d261
 
a31f288
 
 
3da0ba2
 
 
 
 
3bdfe41
 
a31f288
3da0ba2
 
 
 
 
 
1ad168f
 
 
 
 
3da0ba2
 
238fbc9
d8dfb72
 
 
 
e17d261
d8dfb72
 
 
 
 
 
 
 
e17d261
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
---
license: mit
library_name: transformers
base_model:
- deepseek-ai/DeepSeek-V3-0324
- deepseek-ai/DeepSeek-R1
pipeline_tag: text-generation
---
# DeepSeek-R1T-Chimera

<div align="center">
<img src="https://354918363417-runtime-assets.s3.eu-central-1.amazonaws.com/company_logo_light.svg"
     alt="TNG Logo" 
     width="400"
     style="display: inline-block; vertical-align: middle;"/>
</div>
<br>
<div align="center">
  <a href="LICENSE" style="margin: 2px;">
    <img alt="License" src="https://img.shields.io/badge/License-MIT-f5de53?&color=f5de53" style="display: inline-block; vertical-align: middle;"/>
  </a>
</div>
<br>
<div align="center">
  <a href="https://x.com/tngtech/status/1916284566127444468" style="margin: 2px;">
    <img alt="Benchmarks" src="R1T-Chimera_Benchmarks_20250427_V1.jpg" style="display: inline-block; vertical-align: middle;"/>
  </a>
</div>


**Model merge of DeepSeek-R1 and DeepSeek-V3 (0324)**

An open weights model combining the intelligence of R1 with the token efficiency of V3.

[Announcement on X](https://x.com/tngtech/status/1916284566127444468) | [LinkedIn post](https://www.linkedin.com/posts/tng-technology-consulting_on-the-weekend-we-released-deepseek-r1t-chimera-activity-7323008947236290560-Cf2m) | [Try it on OpenRouter](https://openrouter.ai/tngtech/deepseek-r1t-chimera:free)


## Model Details

- **Architecture**: DeepSeek-MoE Transformer-based language model
- **Combination Method**: Merged model weights from DeepSeek-R1 and DeepSeek-V3 (0324)
- **Release Date**: 2025-04-27

## Use, Out-of-scope Use, Limitations, Risks, Recommendations et al
Regarding R1T Chimera, we ask you to follow the careful guidelines that Microsoft has created for their "MAI-DS-R1" DeepSeek-based model. 

These guidelines are available [here on Hugging Face](https://huggingface.co/microsoft/MAI-DS-R1).

## Contact

- Email: [email protected]
- X.com: @tngtech

## Citation

```
@misc{tng_technology_consulting_gmbh_2025,
	author       = { TNG Technology Consulting GmbH },
	title        = { DeepSeek-R1T-Chimera },
	year         = 2025,
    month        = {April},
	url          = { https://huggingface.co/tngtech/DeepSeek-R1T-Chimera },
	doi          = { 10.57967/hf/5330 },
	publisher    = { Hugging Face }
}

```