thesunday committed on
Commit b142b88 · 1 Parent(s): 7d92ce6

Update model card

Files changed (1)
  1. README.md +37 -0
README.md CHANGED
@@ -1,3 +1,40 @@
  ---
  license: apache-2.0
+ language:
+ - en
+ tags:
+ - merge
  ---
+
+ # Model Description
+ This is an experiment comparing two methods of merging two models: DARE TIES versus SLERP 🦙
+
+ We are mainly interested in comparing against [Weyaxi/OpenHermes-2.5-neural-chat-v3-3-Slerp](https://huggingface.co/Weyaxi/OpenHermes-2.5-neural-chat-v3-3-Slerp)
+
+ The two models involved in the merge are as follows:
+ 1. [teknium/OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B)
+ 2. [Intel/neural-chat-7b-v3-3](https://huggingface.co/Intel/neural-chat-7b-v3-3)
+
+ - base model: [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1)
+
+ The YAML config file for the merge is:
+
22
+ ```yaml
23
+ models:
24
+ - model: mistralai/Mistral-7B-v0.1
25
+ # no parameters necessary for base model
26
+ - model: teknium/OpenHermes-2.5-Mistral-7B
27
+ parameters:
28
+ weight: 0.5
29
+ density: 0.5
30
+ - model: Intel/neural-chat-7b-v3-3
31
+ parameters:
32
+ weight: 0.5
33
+ density: 0.5
34
+ merge_method: dare_ties
35
+ base_model: mistralai/Mistral-7B-v0.1
36
+ parameters:
37
+ int8_mask: true
38
+ dtype: bfloat16
39
+
40
+ ```
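
To make the `weight` and `density` parameters in the config above more concrete, here is a minimal sketch of the general DARE TIES idea on toy Python lists. It is not the merge tool's implementation: the function names (`dare_prune`, `elect_signs`, `dare_ties_merge`) are hypothetical, the "models" are four-element lists rather than tensors, and it assumes a plain weighted sum with no normalization. Each fine-tuned model is reduced to its difference from the base, a random fraction `1 - density` of that difference is dropped and the rest is rescaled by `1 / density`, and only the differences that agree with the elected per-parameter sign are added back to the base.

```python
import random


def dare_prune(delta, density, rng):
    """DARE step: keep each element with probability `density`,
    rescaling survivors by 1/density so the expected value is unchanged."""
    return [d / density if rng.random() < density else 0.0 for d in delta]


def elect_signs(deltas, weights):
    """TIES step: per-parameter sign (+1, -1 or 0) of the weighted delta sum."""
    signs = []
    for column in zip(*deltas):
        total = sum(w * d for w, d in zip(weights, column))
        signs.append((total > 0) - (total < 0))
    return signs


def dare_ties_merge(base, finetuned, weights, densities, seed=0):
    """Toy merge (assumption: weighted sum without normalization)."""
    rng = random.Random(seed)
    # Task vectors: difference of each fine-tuned model from the base.
    deltas = [
        dare_prune([f - b for f, b in zip(model, base)], density, rng)
        for model, density in zip(finetuned, densities)
    ]
    signs = elect_signs(deltas, weights)
    merged = []
    for i, b in enumerate(base):
        # Only deltas whose sign matches the elected sign contribute.
        agreeing = sum(
            w * delta[i]
            for w, delta in zip(weights, deltas)
            if delta[i] * signs[i] > 0
        )
        merged.append(b + agreeing)
    return merged


# Toy stand-ins for the base model and the two fine-tuned models.
base = [0.0, 1.0, -1.0, 0.5]
model_a = [0.2, 1.4, -1.0, 0.1]
model_b = [0.4, 0.8, -0.6, 0.3]

merged = dare_ties_merge(
    base, [model_a, model_b], weights=[0.5, 0.5], densities=[0.5, 0.5]
)
print(merged)
```

With `density` set to 1.0 nothing is dropped and the sketch reduces to a sign-filtered weighted sum of the two differences; lowering `density` makes each difference sparser, which is intended to reduce interference between the two fine-tunes.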