aashish1904 commited on
Commit
ed9e214
1 Parent(s): 4e5aebd

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +179 -0
README.md ADDED
@@ -0,0 +1,179 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+
4
+ license: llama3
5
+ library_name: transformers
6
+ tags:
7
+ - mergekit
8
+ - merge
9
+ base_model:
10
+ - nbeerbower/llama3.1-gutenberg-8B
11
+ - akjindal53244/Llama-3.1-Storm-8B
12
+ - NousResearch/Meta-Llama-3.1-8B
13
+ - nbeerbower/llama3.1-airoboros3.2-QDT-8B
14
+ - Sao10K/Llama-3.1-8B-Stheno-v3.4
15
+ model-index:
16
+ - name: Llama-3.1-8B-Ultra-Instruct
17
+ results:
18
+ - task:
19
+ type: text-generation
20
+ name: Text Generation
21
+ dataset:
22
+ name: IFEval (0-Shot)
23
+ type: HuggingFaceH4/ifeval
24
+ args:
25
+ num_few_shot: 0
26
+ metrics:
27
+ - type: inst_level_strict_acc and prompt_level_strict_acc
28
+ value: 80.81
29
+ name: strict accuracy
30
+ source:
31
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Dampfinchen/Llama-3.1-8B-Ultra-Instruct
32
+ name: Open LLM Leaderboard
33
+ - task:
34
+ type: text-generation
35
+ name: Text Generation
36
+ dataset:
37
+ name: BBH (3-Shot)
38
+ type: BBH
39
+ args:
40
+ num_few_shot: 3
41
+ metrics:
42
+ - type: acc_norm
43
+ value: 32.49
44
+ name: normalized accuracy
45
+ source:
46
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Dampfinchen/Llama-3.1-8B-Ultra-Instruct
47
+ name: Open LLM Leaderboard
48
+ - task:
49
+ type: text-generation
50
+ name: Text Generation
51
+ dataset:
52
+ name: MATH Lvl 5 (4-Shot)
53
+ type: hendrycks/competition_math
54
+ args:
55
+ num_few_shot: 4
56
+ metrics:
57
+ - type: exact_match
58
+ value: 14.95
59
+ name: exact match
60
+ source:
61
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Dampfinchen/Llama-3.1-8B-Ultra-Instruct
62
+ name: Open LLM Leaderboard
63
+ - task:
64
+ type: text-generation
65
+ name: Text Generation
66
+ dataset:
67
+ name: GPQA (0-shot)
68
+ type: Idavidrein/gpqa
69
+ args:
70
+ num_few_shot: 0
71
+ metrics:
72
+ - type: acc_norm
73
+ value: 5.59
74
+ name: acc_norm
75
+ source:
76
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Dampfinchen/Llama-3.1-8B-Ultra-Instruct
77
+ name: Open LLM Leaderboard
78
+ - task:
79
+ type: text-generation
80
+ name: Text Generation
81
+ dataset:
82
+ name: MuSR (0-shot)
83
+ type: TAUR-Lab/MuSR
84
+ args:
85
+ num_few_shot: 0
86
+ metrics:
87
+ - type: acc_norm
88
+ value: 8.61
89
+ name: acc_norm
90
+ source:
91
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Dampfinchen/Llama-3.1-8B-Ultra-Instruct
92
+ name: Open LLM Leaderboard
93
+ - task:
94
+ type: text-generation
95
+ name: Text Generation
96
+ dataset:
97
+ name: MMLU-PRO (5-shot)
98
+ type: TIGER-Lab/MMLU-Pro
99
+ config: main
100
+ split: test
101
+ args:
102
+ num_few_shot: 5
103
+ metrics:
104
+ - type: acc
105
+ value: 31.4
106
+ name: accuracy
107
+ source:
108
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Dampfinchen/Llama-3.1-8B-Ultra-Instruct
109
+ name: Open LLM Leaderboard
110
+
111
+ ---
112
+
113
+ ![](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)
114
+
115
+ # QuantFactory/Llama-3.1-8B-Ultra-Instruct-GGUF
116
+ This is quantized version of [Dampfinchen/Llama-3.1-8B-Ultra-Instruct](https://huggingface.co/Dampfinchen/Llama-3.1-8B-Ultra-Instruct) created using llama.cpp
117
+
118
+ # Original Model Card
119
+
120
+ # merge
121
+
122
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
123
+
124
+ ## Merge Details
125
+ ### Merge Method
126
+
127
+ This model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method using [NousResearch/Meta-Llama-3.1-8B](https://huggingface.co/NousResearch/Meta-Llama-3.1-8B) as a base.
128
+
129
+ ### Models Merged
130
+
131
+ The following models were included in the merge:
132
+ * [nbeerbower/llama3.1-gutenberg-8B](https://huggingface.co/nbeerbower/llama3.1-gutenberg-8B)
133
+ * [akjindal53244/Llama-3.1-Storm-8B](https://huggingface.co/akjindal53244/Llama-3.1-Storm-8B)
134
+ * [nbeerbower/llama3.1-airoboros3.2-QDT-8B](https://huggingface.co/nbeerbower/llama3.1-airoboros3.2-QDT-8B)
135
+ * [Sao10K/Llama-3.1-8B-Stheno-v3.4](https://huggingface.co/Sao10K/Llama-3.1-8B-Stheno-v3.4)
136
+
137
+ ### Configuration
138
+
139
+ The following YAML configuration was used to produce this model:
140
+
141
+ ```yaml
142
+ models:
143
+ - model: Sao10K/Llama-3.1-8B-Stheno-v3.4
144
+ parameters:
145
+ weight: 0.2
146
+ density: 0.5
147
+ - model: akjindal53244/Llama-3.1-Storm-8B
148
+ parameters:
149
+ weight: 0.5
150
+ density: 0.5
151
+ - model: nbeerbower/llama3.1-gutenberg-8B
152
+ parameters:
153
+ weight: 0.3
154
+ density: 0.5
155
+ - model: nbeerbower/llama3.1-airoboros3.2-QDT-8B
156
+ parameters:
157
+ weight: 0.2
158
+ density: 0.5
159
+ merge_method: dare_ties
160
+ base_model: NousResearch/Meta-Llama-3.1-8B
161
+ dtype: bfloat16
162
+ name: Llama-3.1-8B-Ultra-Instruct
163
+ ```
164
+
165
+ Use Llama 3 Instruct prompt template. Use with caution, I'm not responsible for what you do with it. All credits and thanks go to the creators of the fine tunes I've merged. In my own tests and on HF Eval it performs very well for a 8B model and I can recommend it. High quality quants by Bartowski: https://huggingface.co/bartowski/Llama-3.1-8B-Ultra-Instruct-GGUF
166
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
167
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Dampfinchen__Llama-3.1-8B-Ultra-Instruct)
168
+
169
+ | Metric |Value|
170
+ |-------------------|----:|
171
+ |Avg. |28.98|
172
+ |IFEval (0-Shot) |80.81|
173
+ |BBH (3-Shot) |32.49|
174
+ |MATH Lvl 5 (4-Shot)|14.95|
175
+ |GPQA (0-shot) | 5.59|
176
+ |MuSR (0-shot) | 8.61|
177
+ |MMLU-PRO (5-shot) |31.40|
178
+
179
+