LLMJapan commited on
Commit
9f8a75a
·
verified ·
1 Parent(s): 6d2aa7b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +72 -3
README.md CHANGED
@@ -1,3 +1,72 @@
1
- ---
2
- license: cc-by-nc-4.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ quantized_by: LLMJapan
3
+ pipeline_tag: text-generation
4
+ license: cc-by-nc-4.0
5
+ language:
6
+ - en
7
+ tags:
8
+ - nvidia
9
+ - AceInstruct
10
+ - code
11
+ - math
12
+ - general_domain
13
+ - instruct_model
14
+ base_model: nvidia/AceInstruct-72B
15
+ ---
16
+ ## Exllama v2 Quantizations of AceInstruct-72B by nvidia
17
+
18
+ Using <a href="https://github.com/turboderp/exllamav2/releases/tag/v0.2.8">turboderp's ExLlamaV2 v0.2.8</a> for quantization.
19
+
20
+ Original model: https://huggingface.co/nvidia/AceInstruct-72B
21
+
22
+ Quantization Command Example for creating other bpw quantization
23
+ ```
24
+ cd {your git clone directory}
25
+ python convert.py -i {path to}/AceInstruct-72B -o {path to}/AceInstruct-72B/workingdir -cf {path to}/AceInstruct-72B/AceInstruct-72B-4bpw -b 4.0
26
+ ```
27
+
28
+ ## Prompt format
29
+
30
+ ```
31
+ <|im_start|>system
32
+ {system_prompt}<|im_end|>
33
+ <|im_start|>user
34
+ {prompt}<|im_end|>
35
+ <|im_start|>assistant
36
+ ```
37
+
38
+ ## How to add your system prompt
39
+
40
+ Copy the following json and replace the "You are AceInstruct developed by NVIDIA. You are helpful assistant." sentence with your original system prompt.
41
+ The default tokenizer_config.json does not have system prompt.
42
+
43
+ tokenizer_config.json
44
+ ```
45
+ "chat_template": "{{- '<|im_start|>system\\nYou are AceInstruct developed by NVIDIA. You are helpful assistant.<|im_end|>\\n' }}\n {%- for message in messages %}\n{{- '<|im_start|>' + message.role + '\n' + message.content + '<|im_end|>' + '\n' }}\n{%- endfor %}\n{%- if add_generation_prompt %}\n{{- '<|im_start|>assistant\n' }}\n{%- endif %}\n",
46
+ ```
47
+
48
+ ## File information
49
+
50
+ | quantization type | file size |
51
+ | ----------------------- | ----------: |
52
+ | 4.0bpw | 35.9 GiB |
53
+
54
+ ## Benchmark Results
55
+
56
+ | | Qwen2.5-1.5B-Instruct | AceInstruct-1.5B | Qwen2.5-7B-Instruct | AceInstruct-7B | Qwen2.5-72B-Instruct | AceInstruct-72B |
57
+ | --------- |:-----:|:-----:|:-----:|:-----:|:-----:|:-----:|
58
+ | HumanEval | 61.60 | 73.17 | 84.80 | 85.37 | 86.60 | 89.63 |
59
+ | MBPP | 63.20 | 65.76 | 79.20 | 74.32 | 88.20 | 83.66 |
60
+ | GSM8K | 73.20 | 80.44 | 91.60 | 93.10 | 95.80 | 96.36 |
61
+ | MATH | 55.20 | 60.34 | 75.50 | 76.40 | 83.10 | 84.50 |
62
+ | MMLU | 58.37 | 58.17 | 74.51 | 74.68 | 84.67 | 83.88 |
63
+ | MMLU Pro | 32.40 | 33.78 | 56.30 | 54.50 | 71.10 | 66.10 |
64
+ | Average | 57.33 | 61.94 | 76.99 | 76.40 | 84.91 | 84.02 |
65
+
66
+ ## Credits
67
+
68
+ Thanks to NVIDIA team.
69
+
70
+ ---
71
+ license: cc-by-nc-4.0
72
+ ---