Update README.md
Browse files
README.md
CHANGED
@@ -125,6 +125,25 @@ User prompt...
|
|
125 |
|
126 |
In this format, the system section is optional and the conversation can be either single-turn or multi-turn. When applying inference, you always make your input string end with "\<s\>bot" to ask the model generating answers.
|
127 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
128 |
|
129 |
## Quickstart
|
130 |
|
@@ -293,6 +312,26 @@ User prompt...
|
|
293 |
|
294 |
在这个格式中,System提示是可选的(按需设定),支持单轮会话也支持多轮会话。推理时,请确保拼接的prompt字符串以"\<s\>bot\n"结尾,引导模型生成回答。
|
295 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
296 |
## 快速使用
|
297 |
|
298 |
```python
|
|
|
125 |
|
126 |
In this format, the system section is optional and the conversation can be either single-turn or multi-turn. When applying inference, you always make your input string end with "\<s\>bot" to ask the model generating answers.
|
127 |
|
128 |
+
For example, the format used to infer HumanEval is like the following:
|
129 |
+
|
130 |
+
```python
|
131 |
+
<s>human
|
132 |
+
# language: Python
|
133 |
+
from typing import List
|
134 |
+
def separate_paren_groups(paren_string: str) -> List[str]:
|
135 |
+
""" Input to this function is a string containing multiple groups of nested parentheses. Your goal is to
|
136 |
+
separate those group into separate strings and return the list of those.
|
137 |
+
Separate groups are balanced (each open brace is properly closed) and not nested within each other
|
138 |
+
Ignore any spaces in the input string.
|
139 |
+
>>> separate_paren_groups('( ) (( )) (( )( ))')
|
140 |
+
['()', '(())', '(()())']
|
141 |
+
"""
|
142 |
+
<s>bot
|
143 |
+
|
144 |
+
```
|
145 |
+
|
146 |
+
Specifically, we also add the Programming Language Tag (e.g. ```#language: Python``` for Python) used by CodeGeex models.
|
147 |
|
148 |
## Quickstart
|
149 |
|
|
|
312 |
|
313 |
在这个格式中,System提示是可选的(按需设定),支持单轮会话也支持多轮会话。推理时,请确保拼接的prompt字符串以"\<s\>bot\n"结尾,引导模型生成回答。
|
314 |
|
315 |
+
例如,推理HumanEval数据时使用的格式如下所示:
|
316 |
+
|
317 |
+
```python
|
318 |
+
<s>human
|
319 |
+
# language: Python
|
320 |
+
from typing import List
|
321 |
+
def separate_paren_groups(paren_string: str) -> List[str]:
|
322 |
+
""" Input to this function is a string containing multiple groups of nested parentheses. Your goal is to
|
323 |
+
separate those group into separate strings and return the list of those.
|
324 |
+
Separate groups are balanced (each open brace is properly closed) and not nested within each other
|
325 |
+
Ignore any spaces in the input string.
|
326 |
+
>>> separate_paren_groups('( ) (( )) (( )( ))')
|
327 |
+
['()', '(())', '(()())']
|
328 |
+
"""
|
329 |
+
<s>bot
|
330 |
+
|
331 |
+
```
|
332 |
+
|
333 |
+
特别地,我们也使用了CodeGeeX系列模型采用的编程语言区分标签(例如,对于Python语言,我们会使用```#language: Python```)。
|
334 |
+
|
335 |
## 快速使用
|
336 |
|
337 |
```python
|