tyfeng1997
/

Qwen2.5-1.5B-Open-R1-Distill

@@ -19,18 +19,40 @@ It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
 ```python
-from transformers import pipeline
-from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline
-model_id = "tyfeng1997/Qwen2.5-1.5B-Open-R1-Distill"
 model = AutoModelForCausalLM.from_pretrained(model_id, device_map="cuda")
 tokenizer = AutoTokenizer.from_pretrained(model_id)
-generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
-output = generator([
-   {"role":"system","content":"""Your role as an assistant involves thoroughly exploring questions through a systematic long thinking process before providing the final precise and accurate solutions. This requires engaging in a comprehensive cycle of analysis, summarizing, exploration, reassessment, reflection, backtracing, and iteration to develop well-considered thinking process. Please structure your response into two main sections: Thought and Solution. In the Thought section, detail your reasoning process using the specified format: <|begin_of_thought|> {thought with steps separated with '\n\n'} <|end_of_thought|> Each step should include detailed considerations such as analisying questions, summarizing relevant findings, brainstorming new ideas, verifying the accuracy of the current steps, refining any errors, and revisiting previous steps. In the Solution section, based on various attempts, explorations, and reflections from the Thought section, systematically present the final solution that you deem correct. The solution should remain a logical, accurate, concise expression style and detail necessary step needed to reach the conclusion, formatted as follows: <|begin_of_solution|> {final formatted, precise, and clear solution} <|end_of_solution|> Now, try to solve the following question through the above guidelines:"""},
-    {"role": "user", "content": """A regular hexagon can be divided into six equilateral triangles. If the perimeter of one of the triangles is 21 inches, what is the perimeter, in inches, of the regular hexagon?"""}
-    ], max_new_tokens=10000, return_full_text=False)[0]
-print(output["generated_text"])
 ```
 #### output
 ``` text

 ## Quick start
 ```python
+from transformers import AutoTokenizer, AutoModelForCausalLM
+model_id = "tyfeng1997/Qwen2.5-1.5B-Open-R1-Distill" #instruct
 model = AutoModelForCausalLM.from_pretrained(model_id, device_map="cuda")
 tokenizer = AutoTokenizer.from_pretrained(model_id)
+# Prepare the messages
+messages = [
+    {
+        "role": "system",
+        "content": """Your role as an assistant involves thoroughly exploring questions through a systematic long thinking process before providing the final precise and accurate solutions. This requires engaging in a comprehensive cycle of analysis, summarizing, exploration, reassessment, reflection, backtracing, and iteration to develop well-considered thinking process. Please structure your response into two main sections: Thought and Solution. In the Thought section, detail your reasoning process using the specified format: <|begin_of_thought|> {thought with steps separated with '\n\n'} <|end_of_thought|> Each step should include detailed considerations such as analisying questions, summarizing relevant findings, brainstorming new ideas, verifying the accuracy of the current steps, refining any errors, and revisiting previous steps. In the Solution section, based on various attempts, explorations, and reflections from the Thought section, systematically present the final solution that you deem correct. The solution should remain a logical, accurate, concise expression style and detail necessary step needed to reach the conclusion, formatted as follows: <|begin_of_solution|> {final formatted, precise, and clear solution} <|end_of_solution|> Now, try to solve the following question through the above guidelines:"""
+    },
+    {
+        "role": "user",
+        "content": """A regular hexagon can be divided into six equilateral triangles. If the perimeter of one of the triangles is 21 inches, what is the perimeter, in inches, of the regular hexagon?"""
+    }
+]
+# Apply chat template
+prompt = tokenizer.apply_chat_template(messages, tokenize=False)
+# Tokenize the prompt
+inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
+# Generate
+outputs = model.generate(
+    inputs.input_ids,
+    max_new_tokens=10000,
+    pad_token_id=tokenizer.pad_token_id,
+    eos_token_id=tokenizer.eos_token_id
+)
+# Decode and print the response
+response = tokenizer.decode(outputs[0], skip_special_tokens=True)
+print(response)
 ```
 #### output
 ``` text