Update README.md
README.md CHANGED
```diff
@@ -51,7 +51,7 @@ model-index:
         num_few_shot: 4
     metrics:
     - type: exact_match
-      value:
+      value: 54.23
       name: exact match
     source:
       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Goekdeniz-Guelmez/Josiefied-Qwen2.5-14B-Instruct-abliterated-v4
```
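This fills the previously empty `exact_match` value; 54.23 is the MATH Lvl 5 (4-shot) score that also appears in the results table further down. As a quick sanity check that the metadata parses after the change, here is a sketch using `huggingface_hub`'s model-card API (the repo ID comes from the source URL above; the attribute names assume the current `ModelCard`/`EvalResult` interface):

```python
from huggingface_hub import ModelCard

# Load the card from the Hub and inspect the parsed model-index block.
card = ModelCard.load("Goekdeniz-Guelmez/Josiefied-Qwen2.5-14B-Instruct-abliterated-v4")
for result in card.data.eval_results:
    if result.metric_type == "exact_match":
        # Expect 54.23 once this commit is live.
        print(result.dataset_name, result.metric_value)
```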
````diff
@@ -227,40 +227,6 @@ Use at your own risk!
 
 ---
 
-
-# Qwen2.5-14B-Instruct
-
-## Introduction
-
-Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Qwen2.5 brings the following improvements upon Qwen2:
-
-- Significantly **more knowledge** and has greatly improved capabilities in **coding** and **mathematics**, thanks to our specialized expert models in these domains.
-- Significant improvements in **instruction following**, **generating long texts** (over 8K tokens), **understanding structured data** (e.g, tables), and **generating structured outputs** especially JSON. **More resilient to the diversity of system prompts**, enhancing role-play implementation and condition-setting for chatbots.
-- **Long-context Support** up to 128K tokens and can generate up to 8K tokens.
-- **Multilingual support** for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.
-
-**This repo contains the instruction-tuned 14B Qwen2.5 model**, which has the following features:
-- Type: Causal Language Models
-- Training Stage: Pretraining & Post-training
-- Architecture: transformers with RoPE, SwiGLU, RMSNorm, and Attention QKV bias
-- Number of Parameters: 14.7B
-- Number of Paramaters (Non-Embedding): 13.1B
-- Number of Layers: 48
-- Number of Attention Heads (GQA): 40 for Q and 8 for KV
-- Context Length: Full 131,072 tokens and generation 8192 tokens
-- Please refer to [this section](#processing-long-texts) for detailed instructions on how to deploy Qwen2.5 for handling long texts.
-
-For more details, please refer to our [blog](https://qwenlm.github.io/blog/qwen2.5/), [GitHub](https://github.com/QwenLM/Qwen2.5), and [Documentation](https://qwen.readthedocs.io/en/latest/).
-
-## Requirements
-
-The code of Qwen2.5 has been in the latest Hugging face `transformers` and we advise you to use the latest version of `transformers`.
-
-With `transformers<4.37.0`, you will encounter the following error:
-```
-KeyError: 'qwen2'
-```
-
 ## Quickstart
 
 Here is a code snippet showing how to load the tokenizer and model with `apply_chat_template` and how to generate content.
````
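The snippet the Quickstart refers to lies outside this hunk's context window. For reference, a minimal sketch of the usual `transformers` chat-template pattern (the prompt is illustrative; only the `apply_chat_template` usage is taken from the card):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Goekdeniz-Guelmez/Josiefied-Qwen2.5-14B-Instruct-abliterated-v4"

# Load model and tokenizer; device_map="auto" spreads weights across available GPUs.
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Build a chat prompt using the model's own chat template.
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Give me a short introduction to large language models."},
]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

# Generate, then strip the prompt tokens from the output before decoding.
generated_ids = model.generate(**model_inputs, max_new_tokens=512)
generated_ids = [out[len(inp):] for inp, out in zip(model_inputs.input_ids, generated_ids)]
print(tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0])
```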
```diff
@@ -353,10 +319,10 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
 
 | Metric             |Value|
 |-------------------|----:|
-|Avg.               |
+|Avg.               |42.55|
 |IFEval (0-Shot)    |82.92|
 |BBH (3-Shot)       |48.05|
-|MATH Lvl 5 (4-Shot)|
+|MATH Lvl 5 (4-Shot)|54.23|
 |GPQA (0-shot)      |12.30|
 |MuSR (0-shot)      |13.15|
 |MMLU-PRO (5-shot)  |44.65|
```
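With both cells filled, the table is internally consistent: 42.55 is the unweighted mean of the six benchmark scores (assuming the leaderboard's Avg. is a plain mean, which the numbers bear out):

```python
scores = [82.92, 48.05, 54.23, 12.30, 13.15, 44.65]
print(round(sum(scores) / len(scores), 2))  # 42.55, matching the new Avg. row
```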