---
base_model:
- meta-llama/Llama-3.1-8B
- Casual-Autopsy/L3-Umbral-Mind-RP-v3.0-8B
- deepseek-ai/DeepSeek-R1-Distill-Llama-8B
- mergekit-community/L3.1-Athena-a-8B
- Sao10K/L3-8B-Lunaris-v1
- mlabonne/NeuralDaredevil-8B-abliterated
- mergekit-community/L3-Boshima-a
- Skywork/Skywork-o1-Open-Llama-3.1-8B
- MathGenie/MathCoder2-Llama-3-8B
- NousResearch/Hermes-3-Llama-3.1-8B
- Hastagaras/Jamet-8B-L3-MK.V-Blackroot
library_name: transformers
tags:
- mergekit
- merge
---
# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details

### Merge Method

This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method, with [mergekit-community/L3.1-Athena-a-8B](https://huggingface.co/mergekit-community/L3.1-Athena-a-8B) as the base model.

### Models Merged

The following models were included in the merge:

* [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B)
* [Casual-Autopsy/L3-Umbral-Mind-RP-v3.0-8B](https://huggingface.co/Casual-Autopsy/L3-Umbral-Mind-RP-v3.0-8B)
* [deepseek-ai/DeepSeek-R1-Distill-Llama-8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B)
* [Sao10K/L3-8B-Lunaris-v1](https://huggingface.co/Sao10K/L3-8B-Lunaris-v1)
* [mlabonne/NeuralDaredevil-8B-abliterated](https://huggingface.co/mlabonne/NeuralDaredevil-8B-abliterated)
* [mergekit-community/L3-Boshima-a](https://huggingface.co/mergekit-community/L3-Boshima-a)
* [Skywork/Skywork-o1-Open-Llama-3.1-8B](https://huggingface.co/Skywork/Skywork-o1-Open-Llama-3.1-8B)
* [MathGenie/MathCoder2-Llama-3-8B](https://huggingface.co/MathGenie/MathCoder2-Llama-3-8B)
* [NousResearch/Hermes-3-Llama-3.1-8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B)
* [Hastagaras/Jamet-8B-L3-MK.V-Blackroot](https://huggingface.co/Hastagaras/Jamet-8B-L3-MK.V-Blackroot)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
out_dtype: bfloat16
merge_method: model_stock
base_model: mergekit-community/L3.1-Athena-a-8B
models:
- model: meta-llama/Llama-3.1-8B
- model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B
- model: Skywork/Skywork-o1-Open-Llama-3.1-8B
- model: MathGenie/MathCoder2-Llama-3-8B
- model: NousResearch/Hermes-3-Llama-3.1-8B
- model: mlabonne/NeuralDaredevil-8B-abliterated
- model: Casual-Autopsy/L3-Umbral-Mind-RP-v3.0-8B
- model: Sao10K/L3-8B-Lunaris-v1
- model: mergekit-community/L3-Boshima-a
- model: Hastagaras/Jamet-8B-L3-MK.V-Blackroot
```
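For reference, a merge like this is normally reproduced by saving the configuration above to a file and running the `mergekit-yaml` CLI. Below is a minimal sketch of the equivalent Python invocation, assuming a recent mergekit release that exposes `MergeConfiguration`, `MergeOptions`, and `run_merge`; the config path and output directory are placeholders.

```python
# Sketch: run the Model Stock merge from Python (paths are placeholders).
# The usual CLI equivalent is: mergekit-yaml config.yaml ./merged-model
import yaml
import torch

from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

# Parse the YAML configuration shown in the Configuration section above.
with open("config.yaml", "r", encoding="utf-8") as fp:
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(fp))

run_merge(
    merge_config,
    out_path="./merged-model",  # output directory for the merged weights
    options=MergeOptions(
        cuda=torch.cuda.is_available(),  # accelerate tensor ops on GPU if present
        copy_tokenizer=True,             # carry the base model's tokenizer over
    ),
)
```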
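Once merged (or downloaded), the model can be loaded with standard `transformers` APIs. A minimal usage sketch follows; `your-username/merge` is a hypothetical repository id standing in for wherever the merged weights are published, and `bfloat16` matches the `out_dtype` in the merge config.

```python
# Sketch: load and sample from the merged model with transformers.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-username/merge"  # placeholder repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches out_dtype: bfloat16 above
    device_map="auto",
)

inputs = tokenizer("Model merging is useful because", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```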