monsterapi
/

gpt2_137m_DolphinCoder

Model card Files Files and versions Community

souvik0306 commited on Dec 27, 2023

Commit

50cfcb6

·

1 Parent(s): a558223

Update README.md

Files changed (1) hide show

README.md +9 -17

README.md CHANGED Viewed

@@ -5,7 +5,7 @@ tags:
 - instruct
 - gpt2
 datasets:
-- HuggingFaceH4/no_robots
 base_model: gpt2
 license: apache-2.0
 ---
@@ -14,38 +14,30 @@ license: apache-2.0
 **Model Used:** gpt2
-**Dataset:** HuggingFaceH4/no_robots
 #### Dataset Insights:
-[No Robots](https://huggingface.co/datasets/HuggingFaceH4/no_robots) is a high-quality dataset of 10,000 instructions and demonstrations created by skilled human annotators. This data can be used for supervised fine-tuning (SFT) to make language models follow instructions better.
 #### Finetuning Details:
-With the utilization of [MonsterAPI](https://monsterapi.ai)'s [LLM finetuner](https://docs.monsterapi.ai/fine-tune-a-large-language-model-llm), this finetuning:
 - Was achieved with great cost-effectiveness.
-- Completed in a total duration of 3mins 40s for 1 epoch using an A6000 48GB GPU.
-- Costed `$0.101` for the entire epoch.
 #### Hyperparameters & Additional Details:
 - **Epochs:** 1
-- **Cost Per Epoch:** $0.101
-- **Total Finetuning Cost:** $0.101
 - **Model Path:** gpt2
 - **Learning Rate:** 0.0002
 - **Data Split:** 100% train
-- **Gradient Accumulation Steps:** 4
 - **lora r:** 32
 - **lora alpha:** 64
-#### Prompt Structure
-```
-<|system|> <|endoftext|> <|user|> [USER PROMPT]<|endoftext|> <|assistant|> [ASSISTANT ANSWER] <|endoftext|>
-```
-#### Training loss :
-![training loss](https://cdn-uploads.huggingface.co/production/uploads/63ba46aa0a9866b28cb19a14/9bgb518kFwtDsFtrHzmTu.png)
 license: apache-2.0

 - instruct
 - gpt2
 datasets:
+- cognitivecomputations/dolphin-coder
 base_model: gpt2
 license: apache-2.0
 ---
 **Model Used:** gpt2
+**Dataset:** cognitivecomputations/dolphin-coder
 #### Dataset Insights:
+[Dolphin-Coder](https://huggingface.co/datasets/cognitivecomputations/dolphin-coder) Dolphin-Coder dataset – a high-quality collection of 100,000+ coding questions and responses. It's perfect for supervised fine-tuning (SFT), and teaching language models to improve on coding-based tasks.
 #### Finetuning Details:
+With the utilization of [MonsterAPI](https://monsterapi.ai)'s [no-code LLM finetuner](https://monsterapi.ai/finetuning), this finetuning:
 - Was achieved with great cost-effectiveness.
+- Completed in a total duration of 58mins 48s for 1 epochs using an A6000 48GB GPU.
+- Costed `$1.96` for the entire 1 epoch.
 #### Hyperparameters & Additional Details:
 - **Epochs:** 1
+- **Total Finetuning Cost:** $1.96
 - **Model Path:** gpt2
 - **Learning Rate:** 0.0002
 - **Data Split:** 100% train
+- **Gradient Accumulation Steps:** 128
 - **lora r:** 32
 - **lora alpha:** 64
+---
 license: apache-2.0