Update README.md
language:
- en
---

# Llama3-8B Lora adapter for Galician language

This repository houses a specialized LoRA (Low-Rank Adaptation) adapter designed to fine-tune Meta's LLaMA 3 8B Instruct model for applications involving the Galician language. The adapter efficiently adapts the pre-trained model, which was initially trained on a broad range of data and languages, to better understand and generate text in Galician.
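The low-rank idea behind this efficiency can be sketched numerically: instead of updating a layer's full weight matrix `W`, LoRA trains two small matrices `A` and `B` and adds their scaled product to the frozen `W`. The sizes below are illustrative only, not the actual LLaMA 3 dimensions:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative layer sizes (NOT the real LLaMA 3 dimensions): r << d.
d_out, d_in, r = 64, 64, 8

W = rng.standard_normal((d_out, d_in))   # frozen pre-trained weight

# LoRA trains only A and B; their product is the weight update.
A = rng.standard_normal((r, d_in)) * 0.01
B = np.zeros((d_out, r))                 # B starts at zero: training begins at W
alpha = 16                               # LoRA scaling hyperparameter

W_adapted = W + (alpha / r) * (B @ A)

full_params = W.size                     # 4096 values to train in full fine-tuning
lora_params = A.size + B.size            # 1024 values to train with LoRA
```

With rank 8, the adapter trains 1024 values per layer instead of 4096, which is why LoRA fine-tuning fits on far smaller hardware than full fine-tuning.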
## Adapter Description

This LoRA adapter has been fine-tuned specifically to understand and generate text in Galician. It was refined using a modified version of the [irlab-udc/alpaca_data_galician](https://huggingface.co/datasets/irlab-udc/alpaca_data_galician) dataset, enriched with synthetic data to enhance its text generation and comprehension capabilities in specific contexts.
- **Base Model**: [Unsloth's 4-bit build of Meta's LLaMA 3 8B Instruct](https://huggingface.co/unsloth/llama-3-8b-Instruct-bnb-4bit)
- **Fine-Tuning Platform**: LLaMA Factory
- **Infrastructure**: Finisterrae III supercomputer, CESGA (Galicia, Spain)
- **Dataset**: [irlab-udc/alpaca_data_galician](https://huggingface.co/datasets/irlab-udc/alpaca_data_galician) (with modifications)
- **Fine-Tuning Objective**: To improve text comprehension and generation in Galician.
```
User: Cantos habitantes ten Galicia?
Assistant: Segundo as últimas estimacións, Galicia ten uns 2,8 millóns de habitantes.
```

## How to Use the Adapter
To use this adapter, follow the example code provided below. Ensure you have the necessary libraries installed (e.g., Hugging Face's `transformers`).

### Installation

Download the adapter from Hugging Face:

```bash
git clone https://huggingface.co/abrahammg/Llama3-8B-Galician-Chat-Lora
```
Install dependencies:

```bash
pip install transformers bitsandbytes "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git" llmtuner xformers
```
### Run the adapter

Create a Python script (e.g. `run_model.py`):

```python
from llmtuner import ChatModel
from llmtuner.extras.misc import torch_gc

# ... (the ChatModel initialization with the base model and this adapter
# is not shown in this diff excerpt) ...

messages = []
while True:
    query = input("\nUser: ")
    if query.strip() == "exit":
        break
    if query.strip() == "clear":
        messages = []
        torch_gc()
        print("History has been removed.")
        continue

    messages.append({"role": "user", "content": query})
    print("Assistant: ", end="", flush=True)
    response = ""
    for new_text in chat_model.stream_chat(messages):
        print(new_text, end="", flush=True)
        response += new_text
    print()
    messages.append({"role": "assistant", "content": response})

torch_gc()
```
and run it:

```bash
python run_model.py
```
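The loop's history handling is independent of the model: `stream_chat` only needs to yield text chunks. A stub in place of llmtuner's `ChatModel` (the `FakeChatModel` below is purely illustrative, not part of the library) shows how the `messages` list grows by one user turn and one assistant turn per exchange:

```python
from typing import Dict, Iterator, List

class FakeChatModel:
    """Stand-in for llmtuner's ChatModel: streams a canned reply in chunks."""
    def stream_chat(self, messages: List[Dict[str, str]]) -> Iterator[str]:
        reply = f"echo: {messages[-1]['content']}"
        for i in range(0, len(reply), 4):     # yield 4-character chunks
            yield reply[i:i + 4]

chat_model = FakeChatModel()
messages = []

# Same pattern as run_model.py: append the user turn, accumulate the
# streamed answer, then append the assistant turn.
for query in ["ola", "que tal"]:
    messages.append({"role": "user", "content": query})
    response = ""
    for new_text in chat_model.stream_chat(messages):
        response += new_text
    messages.append({"role": "assistant", "content": response})

print(len(messages))           # 4: two user turns and two assistant turns
print(messages[1]["content"])  # echo: ola
```

Because each assistant turn is appended back into `messages`, later calls to `stream_chat` see the whole conversation, which is what makes the `clear` command in the script above necessary for resetting context.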
## Citation

```markdown
…
```

- [meta-llama/llama3](https://github.com/meta-llama/llama3)
- [hiyouga/LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory)
- [irlab-udc/alpaca_data_galician](https://huggingface.co/datasets/irlab-udc/alpaca_data_galician)