Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,33 @@
|
|
1 |
---
|
2 |
license: apache-2.0
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
+
datasets:
|
4 |
+
- irlab-udc/alpaca_data_galician
|
5 |
+
language:
|
6 |
+
- gl
|
7 |
+
- en
|
8 |
---
|
9 |
+
|
10 |
+
# Galician Fine-Tuned LLM Model
|
11 |
+
|
12 |
+
This repository contains a large language model (LLM) fine-tuned using the LLaMA Factory library and the Finisterrae III supercomputer at CESGA. The base model used for fine-tuning was Meta's `LLaMA 3`.
|
13 |
+
|
14 |
+
## Model Description
|
15 |
+
|
16 |
+
This LLM model has been specifically fine-tuned to understand and generate text in Galician. It was fine-tuned using a modified version of the [irlab-udc/alpaca_data_galician](https://huggingface.co/datasets/irlab-udc/alpaca_data_galician) dataset, enriched with synthetic data to enhance its text generation and comprehension capabilities in specific contexts.
|
17 |
+
|
18 |
+
### Technical Details
|
19 |
+
|
20 |
+
- **Base Model**: Meta's LLaMA 3
|
21 |
+
- **Fine-Tuning Platform**: LLaMA Factory
|
22 |
+
- **Infrastructure**: Finisterrae III, CESGA
|
23 |
+
- **Dataset**: [irlab-udc/alpaca_data_galician](https://huggingface.co/datasets/irlab-udc/alpaca_data_galician) (with modifications)
|
24 |
+
- **Fine-Tuning Objective**: To improve text comprehension and generation in Galician.
|
25 |
+
|
26 |
+
## How to Use the Model
|
27 |
+
|
28 |
+
To use this model, follow the example code provided below. Ensure you have the necessary libraries installed (e.g., Hugging Face's `transformers`).
|
29 |
+
|
30 |
+
### Installation
|
31 |
+
|
32 |
+
```bash
|
33 |
+
pip install transformers
|