Update README.md
Browse files
README.md
CHANGED
|
@@ -9,13 +9,12 @@ inference: false
|
|
| 9 |
pipeline_tag: text-generation
|
| 10 |
---
|
| 11 |
|
| 12 |
-
Base Model
|
| 13 |
-
Language
|
| 14 |
-
Training Methodology
|
| 15 |
-
|
| 16 |
-
|
| 17 |
-
|
| 18 |
-
DPO: Direct Preference Optimization for final alignment
|
| 19 |
|
| 20 |
## Running this model
|
| 21 |
More info coming later
|
|
|
|
| 9 |
pipeline_tag: text-generation
|
| 10 |
---
|
| 11 |
|
| 12 |
+
* **Base Model:** [Gemma-3-4b-pt](https://huggingface.co/google/gemma-3-4b-pt)
|
| 13 |
+
* **Language:** Finnish (fi)
|
| 14 |
+
* **Training Methodology:**
|
| 15 |
+
* Step 1: Continued Pretraining (CP) Mix of English, Finnish and Code-switching data
|
| 16 |
+
* Step 2: Supervised Fine-Tuning (SFT) Mostly Finnish
|
| 17 |
+
* Step 3: Direct Preference Optimization (DPO) Mostly Finnish
|
|
|
|
| 18 |
|
| 19 |
## Running this model
|
| 20 |
More info coming later
|