RLVF pipeline using parser oracles to align LMs for Icelandic and Danish. GPT-SW3 and Viking-13B trained with Delta-DPO.
Fakhar
Hodfa71
AI & ML interests
None yet
Recent Activity
updated a model about 1 hour ago
Hodfa71/saga-is-356m-kl-sft published a model about 1 hour ago
Hodfa71/saga-is-356m-kl-sft updated a model about 4 hours ago
Hodfa71/saga-da-6b7-delta-dpo-klsft-antihackOrganizations
models 43
Hodfa71/saga-is-356m-kl-sft
0.4B • Updated
Hodfa71/saga-da-6b7-delta-dpo-klsft-antihack
7B • Updated
Hodfa71/saga-is-llama1b-delta-dpo-klsft-antihack
1B • Updated
Hodfa71/saga-is-llama8b-delta-dpo-klsft-antihack
8B • Updated
Hodfa71/saga-is-356m-delta-dpo-nosft-antihack
0.4B • Updated
Hodfa71/saga-is-1b3-delta-dpo-klsft-antihack
1B • Updated
Hodfa71/saga-is-6b7-delta-dpo-klsft-antihack
7B • Updated
Hodfa71/gpt-sw3-6b7-da-delta-dpo
7B • Updated • 14
Hodfa71/gpt-sw3-356m-is-delta-dpo-nosft-antihack
0.4B • Updated • 27
Hodfa71/llama-1b-is-delta-dpo
1B • Updated • 3
datasets 11
Hodfa71/normistral-11b-nb-saga-kl-sft-delta-dpo-pairs
Viewer • Updated • 8.87k • 21
Hodfa71/normistral-11b-nb-saga-nosft-delta-dpo-pairs
Viewer • Updated • 3.12k • 20
Hodfa71/gpt-sw3-1b3-nb-saga-delta-dpo-pairs
Viewer • Updated • 7.08k • 25
Hodfa71/normistral-7b-nb-saga-delta-dpo-pairs
Viewer • Updated • 9.13k • 24
Hodfa71/OmniAgentBench
Viewer • Updated • 30 • 12
Hodfa71/OmniAgentBench-Audio
Viewer • Updated • 30 • 54
Hodfa71/saga-da-delta-dpo-r1
Viewer • Updated • 7.41k • 22
Hodfa71/saga-da-delta-dpo-r2
Viewer • Updated • 7.31k • 26
Hodfa71/pstu-synthetic-secrets
Viewer • Updated • 175 • 31
Hodfa71/NER-German
Preview • Updated • 15