edbeeching
edbeeching/vsft-llava-1.5-7b-hf
Image-Text-to-Text
• 7B • Updated • 57
edbeeching/atari_alien_3333
Reinforcement Learning
• Updated • 8
edbeeching/zephyr-7b-sft-qlora
Updated
edbeeching/llama2-70b-ift
Updated
Updated • 6
edbeeching/atari_2B_atari_surround_1111
Reinforcement Learning
• Updated
edbeeching/falcon-7b-ift-rm-22
Updated • 1
edbeeching/falcon-7b-ift-rm-21
Updated
edbeeching/test_peft_seq_cls_save
Updated
edbeeching/test_peft_save_falcon-7b
Updated
edbeeching/falcon-7b-ift-rm-10
Updated
edbeeching/falcon-7b-ift-rm-11
Updated
edbeeching/falcon-7b-ift-rm-06
Updated
edbeeching/falcon-7b-ift-rm-05
Updated
edbeeching/mujoco_mujoco_standup_2222
Reinforcement Learning
• Updated • 4
edbeeching/mujoco_standup_1111
Reinforcement Learning
• Updated • 2
edbeeching/mujoco_mujoco_pusher_3333
Reinforcement Learning
• Updated • 1
edbeeching/mujoco_mujoco_pusher_2222
Reinforcement Learning
• Updated • 2
edbeeching/mujoco_pusher_1111
Reinforcement Learning
• Updated
edbeeching/llama-7b-ift-ds-save-test5
Text Generation
• Updated • 5
edbeeching/llama-7b-ift-ds-save-test4
Text Generation
• Updated • 5
edbeeching/llama-7b-ift-ds-save-test3
Text Generation
• Updated • 4
edbeeching/llama-65b-ift-ds-v03
Text Generation
• Updated • 6
edbeeching/llama-65b-ift-ds-v02
Text Generation
• Updated • 6
edbeeching/llama-7b-se-rl-tokenizer
Updated
edbeeching/llama-se-rl-adapter
Text Generation
• Updated
edbeeching/llama-se-rl-finetune-128-8-8-1.4e-5step_1000-adapter-merged
Updated
edbeeching/llama-se-rl-finetune-128-8-8-1.4e-5_adamstep_800-adapter-merged
Text Generation
• Updated • 3
edbeeching/llama-se-rl-finetune-128-8-8-1.4e-5_adamstep_1100-adapter-merged
Text Generation
• Updated • 5
edbeeching/llama-se-rl-finetune-128-8-8-1.4e-5_adamstep_1000-adapter-merged
Text Generation
• Updated • 4