AI & ML interests

A one-year long research workshop on large language models: the Summer of Language Models 21 🌸

Recent Activity

christopher 
in bigscience/bloom 3 months ago

[SPAM] Deleted

3
#289 opened 3 months ago by
sarthak-saxena
stas 
posted an update 3 months ago
view post
Post
244
Good news! Ulysses Sequence Parallelism from the Snowflake AI Research and the Deepspeed teams has been integrated into
HuggingFace Trainer, Accelerate and TRL

For extensive details please see this writeup:
https://huggingface.co/blog/ulysses-sp

Thanks a lot to Kashif Rasul for helping make it happen. Also the others in the HF team who helped with integration.