Leandro von Werra PRO

lvwerra

huggingface

·

https://www.lvwerra.com

AI & ML interests

NLP and RL

Recent Activity

updated a bucket 5 days ago

ml-intern-explorers/hutter-prize-collab

updated a Space 7 days ago

ml-intern-explorers/hutter-prize-dashboard

liked a Space 7 days ago

AdithyaSK/rl-environments-guide

published an article 2 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

+7

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 149

published an article 7 months ago

Article

Unlock the power of images with AI Sheets

+4

Ameeeee, dvilasuero, frascuchon, damianpumar, lvwerra, thomwolf

•

Oct 21, 2025

• 33

published an article 8 months ago

Article

Jupyter Agents: training LLMs to reason with notebooks

+1

baptistecolle, hannayukhymenko, lvwerra

•

Sep 10, 2025

• 64

published an article 9 months ago

Article

Introducing AI Sheets: a tool to work with datasets using open AI models!

+4

dvilasuero, Ameeeee, frascuchon, damianpumar, lvwerra, thomwolf

•

Aug 8, 2025

• 109

published an article 10 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 773

published an article about 1 year ago

Article

Open R1: Update #4

open-r1

•

Mar 26, 2025

• 49

published an article about 1 year ago

Article

Open R1: Update #3

open-r1

•

Mar 11, 2025

• 297

published an article over 1 year ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

+5

eggie5, martinigoyanes, frisokingma, andreumora, lvwerra, thomwolf, m-ric

•

Feb 4, 2025

• 129

published an article over 1 year ago

Article

Open-R1: Update #1

open-r1

•

Feb 2, 2025

• 305

published an article over 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

eliebak, lvwerra, lewtun

•

Jan 28, 2025

• 889

published an article over 1 year ago

Article

LeMaterial: an open source initiative to accelerate materials discovery and research

+8

AlexDuvalinho, lritchie, msiron, inelgnu, etiennedufayet, amandinerossello, Ramlaoui, IAMJB, lvwerra, thomwolf

•

Dec 10, 2024

• 56

published an article over 1 year ago

Article

CinePile 2.0 - making stronger datasets with adversarial refinement

+2

RuchitRawal, mfarre, somepago, lvwerra

•

Oct 23, 2024

• 19

published an article over 1 year ago

Article

FineVideo: behind the scenes

+4

mfarre, andito, lewtun, lvwerra, pcuenq, thomwolf

•

Sep 23, 2024

• 35

published an article over 1 year ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

+4

medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf

•

Sep 18, 2024

• 280

published an article over 1 year ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

+1

neuralink, lvwerra, thomwolf

•

Aug 14, 2024

• 76

published an article almost 2 years ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

+6

philschmid, osanseviero, alvarobartt, lvwerra, dvilasuero, reach-vb, marcsun13, pcuenq

•

Jul 23, 2024

• 241

published an article almost 2 years ago

Article

BigCodeBench: The Next Generation of HumanEval

+7

terryyz, ganler, SivilTaram, huybery, Muennighoff, dpfried, harmdevries, lvwerra, clefourrier

•

Jun 18, 2024

• 54

published an article about 2 years ago

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

+7

yuxiang630, cassanof, ganler, YifengDing, StringChaos, harmdevries, lvwerra, arjunguha, lingming

•

Apr 29, 2024

• 79