The Ultra-Scale Playbook • Space • 3.55k likes • The ultimate guide to training LLMs on large GPU clusters
The Smol Training Playbook • Space • 2.56k likes • The secrets to building world-class LLMs
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing • Paper • arXiv:2110.13900 • Published Oct 26, 2021 • 1 upvote
Gradio Hackathon Registration Winter 25 • Space • 179 likes • Gradio Agents & MCP Hackathon Winter 2025 Registration Page
Post • LiquidAI/LFM2-8B-A1B just dropped! 8.3B params with only 1.5B active per token
> Quality ≈ 3–4B dense, yet faster than Qwen3-1.7B
> MoE designed to run on phones/laptops (llama.cpp / vLLM)
> Pre-trained on 12T tokens → strong math/code/IF
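The post names llama.cpp and vLLM as the local inference paths. Below is a minimal sketch of loading the checkpoint with vLLM's offline API, assuming the model is supported there as the post claims; the prompt text and sampling values are illustrative, not taken from the post.

# Minimal sketch: offline inference for LiquidAI/LFM2-8B-A1B via vLLM,
# assuming the checkpoint is supported as the post claims.
from vllm import LLM, SamplingParams

# MoE checkpoint: 8.3B total parameters, ~1.5B active per token (per the post).
llm = LLM(model="LiquidAI/LFM2-8B-A1B")

# Illustrative sampling settings, not taken from the post.
params = SamplingParams(temperature=0.7, max_tokens=128)

outputs = llm.generate(["Explain mixture-of-experts routing in two sentences."], params)
print(outputs[0].outputs[0].text)

Because only ~1.5B parameters are active per token, decode-time compute is closer to a small dense model, which is what makes the phone/laptop deployment claim plausible.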
Moshi: a speech-text foundation model for real-time dialogue • Paper • arXiv:2410.00037 • Published Sep 17, 2024 • 8 upvotes