Vadim Kataev

vkataev

vkataev

AI & ML interests

LLMs, all types of ASR models, methods to reduce model sizes, methods to improve generalization, methods to increase model capacity

Recent Activity

upvoted an article about 2 months ago

We Got Claude to Fine-Tune an Open Source LLM

upvoted a paper 2 months ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

liked a Space 3 months ago

HuggingFaceTB/smol-training-playbook

View all activity

Organizations

upvoted an article about 2 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

588

upvoted a paper 2 months ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 295

liked a Space 3 months ago

The Smol Training Playbook

📚

2.95k

The secrets to building world-class LLMs

liked a dataset 4 months ago

karpathy/fineweb-edu-100b-shuffle

Viewer • Updated Sep 25, 2025 • 97.2M • 39.8k • 152

upvoted an article 4 months ago

Article

Visualizing How VLMs Work

Oct 7, 2025

•

upvoted 2 papers 4 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 273

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 547

liked 3 datasets 4 months ago

liked a Space 5 months ago

The Tokenizer Playground

📝

626

Experiment with and compare different tokenizers

liked a model 5 months ago

facebook/MobileLLM-R1-360M-base

Text Generation • 0.4B • Updated Nov 10, 2025 • 152 • 12

upvoted a paper 5 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 229

liked 4 datasets 6 months ago

FreedomIntelligence/medical-o1-reasoning-SFT

Viewer • Updated Apr 22, 2025 • 90.1k • 4.71k • 1.06k

wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 79k • 1.13k

intfloat/wikidata5m

Viewer • Updated Dec 24, 2022 • 4.82M • 169 • 9

intfloat/wikipedia

Updated Apr 23, 2023 • 27 • 7

upvoted a collection 6 months ago

Nemotron-Pre-Training-Datasets

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated about 8 hours ago • 96

commented a paper 6 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 181 •

upvoted a paper 8 months ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19, 2025 • 130

Vadim Kataev

AI & ML interests

Recent Activity

Organizations

vkataev's activity

We Got Claude to Fine-Tune an Open Source LLM

The Smol Training Playbook

Visualizing How VLMs Work

The Tokenizer Playground