ldwang

ftgreat

AI & ML interests

LLM, MLLM, Infra

Recent Activity

liked a model 2 days ago

MiniMaxAI/MiniMax-M2.7

upvoted a paper 3 days ago

Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

liked a dataset 4 days ago

nvidia/Nemotron-SFT-OpenCode-v1

View all activity

Organizations

upvoted a paper 3 days ago

Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

Paper • 2603.25562 • Published 19 days ago • 12

upvoted an article 20 days ago

Article

Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️

Jan 4, 2025

•

upvoted a collection 24 days ago

UltraData

Collection

Ultra Scale, Ultra Quality, Ultra Coverage • 9 items • Updated 8 days ago • 80

upvoted a paper 24 days ago

Data Science and Technology Towards AGI Part I: Tiered Data Management

Paper • 2602.09003 • Published Feb 9 • 7

upvoted a paper about 1 month ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 194

upvoted a collection about 1 month ago

Open Coding Agents Specialization

Collection

Ai2 Open Coding Agents - Django, Sphinx, Sympy Data • 6 items • Updated Feb 11 • 5

upvoted 4 papers about 1 month ago

upvoted an article 3 months ago

Article

Automated Discovery of High-Performance GPU Kernels with OpenEvolve

Jun 27, 2025

•

upvoted 5 papers 3 months ago

Towards Automated Kernel Generation in the Era of LLMs

Paper • 2601.15727 • Published Jan 22 • 19

VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation

Paper • 2601.10124 • Published Jan 15 • 4

SWE-smith: Scaling Data for Software Engineering Agents

Paper • 2504.21798 • Published Apr 30, 2025 • 15

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Paper • 2512.24873 • Published Dec 31, 2025 • 108

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 322

upvoted a collection 4 months ago

Molmo2 Data

Collection

Artifacts for the Molmo2 data release • 13 items • Updated Mar 2 • 39

upvoted 3 papers 4 months ago

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

Paper • 2512.02551 • Published Dec 2, 2025 • 13

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 304

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

Paper • 2512.02395 • Published Dec 2, 2025 • 51

ldwang

AI & ML interests

Recent Activity

Organizations

ldwang's activity

Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️

Automated Discovery of High-Performance GPU Kernels with OpenEvolve