Brendan Slevin

brend007

brendan-slevin-ab7496a7

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper about 4 hours ago

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

Paper • 2602.21534 • Published 3 days ago • 21

liked a dataset about 7 hours ago

WinkingFace/CryptoLM-Solana-SOL-USDT

Viewer • Updated Mar 19, 2025 • 32.3k • 73 • 10

liked a model about 7 hours ago

nvidia/Nemotron-Terminal-8B

Text Generation • 8B • Updated 3 days ago • 81 • 13

upvoted a paper about 21 hours ago

PyVision-RL: Forging Open Agentic Vision Models via RL

Paper • 2602.20739 • Published 3 days ago • 28

upvoted a paper 3 days ago

EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots

Paper • 2602.18071 • Published 7 days ago • 21

upvoted a paper 5 days ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published 10 days ago • 98

upvoted 2 articles 7 days ago

We Got Claude to Fine-Tune an Open Source LLM

Article • Dec 4, 2025 • 604

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Article • 8 days ago • 469

upvoted 2 papers 8 days ago

DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories

Paper • 2602.10809 • Published 16 days ago • 52

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

Paper • 2602.12036 • Published 15 days ago • 98

upvoted an article 10 days ago

Forge: Scalable Agent RL Framework and Algorithm

Article • 15 days ago • 126

upvoted a collection 10 days ago

Qwen3.5

Collection

9 items • Updated 1 day ago • 451

upvoted an article 12 days ago

Custom Kernels for All from Codex and Claude

Article • 15 days ago

upvoted a paper 15 days ago

Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

Paper • 2602.10224 • Published 17 days ago • 19

upvoted an article 22 days ago

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

Article • 24 days ago

upvoted a paper 27 days ago

The Script is All You Need: An Agentic Framework for Long-Horizon Dialogue-to-Cinematic Video Generation

Paper • 2601.17737 • Published Jan 25 • 55

upvoted an article about 1 month ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Article • Jan 27

upvoted a paper 2 months ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published Dec 23, 2025 • 62

upvoted an article 3 months ago

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

Article • Dec 8, 2025

upvoted a paper 3 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 258
