Papers:
- S0 Tuning: Zero-Overhead Adaptation of Hybrid Recurrent-Attention Models (2604.01168, published about 1 month ago, 7 upvotes)
- ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement (2604.01591, published about 1 month ago, 42 upvotes)
- How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings (2604.04323, published 26 days ago, 41 upvotes)
- Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation (2604.10098, published 21 days ago, 78 upvotes)
- KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance (2604.12627, published 18 days ago, 100 upvotes)
- Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents (2604.06132, published 25 days ago, 119 upvotes)
- SkillClaw: Let Skills Evolve Collectively with Agentic Evolver (2604.08377, published 23 days ago, 287 upvotes)

Articles:
- Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers (published 16 days ago, 66 upvotes)
- PangolinGuard: Fine-Tuning ModernBERT as a Lightweight Approach to AI Guardrails (published Mar 23, 2025, 13 upvotes)