1 126 9

SAMBIT CHAKRABORTY

sambitchakhf03

AI & ML interests

None yet

Recent Activity

new activity 6 days ago

sambitchakhf03/SwiFTeDLM:Adding `safetensors` variant of this model

liked a model 6 days ago

sambitchakhf03/SwiFTeDLM

updated a model 8 days ago

sambitchakhf03/gemma-3-270m-classifier

View all activity

Organizations

New activity in sambitchakhf03/SwiFTeDLM 6 days ago

Adding `safetensors` variant of this model

#1 opened 9 months ago by

SFconvertbot

liked a model 6 days ago

sambitchakhf03/SwiFTeDLM

Text Generation • 7B • Updated 6 days ago • 19 • 1

updated a model 8 days ago

sambitchakhf03/gemma-3-270m-classifier

0.3B • Updated 8 days ago • 16

published a model 8 days ago

sambitchakhf03/gemma-3-270m-classifier

0.3B • Updated 8 days ago • 16

upvoted an article 11 days ago

Article

The Optimal Architecture for Small Language Models

13 days ago

•

upvoted a paper 13 days ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published 17 days ago • 61

upvoted a paper 14 days ago

GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation

Paper • 2512.17495 • Published 20 days ago • 19

upvoted a paper 15 days ago

CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion

Paper • 2512.19535 • Published 17 days ago • 11

upvoted a paper 20 days ago

Universal Reasoning Model

Paper • 2512.14693 • Published 23 days ago • 41

upvoted 2 papers 21 days ago

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4, 2025 • 133

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published 23 days ago • 114

upvoted a paper 23 days ago

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

Paper • 2512.10430 • Published 28 days ago • 113

upvoted a paper 24 days ago

Rethinking Training Dynamics in Scale-wise Autoregressive Generation

Paper • 2512.06421 • Published Dec 6, 2025 • 5

upvoted a paper 25 days ago

BEAVER: An Efficient Deterministic LLM Verifier

Paper • 2512.05439 • Published Dec 5, 2025 • 35

upvoted a paper 27 days ago

OmniPSD: Layered PSD Generation with Diffusion Transformer

Paper • 2512.09247 • Published 29 days ago • 46

upvoted 2 papers 30 days ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published about 1 month ago • 75

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Paper • 2512.07525 • Published about 1 month ago • 57

upvoted 3 papers about 1 month ago

Self-Improving VLM Judges Without Human Annotations

Paper • 2512.05145 • Published Dec 2, 2025 • 19

Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment

Paper • 2511.22345 • Published Nov 27, 2025 • 12

Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach

Paper • 2512.02834 • Published Dec 2, 2025 • 40

SAMBIT CHAKRABORTY

AI & ML interests

Recent Activity

Organizations

sambitchakhf03's activity

Adding `safetensors` variant of this model

The Optimal Architecture for Small Language Models