25 13

Malkesh Dalia

malkesh2911

AI & ML interests

None yet

Recent Activity

upvoted a collection 1 day ago

Gemma 4

liked a model 2 days ago

ggerganov/whisper.cpp

liked a model 6 days ago

netflix/void-model

View all activity

Organizations

None yet

upvoted a collection 1 day ago

Gemma 4

Collection

8 items • Updated 7 days ago • 517

liked a model 2 days ago

ggerganov/whisper.cpp

Automatic Speech Recognition • Updated Oct 29, 2024 • 1.36k

liked a model 6 days ago

netflix/void-model

Video-to-Video • Updated 3 days ago • 681

upvoted an article 6 days ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

8 days ago

•

802

upvoted a paper 8 days ago

Emergent Social Intelligence Risks in Generative Multi-Agent Systems

Paper • 2603.27771 • Published 11 days ago • 50

upvoted a paper 9 days ago

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Paper • 2603.28767 • Published 10 days ago • 56

upvoted a paper 12 days ago

Vega: Learning to Drive with Natural Language Instructions

Paper • 2603.25741 • Published 14 days ago • 6

liked a model 13 days ago

facebook/tribev2

Updated 13 days ago • 83k • 342

upvoted a paper 14 days ago

Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought

Paper • 2603.22847 • Published 16 days ago • 25

updated a collection 15 days ago

My AI

Collection

6 items • Updated 15 days ago

upvoted a paper 15 days ago

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published 23 days ago • 94

upvoted a paper 26 days ago

EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation

Paper • 2603.12267 • Published 28 days ago • 13

liked a dataset 29 days ago

pwc-archive/papers-with-abstracts

Viewer • Updated Sep 10, 2025 • 576k • 287 • 14

upvoted a paper 2 months ago

Beyond Imitation: Reinforcement Learning for Active Latent Planning

Paper • 2601.21598 • Published Jan 29 • 10

liked a model 2 months ago

Qwen/Qwen3-ASR-1.7B

Automatic Speech Recognition • 2B • Updated Jan 30 • 1.47M • 677

updated a collection 2 months ago

My AI

Collection

6 items • Updated 15 days ago

upvoted 2 papers 3 months ago

MAXS: Meta-Adaptive Exploration with LLM Agents

Paper • 2601.09259 • Published Jan 14 • 96

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published Dec 23, 2025 • 62

upvoted 2 papers 4 months ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published Dec 22, 2025 • 66

Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published Dec 18, 2025 • 42

Malkesh Dalia

AI & ML interests

Recent Activity

Organizations

malkesh2911's activity

Welcome Gemma 4: Frontier multimodal intelligence on device