WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 124 items • Updated 1 day ago • 19
RegMix: Data Mixture as Regression for Language Model Pre-training Paper • 2407.01492 • Published Jul 1, 2024 • 41
SD-E^2: Semantic Exploration for Reasoning Under Token Budgets Paper • 2601.17982 • Published Jan 25 • 1
Reasoning Path Divergence: A New Metric and Curation Strategy to Unlock LLM Diverse Thinking Paper • 2510.26122 • Published Jan 4 • 1
Unlocking Reasoning Capabilities in LLMs via Reinforcement Learning Exploration Paper • 2510.03865 • Published Oct 4, 2025 • 1
Controllable Exploration in Hybrid-Policy RLVR for Multi-Modal Reasoning Paper • 2602.20197 • Published Feb 22 • 1
Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search Paper • 2503.04412 • Published Mar 6, 2025 • 6
Reasoning with Sampling: Your Base Model is Smarter Than You Think Paper • 2510.14901 • Published Oct 16, 2025 • 49
Power-SMC: Low-Latency Sequence-Level Power Sampling for Training-Free LLM Reasoning Paper • 2602.10273 • Published Mar 23 • 1
Sampling for Quality: Training-Free Reward-Guided LLM Decoding via Sequential Monte Carlo Paper • 2604.16453 • Published Apr 7 • 1
Diversified Sampling Improves Scaling LLM Inference Paper • 2502.11027 • Published Feb 16, 2025 • 1
GuidedSampling: Steering LLMs Towards Diverse Candidate Solutions at Inference-Time Paper • 2510.03777 • Published Feb 14 • 2
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling Paper • 2407.21787 • Published Jul 31, 2024 • 14
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published May 12, 2025 • 86
From Growing to Looping: A Unified View of Iterative Computation in LLMs Paper • 2602.16490 • Published Feb 18 • 1