Yizhi
MercedeSnape
AI & ML interests
None yet
Recent Activity
updated
a collection
about 9 hours ago
agent env
updated
a collection
3 days ago
MoE
updated
a collection
3 days ago
RL agent
Organizations
None yet
ViT
future
LLM reasoning
mm thinking
agent training
agent env
model paradigm
Memory
-
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models
Paper • 2511.11007 • Published • 15 -
O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents
Paper • 2511.13593 • Published • 25 -
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 163 -
MemEvolve: Meta-Evolution of Agent Memory Systems
Paper • 2512.18746 • Published • 28
KG
Benchmark: method
Problem Definition
Evolve
reasoning evaluation
agent reasoning
RL agent
-
Scaling Agent Learning via Experience Synthesis
Paper • 2511.03773 • Published • 81 -
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
Paper • 2511.21689 • Published • 115 -
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Paper • 2601.05242 • Published • 164
mas
MoE
RAG
-
Multi-hop Reasoning via Early Knowledge Alignment
Paper • 2512.20144 • Published • 6 -
Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding
Paper • 2512.17220 • Published • 110 -
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling
Paper • 2512.23959 • Published • 97
Tokenization
RL training
Benchmark: method
ViT
Problem Definition
future
Evolve
LLM reasoning
reasoning evaluation
mm thinking
agent reasoning
agent training
RL agent
-
Scaling Agent Learning via Experience Synthesis
Paper • 2511.03773 • Published • 81 -
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
Paper • 2511.21689 • Published • 115 -
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Paper • 2601.05242 • Published • 164
agent env
mas
model paradigm
MoE
Memory
-
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models
Paper • 2511.11007 • Published • 15 -
O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents
Paper • 2511.13593 • Published • 25 -
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 163 -
MemEvolve: Meta-Evolution of Agent Memory Systems
Paper • 2512.18746 • Published • 28
RAG
-
Multi-hop Reasoning via Early Knowledge Alignment
Paper • 2512.20144 • Published • 6 -
Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding
Paper • 2512.17220 • Published • 110 -
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling
Paper • 2512.23959 • Published • 97
KG
Tokenization