Новиков Наталья

JosephRamirez

AI & ML interests

Research on LLM agents and evaluation. Mostly focused on experiments.

Organizations

None yet

Recent Activity

liked a dataset about 21 hours ago

wegrthj/e94fjt-v654-raw

Updated 6 minutes ago • 25.1k • 7

upvoted a paper 1 day ago

X-Stream: Exploring MLLMs as Multiplexers for Multi-Stream Understanding

Paper • 2606.02482 • Published 2 days ago • 25

liked a dataset 6 days ago

wegrthj/l36l5h-qi9l-raw

Updated 6 minutes ago • 21.1k • 8

upvoted a paper 8 days ago

WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation

Paper • 2605.25874 • Published 9 days ago • 101

liked a model 11 days ago

brendan-gho/qwen2.5-1.5b-liminal-otter-cot-seed1-mcq

Updated 9 days ago • 1

liked a dataset 12 days ago

trl-lib/trackio-dataset

Viewer • Updated 5 minutes ago • 3.83k • 26.8k • 12

liked a model 12 days ago

tencent/Hy-MT2-1.8B

Translation • 2B • Updated 8 days ago • 18.9k • 1.1k

upvoted a paper 13 days ago

WavFlow: Audio Generation in Waveform Space

Paper • 2605.18749 • Published 16 days ago • 10

upvoted a paper 15 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 21 days ago • 270

liked a model 16 days ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 4.61M • 4.84k

liked a model 20 days ago

deepseek-ai/DeepSeek-V3

Text Generation • 685B • Updated Mar 27, 2025 • 1.05M • 4.08k

upvoted a paper 27 days ago

AnalogRetriever: Learning Cross-Modal Representations for Analog Circuit Retrieval

Paper • 2604.23195 • Published Apr 25 • 3

upvoted 2 papers about 1 month ago

Credal Concept Bottleneck Models for Epistemic-Aleatoric Uncertainty Decomposition

Paper • 2604.24170 • Published Apr 27 • 2

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 242

upvoted a paper about 2 months ago

ClawArena: Benchmarking AI Agents in Evolving Information Environments

Paper • 2604.04202 • Published Apr 5 • 37

liked a dataset about 2 months ago

HuggingFaceH4/ultrachat_200k

Viewer • Updated Oct 16, 2024 • 515k • 69.5k • 722

upvoted a paper about 2 months ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 504

liked 2 models about 2 months ago

openbmb/VoxCPM2

Text-to-Speech • 2B • Updated Apr 16 • 238k • 1.36k

NexVeridian/gemma-4-31B-it-6bit

Text Generation • 31B • Updated Apr 22 • 58 • 1

upvoted a paper about 2 months ago

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 246
