20 11

Oliver Kowalski

browser-kid

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

upvoted a paper 4 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

liked a model 4 days ago

mimifuong/Quasar_Mi4

View all activity

Organizations

None yet

upvoted 2 papers 4 days ago

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

Paper • 2605.16928 • Published 10 days ago • 89

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 6 days ago • 201

liked a model 4 days ago

mimifuong/Quasar_Mi4

3B • Updated 4 days ago • 35 • 1

liked a model 5 days ago

stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 1.88M • • 7.75k

upvoted 2 papers 8 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 13 days ago • 268

Learning from Failures: Correction-Oriented Policy Optimization with Verifiable Rewards

Paper • 2605.14539 • Published 12 days ago • 5

upvoted a paper 12 days ago

Can Muon Fine-tune Adam-Pretrained Models?

Paper • 2605.10468 • Published 15 days ago • 6

liked a dataset 15 days ago

Maynor996/upload2

Viewer • Updated 6 days ago • 1 • 1.05M • 15

liked a dataset 19 days ago

anonymous-24421/DriCo

Viewer • Updated 14 days ago • 16.9k • 55 • 1

liked a model 25 days ago

apol/alia-40b-distill-vapol

Text Generation • Updated 21 days ago • 2.01k • 2

upvoted 2 papers about 1 month ago

Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems

Paper • 2604.04936 • Published Jan 8 • 26

CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation

Paper • 2604.19636 • Published Apr 21 • 87

liked a dataset about 1 month ago

Salesforce/GiftEvalPretrain

Preview • Updated Jan 21, 2025 • 313k • 37

upvoted 2 papers about 2 months ago

R3PM-Net: Real-time, Robust, Real-world Point Matching Network

Paper • 2604.05060 • Published Apr 6 • 7

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 630

liked a model about 2 months ago

Qwen/Qwen2.5-1.5B-Instruct

Text Generation • 2B • Updated Sep 25, 2024 • 14.8M • • 709

upvoted 2 papers about 2 months ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 504

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper • 2604.01193 • Published Apr 1 • 54

liked a dataset about 2 months ago

mteb/sts22-crosslingual-sts

Viewer • Updated Feb 24 • 17.2k • 26.4k • 12

upvoted a paper about 2 months ago

Qworld: Question-Specific Evaluation Criteria for LLMs

Paper • 2603.23522 • Published Mar 6 • 10

Oliver Kowalski

AI & ML interests

Recent Activity

Organizations

browser-kid's activity