shri210620 (shrikant lengare)

upvoted a paper 8 months ago

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published Feb 11 • 54

upvoted a paper 10 months ago

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published Feb 19 • 69

upvoted an article 10 months ago

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

Article • Feb 10 • 59

upvoted 2 collections 10 months ago

DeepSeek-R1

Collection • 10 items • Updated 14 days ago • 821

Qwen2-VL

Collection • Vision-language model series based on Qwen2 • 16 items • Updated Jul 21 • 226

upvoted 2 papers 10 months ago

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published Jan 30 • 23

o3-mini vs DeepSeek-R1: Which One is Safer?

Paper • 2501.18438 • Published Jan 30 • 23

upvoted a paper 11 months ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 115

upvoted 2 papers 12 months ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 39

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 98

upvoted 3 papers about 1 year ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 84

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 159

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 118

upvoted 2 articles about 1 year ago

Running Your Custom LoRA Fine-Tuned MusicGen Large Locally

Article • Dec 6, 2024 • 1

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Article • Dec 4, 2024 • 80

