Dina Suehiro Jones

dmsuehir

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs

upvoted a paper 6 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

upvoted an article 8 months ago

Welcome Llama 4 Maverick & Scout on Hugging Face

Organizations

upvoted a paper about 2 months ago

Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs

Paper • 2510.11062 • Published Oct 13 • 28

upvoted a paper 6 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 277

upvoted 2 articles 8 months ago

Welcome Llama 4 Maverick & Scout on Hugging Face

Article • Apr 5 • 146

Building Your Own AI Document Dream Team: A Generic Multi-Agent System

Article • Apr 8

upvoted an article 10 months ago

Fine-Tune Meta Llama 3.2-Vision-Instruct Multimodal LLM on Intel Accelerators

Article • Jan 28

upvoted 2 articles about 1 year ago

Occam’s Sheath: A Simpler Approach to AI Safety Guardrails

Article • Oct 18, 2024

Model Card Generator Interface: Crafting Clear Insights into AI Models

Article • Sep 27, 2024

liked a model about 1 year ago

Intel/toxic-prompt-roberta

Text Classification • 0.1B • Updated Oct 16, 2024 • 103 • 8

upvoted an article about 1 year ago

Fine Tuning a LLM Using Kubernetes with Intel® Gaudi® Accelerator

Article • Sep 9, 2024

upvoted an article over 1 year ago

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Article • Aug 19, 2024

published an article over 1 year ago

Fine Tuning a LLM Using Kubernetes with Intel® Xeon® Scalable Processors

Article • Apr 24, 2024

liked 3 models about 2 years ago
