Youmi Ma

maym15

AI & ML interests

None yet

Recent Activity

published a model 16 days ago

maym15/Olmo-3-7B-Think-RetMask

published a model 16 days ago

maym15/Olmo-3-7B-Instruct-RetMask

published a model 16 days ago

maym15/Qwen3-8B-RetMask

View all activity

Organizations

published 4 models 16 days ago

updated a model 18 days ago

maym15/Olmo-3-7B-Think-RetMask

Text Generation • 7B • Updated 18 days ago • 13

updated a collection 18 days ago

RetMask

Collection

Trained checkpoints for the paper "From Interpretability to Performance: Optimizing Retrieval Heads for Long-Context Language Models" • 4 items • Updated 18 days ago

updated a model 18 days ago

maym15/Olmo-3-7B-Instruct-RetMask

Text Generation • 7B • Updated 18 days ago • 12

updated 2 models 19 days ago

maym15/Llama-3.1-8B-Instruct-RetMask

Text Generation • 8B • Updated 19 days ago • 15

maym15/Qwen3-8B-RetMask

Text Generation • 8B • Updated 19 days ago • 17

published a model 11 months ago

tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.5

Text Generation • 8B • Updated Jun 25, 2025 • 2.33k • • 19

updated 2 models 11 months ago

tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.5

Text Generation • 8B • Updated Jun 25, 2025 • 2.33k • • 19

tokyotech-llm/Llama-3.1-Swallow-8B-v0.5

8B • Updated Jul 1, 2025 • 420 • 9

updated a Space 11 months ago

README

🌍

updated 4 models about 1 year ago

tokyotech-llm/Llama-3.3-Swallow-70B-Instruct-v0.4

Text Generation • 71B • Updated Jul 1, 2025 • 224 • • 13

tokyotech-llm/Llama-3.1-Swallow-70B-Instruct-v0.3

Text Generation • 71B • Updated Apr 2, 2025 • 378 • • 13

tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3

Text Generation • 8B • Updated Apr 2, 2025 • 2.98k • • 24

tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.2

Text Generation • 8B • Updated Apr 2, 2025 • 126 • • 16

Youmi Ma

AI & ML interests

Recent Activity

Organizations

maym15's activity

README