OpenMOSS Team

university

http://openmoss.sii.edu.cn/

OpenMOSS

AI & ML interests

LLM

Recent Activity

freesky submitted a paper 2 days ago

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

artpli authored a paper 2 days ago

CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors

Qiancccc updated a dataset 3 days ago

OpenMOSS-Team/FutureOmni

Papers

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs

OpenMOSS-Team's collections 13

FutureOmni

First Omni-modal Future Forecasting Benchmark

OpenMOSS-Team/FutureOmni

Viewer • Updated 3 days ago • 1.03k • 275 • 2
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs

Paper • 2601.13836 • Published 5 days ago • 34

MOSS Transcribe Diarize

A unified multimodal large language model for end-to-end speaker-attributed, time-stamped transcription.

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published 20 days ago • 55
MOSS Transcribe Diarize

Space • Running • Featured • 48
Transcribe audio/video files with speaker identification

DiRL

An Efficient Training Framework for Diffusion Language Models

OpenMOSS-Team/DiRL-8B-Instruct

Text Generation • 8B • Updated 5 days ago • 57 • 11

MOSS-Speech

True Speech-to-Speech Language Model

OpenMOSS-Team/MOSS-Speech

9B • Updated Sep 30, 2025 • 106 • 16
OpenMOSS-Team/MOSS-Speech-Codec

0.9B • Updated Oct 1, 2025 • 98 • 4
MOSS-Speech Demo

Space • Running on Zero • 15
True Speech-to-Speech Language Model
MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Paper • 2510.00499 • Published Oct 1, 2025 • 19

MOSS Embodied Planner

OpenMOSS-Team/Embodied_R1-ScienceWorld

8B • Updated Jun 30, 2025 • 1
OpenMOSS-Team/Embodied_Planner-R1-Alfworld

8B • Updated Jun 30, 2025 • 5
Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning

Paper • 2506.23127 • Published Jun 29, 2025 • 1
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published Mar 13, 2025 • 55

MHA2MLA-refactor

The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"

OpenMOSS-Team/SmolLM-135M-MLA-d_kv_8-refactor

Text Generation • 0.1B • Updated Jun 23, 2025 • 6
OpenMOSS-Team/SmolLM-135M-MLA-d_kv_32-refactor

Text Generation • 0.1B • Updated Jun 17, 2025 • 25
OpenMOSS-Team/SmolLM-135M-MLA-d_kv_16-refactor

Text Generation • 0.1B • Updated Jun 17, 2025 • 9
OpenMOSS-Team/SmolLM-360M-MLA-d_kv_8-refactor

Text Generation • 0.3B • Updated Jun 17, 2025 • 21

MOSS

OpenMOSS-Team/moss-moon-003-sft-plugin

Text Generation • Updated Apr 25, 2023 • 13 • 69
OpenMOSS-Team/moss-moon-003-sft

Text Generation • Updated Apr 25, 2023 • 19 • 127
OpenMOSS-Team/moss-moon-003-base

Text Generation • Updated Apr 25, 2023 • 140 • 131
OpenMOSS-Team/moss-moon-003-sft-int4

Text Generation • Updated Apr 26, 2023 • 23 • 40

Game-RL

Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning

Code2Logic/GameQA-140K

Viewer • Updated Oct 6, 2025 • 300 • 202 • 14
OpenMOSS-Team/GameQA-5K

Preview • Updated Jun 22, 2025 • 14 • 1
OpenMOSS-Team/Game-RL-Qwen2.5-VL-7B

Image-Text-to-Text • 8B • Updated Jul 27, 2025 • 8
OpenMOSS-Team/Game-RL-InternVL3-8B

8B • Updated Jun 17, 2025 • 19 • 1

FRoM-W1

https://github.com/OpenMOSS/FRoM-W1

OpenMOSS-Team/FRoM-W1

Updated 4 days ago • 6
OpenMOSS-Team/FRoM-W1-Datasets

Viewer • Updated 4 days ago • 166k • 182 • 4
FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions

Paper • 2601.12799 • Published 6 days ago • 2

RoboOmni

Proactive Robot Manipulation in Omni-modal Context

OpenMOSS-Team/RoboOmni

Robotics • Updated Oct 30, 2025 • 66 • 5
OpenMOSS-Team/RoboOmni-LIBERO-Spatial

Robotics • Updated Oct 31, 2025 • 10 • 1
OpenMOSS-Team/RoboOmni-LIBERO-Goal

Updated Oct 29, 2025 • 2
OpenMOSS-Team/RoboOmni-LIBERO-Object

Updated Oct 29, 2025 • 8

MOSS-TTSD

OpenMOSS-Team/MOSS-TTSD-v0.5

Text-to-Speech • 2B • Updated Sep 2, 2025 • 203 • 52
OpenMOSS-Team/MOSS-TTSD-v0

Text-to-Speech • 2B • Updated Jun 20, 2025 • 7 • 27
MOSS TTSD

Space • Running on Zero • 39
MOSS-TTSD: Text to Spoken Dialogue Generation
OpenMOSS-Team/MOSS-TTSD-v0.7

Text-to-Speech • 2B • Updated Nov 11, 2025 • 724 • 15

Low Rank Sparse Attention

Open source weights of Lorsa modules introduced in "Towards Understanding the Nature of Attention with Low-Rank Sparse Decomposition".

OpenMOSS-Team/Lorsa

Updated Apr 28, 2025 • 2
OpenMOSS-Team/Lorsa-Pythia-160M

Updated May 8, 2025 • 1
OpenMOSS-Team/Lorsa-Llama-3.1-8B

Updated May 8, 2025

MHA2MLA

The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Paper • 2502.14837 • Published Feb 20, 2025 • 3
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_16

Text Generation • 6B • Updated Mar 13, 2025 • 4
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_32

Text Generation • 6B • Updated Mar 13, 2025 • 3
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_64

Text Generation • 7B • Updated Mar 13, 2025 • 1

Team members 17
