Chinese text humanization model series: SFT + DPO training pipeline, models and datasets included.
Isaac
XiangJinYu
AI & ML interests
Agent, LLM, RL
Recent Activity
updated a dataset 2 days ago
XiangJinYu/Qwen3.5-9B-Humanize-Dataset updated a model 2 days ago
XiangJinYu/Qwen3.5-9B-Humanize-DPO-Round2 updated a model 2 days ago
XiangJinYu/Qwen3.5-9B-Humanize-DPO-Round1Organizations
None yet