MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models
Abstract
MolHIT presents a hierarchical discrete diffusion model for molecular graph generation that achieves superior chemical validity and property-guided synthesis compared to existing 1D and graph-based approaches.
Molecular generation with diffusion models has emerged as a promising direction for AI-driven drug discovery and materials science. While graph diffusion models have been widely adopted due to the discrete nature of 2D molecular graphs, existing models suffer from low chemical validity and struggle to meet the desired properties compared to 1D modeling. In this work, we introduce MolHIT, a powerful molecular graph generation framework that overcomes long-standing performance limitations in existing methods. MolHIT is based on the Hierarchical Discrete Diffusion Model, which generalizes discrete diffusion to additional categories that encode chemical priors, and decoupled atom encoding that splits the atom types according to their chemical roles. Overall, MolHIT achieves new state-of-the-art performance on the MOSES dataset with near-perfect validity for the first time in graph diffusion, surpassing strong 1D baselines across multiple metrics. We further demonstrate strong performance in downstream tasks, including multi-property guided generation and scaffold extension.
Community
We introduce a hierarchical discrete diffusion framework for molecular graph generation that overcomes long-standing performance limitations in existing methods. We generalize discrete diffusion to additional categories that encode chemical priors and decouple atom encoding that splits the atom types according to their chemical roles.
MolHIT achieves new state-of-the-art performance on the MOSES dataset with near-perfect validity for the first time in graph diffusion, surpassing strong 1D baselines across multiple metrics. We further demonstrate strong performance in downstream tasks, including multi-property guided generation and scaffold extension.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- From Tokens to Blocks: A Block-Diffusion Perspective on Molecular Generation (2026)
- Breaking the Bottlenecks: Scalable Diffusion Models for 3D Molecular Generation (2026)
- Reward-Guided Discrete Diffusion via Clean-Sample Markov Chain for Molecule and Biological Sequence Design (2026)
- 3D Molecule Generation from Rigid Motifs via SE(3) Flows (2026)
- Variational Bayesian Flow Network for Graph Generation (2026)
- Improved Sampling Schedules for Discrete Diffusion Models (2026)
- SaDiT: Efficient Protein Backbone Design via Latent Structural Tokenization and Diffusion Transformers (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper