Contrastive Distribution Matching for Amortized Sequential Monte Carlo in Discrete Diffusion
Abstract
Contrastive Distribution Matching addresses efficient sampling from reward-tilted distributions in discrete diffusion models through learned twist functions that reduce computational overhead while maintaining accuracy across diverse applications.
Discrete diffusion models have emerged as powerful frameworks for generating structured categorical data. However, efficiently sampling from reward-tilted distributions remains a fundamental challenge. While Twisted Sequential Monte Carlo (SMC) offers asymptotic exactness for this task, estimating the optimal twist function in discrete state spaces necessitates costly Monte Carlo approximations, resulting in a severe computational bottleneck at inference. To overcome this limitation, we introduce Contrastive Distribution Matching (CDM), a novel framework that amortizes the cost of SMC inference by learning a parameterized twist function via positive and negative samples. For efficient training, we reformulate the gradient estimator to leverage the closed-form forward kernels of discrete diffusion models. In practice, evaluating our learned twist function incurs less than 5% additional computational overhead compared to a single forward pass of the base model. Through extensive empirical evaluations, we demonstrate that CDM consistently outperforms existing baselines under matched wall-clock time. We validate the effectiveness and versatility of our approach across a diverse range of applications, including toxic text generation, regulatory DNA sequence design, protein designability, and diffusion large language model alignment.
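As a rough illustration of what "sampling from a reward-tilted distribution" means, the sketch below builds a toy three-state example and applies a single reweight-and-resample step, the basic operation that SMC repeats across denoising steps. It is not the CDM method or the paper's code: the names `tilted_distribution` and `smc_resample`, the temperature `beta`, and the toy probabilities and rewards are all assumptions made for illustration. In twisted SMC the terminal reward used as the log-weight here would be replaced by ratios of intermediate twist values, which is the quantity the abstract proposes to learn.

```python
import numpy as np

def tilted_distribution(base_probs, rewards, beta=1.0):
    """Exact reward-tilted distribution p*(x) proportional to p(x) * exp(r(x) / beta).

    Only tractable here because the toy state space is tiny and enumerable.
    """
    log_w = np.log(base_probs) + rewards / beta
    log_w -= log_w.max()  # subtract the max for numerical stability
    w = np.exp(log_w)
    return w / w.sum()

def smc_resample(particles, log_weights, rng):
    """Multinomial resampling: redraw particles in proportion to their weights."""
    w = np.exp(log_weights - log_weights.max())
    w /= w.sum()
    idx = rng.choice(len(particles), size=len(particles), p=w)
    return particles[idx]

# Toy setup (hypothetical values): three categorical states.
rng = np.random.default_rng(0)
base_probs = np.array([0.5, 0.3, 0.2])  # stand-in for the base model
rewards = np.array([0.0, 1.0, 2.0])     # stand-in for the reward r(x)

target = tilted_distribution(base_probs, rewards)

# Draw particles from the base model, weight each by exp(r(x)), then resample.
particles = rng.choice(3, size=20000, p=base_probs)
resampled = smc_resample(particles, rewards[particles], rng)
empirical = np.bincount(resampled, minlength=3) / len(resampled)

print("exact tilted distribution:", target)
print("after one resampling step:", empirical)
```

With enough particles, the resampled histogram approaches the exact tilted distribution. In a real discrete diffusion model the state space cannot be enumerated, so the weights have to come from intermediate twist estimates rather than from exact normalization.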
