
Your ICLR Recommendation list

There are 1000 papers for you in ICLR 2025


1. Distribution Backtracking Builds A Faster Convergence Trajectory for Diffusion Distillation

[openreview] [pdf]

Abstract Accelerating the sampling speed of diffusion models remains a significant challenge. Recent score distillation methods distill a heavy teacher model into a student generator to achieve one-step generation, which is optimized by calculating the difference between two score functions on the samples generated by the student model. However, there is a score mismatch issue in the early stage of the score distillation process, since existing methods mainly focus on using the endpoint of pre-trained diffusion models as teacher models, overlooking the importance of the convergence trajectory between the student generator and the teacher model. To address this issue, we extend the score distillation process by introducing the entire convergence trajectory of the teacher model and propose Distribution Backtracking Distillation (DisBack). DisBack is composed of two stages: Degradation Recording and Distribution Backtracking. Degradation Recording is designed to obtain the convergence trajectory by recording the degradation path from the pre-trained teacher model to the untrained student generator. The degradation path implicitly represents the intermediate distributions between the teacher and the student, and its reverse can be viewed as the convergence trajectory from the student generator to the teacher model. Distribution Backtracking then trains the student generator to backtrack the intermediate distributions along the path to approximate the convergence trajectory of the teacher model. Extensive experiments show that DisBack achieves faster and better convergence than existing distillation methods and achieves comparable or better generation performance, with an FID score of 1.38 on the ImageNet 64×64 dataset. DisBack is easy to implement and can be generalized to existing distillation methods to boost performance.
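
The two-stage recipe is concrete enough to sketch in code. Below is a minimal PyTorch-style sketch, assuming toy score networks and a plain MSE matching loss in place of the paper's actual score-distillation objectives; the function names are ours, not the authors' API.

```python
# Hypothetical sketch of DisBack's two stages on toy networks.
import copy
import torch
import torch.nn as nn

def degradation_record(teacher, student_init, data, steps=100, snap_every=20):
    """Stage 1: fine-tune a copy of the teacher toward the untrained student's
    outputs, saving checkpoints along the way (the degradation path)."""
    path = [copy.deepcopy(teacher.state_dict())]
    model = copy.deepcopy(teacher)
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    for step in range(steps):
        x = data[torch.randint(len(data), (32,))]
        loss = (model(x) - student_init(x).detach()).pow(2).mean()
        opt.zero_grad(); loss.backward(); opt.step()
        if (step + 1) % snap_every == 0:
            path.append(copy.deepcopy(model.state_dict()))
    return path[::-1]  # reversed: from student-like checkpoints back to the teacher

def backtrack_distill(student, trajectory, teacher_arch, data, steps_per_target=50):
    """Stage 2: distill the student against each intermediate target in turn."""
    target = copy.deepcopy(teacher_arch)
    opt = torch.optim.Adam(student.parameters(), lr=1e-3)
    for state in trajectory:
        target.load_state_dict(state)
        for _ in range(steps_per_target):
            x = data[torch.randint(len(data), (32,))]
            loss = (student(x) - target(x).detach()).pow(2).mean()
            opt.zero_grad(); loss.backward(); opt.step()
    return student

# toy usage with 2-D "scores"
data = torch.randn(1024, 2)
teacher = nn.Sequential(nn.Linear(2, 64), nn.ReLU(), nn.Linear(64, 2))
student = nn.Sequential(nn.Linear(2, 64), nn.ReLU(), nn.Linear(64, 2))
traj = degradation_record(teacher, student, data)
student = backtrack_distill(student, traj, teacher, data)
```

The key structural point is the reversal: checkpoints are recorded while degrading the teacher toward the untrained student, then replayed in reverse order as successive distillation targets.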

2. Diffusion Transformer Policy

[openreview] [pdf]

Abstract Recent large visual-language action models pretrained on diverse robot datasets have demonstrated the potential to generalize to new environments with only a small amount of in-domain data. However, those approaches usually predict discretized or continuous actions with a small action head, which limits their ability to handle diverse action spaces. In contrast, we model continuous actions with a large multi-modal diffusion transformer, dubbed Diffusion Transformer Policy, in which we directly denoise action chunks with a large transformer model rather than a small action head. By leveraging the scaling capability of transformers, the proposed approach can effectively model continuous end-effector actions across large, diverse robot datasets and achieve better generalization performance. Extensive experiments demonstrate that Diffusion Transformer Policy pre-trained on diverse robot data can generalize to different embodiments, including simulation environments like Maniskill2 and Calvin, as well as a real-world Franka arm. Specifically, without bells and whistles, the proposed approach achieves state-of-the-art performance in the Calvin novel task setting, and the pre-training stage improves the success sequence length on Calvin by over 1.2. The code will be publicly available.

3. Diffusion Policy Policy Optimization

[openreview] [pdf]

Abstract We introduce Diffusion Policy Policy Optimization, DPPO, an algorithmic framework including best practices for fine-tuning diffusion-based policies (e.g. Diffusion Policy) in continuous control and robot learning tasks using the policy gradient (PG) method from reinforcement learning (RL). PG methods are ubiquitous in training RL policies with other policy parameterizations; nevertheless, they have been conjectured to be less efficient for diffusion-based policies. Surprisingly, we show that DPPO achieves the strongest overall performance and efficiency for fine-tuning in common benchmarks compared to other RL methods for diffusion-based policies and also compared to PG fine-tuning of other policy parameterizations. Through experimental investigation, we find that DPPO takes advantage of unique synergies between RL fine-tuning and the diffusion parameterization, leading to structured and on-manifold exploration, stable training, and strong policy robustness. We further demonstrate the strengths of DPPO in a range of realistic settings, including simulated robotic tasks with pixel observations, and via zero-shot deployment of simulation-trained policies on robot hardware in a long-horizon, multi-stage manipulation task.

4. Diffusion Models for 4D Novel View Synthesis

[openreview] [pdf]

Abstract We present 4DiM, a cascaded diffusion model for 4D novel view synthesis (NVS), supporting generation with arbitrary camera trajectories and timestamps, in natural scenes, conditioned on one or more images. With a novel architecture and sampling procedure, we enable training on a mixture of 3D (with camera pose), 4D (pose+time) and video (time but no pose) data, which greatly improves generalization to unseen images and camera pose trajectories over prior works which generally operate in limited domains (e.g., object centric). 4DiM is the first-ever NVS method with intuitive metric-scale camera pose control enabled by our novel calibration pipeline for structure-from-motion-posed data. Experiments demonstrate that 4DiM outperforms prior 3D NVS models both in terms of image fidelity and pose alignment, while also enabling the generation of scene dynamics. 4DiM provides a general framework for a variety of tasks including single-image-to-3D, two-image-to-video (interpolation and extrapolation), and pose-conditioned video-to-video translation, which we illustrate qualitatively on a variety of scenes. See https://anonymous-4d-diffusion.github.io for video samples.

5. The Deficit of New Information in Diffusion Models: A Focus on Diverse Samples

[openreview] [pdf]

Abstract Diffusion models are renowned for their state-of-the-art performance in generating high-quality images. Identifying samples with new information beyond the training data is essential for data augmentation, especially for enhancing model performance in diverse and unforeseen real-world scenarios. However, the presence of new information in generated samples has not been well explored. Our investigation through the lens of information theory reveals that diffusion models do not produce new information beyond what exists in the training data. We then introduce the concept of diverse samples (DS) to show that generated images can contain information not present in the training data for diffusion models. Furthermore, we propose a method for identifying diverse samples among generated images by extracting deep features and detecting images that fall outside the boundary of real images. We demonstrate that diverse samples exist in the generated data of diffusion models, attributed to the estimation of the forward and backward processes, but that diffusion models can only produce a limited number of diverse samples, underscoring a notable gap in their capability to generate diverse samples. In addition, our experiment on the Chest X-ray dataset demonstrates that diverse samples are more useful in improving classification accuracy than vanilla-generated samples. The source code is available at https://github.com/lypz12024/diffusion-diverse-samples.
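
The detection rule lends itself to a compact illustration. The sketch below is our stand-in, with random vectors in place of deep features and a max kNN-distance threshold as the "boundary" of real images; the paper's actual boundary construction may differ.

```python
# Illustrative diverse-sample detector: generated images whose features fall
# outside the real-feature boundary count as diverse. All choices here
# (features, threshold rule) are assumptions for demonstration.
import numpy as np

def knn_dist(x, ref, k=5, exclude_self=False):
    d = np.linalg.norm(x[:, None, :] - ref[None, :, :], axis=-1)
    if exclude_self:
        np.fill_diagonal(d, np.inf)  # don't count a point as its own neighbor
    return np.sort(d, axis=1)[:, :k].mean(axis=1)

rng = np.random.default_rng(0)
real_feats = rng.normal(size=(500, 64))            # features of real images
gen_feats = rng.normal(0.2, 1.1, size=(200, 64))   # features of generated images

threshold = knn_dist(real_feats, real_feats, exclude_self=True).max()
is_diverse = knn_dist(gen_feats, real_feats) > threshold
print(f"diverse fraction: {is_diverse.mean():.3f}")
```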

6. What Makes a Good Diffusion Planner for Decision Making?

[openreview] [pdf]

Abstract Diffusion models have recently shown significant potential in solving decision-making problems, particularly in generating behavior plans -- also known as diffusion planning. While numerous studies have demonstrated the impressive performance of diffusion planning, the mechanisms behind the key components of a good diffusion planner remain unclear, and the design choices in existing studies are highly inconsistent. In this work, we address this issue through systematic empirical experiments on diffusion planning in an offline reinforcement learning (RL) setting, providing practical insights into the essential components of diffusion planning. We trained and evaluated over 6,000 diffusion models, identifying critical components such as guided sampling, network architecture, action generation, and planning strategy. We reveal that some design choices opposite to common practice in previous diffusion-planning work actually lead to better performance; for example, unconditional sampling with selection can be better than guided sampling, and a Transformer outperforms a U-Net as the denoising network. Based on these insights, we suggest a simple yet strong diffusion planning baseline that achieves state-of-the-art results on standard offline RL benchmarks.
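
The "unconditional sampling with selection" finding is easy to render schematically. In the hypothetical sketch below, `sample_plan` and `value_fn` are placeholder callables standing in for an unconditional diffusion planner and a learned return estimator.

```python
# Sample-then-select planning: draw several plans unconditionally and keep
# the one a value estimate scores highest. Interfaces are illustrative.
import torch

def select_plan(sample_plan, value_fn, n_candidates=64):
    plans = torch.stack([sample_plan() for _ in range(n_candidates)])
    scores = value_fn(plans)                # e.g., a learned return predictor
    return plans[scores.argmax()]

# toy usage: 10-step plans over a 4-dim action space, scored by mean action
best = select_plan(lambda: torch.randn(10, 4), lambda p: p.mean(dim=(1, 2)))
```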

7. Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generation of Diffusion Models

[openreview] [pdf]

Abstract Diffusion models have made substantial advances in image generation, yet models trained on large, unfiltered datasets often yield outputs misaligned with human preferences. Numerous methods have been proposed to fine-tune pre-trained diffusion models, achieving notable improvements in aligning generated outputs with human preferences. However, we point out that existing preference alignment methods neglect the critical role of handling unconditional/negative-conditional outputs, leading to a diminished capacity to avoid generating undesirable outcomes. This oversight limits the efficacy of classifier-free guidance (CFG), which relies on the contrast between conditional generation and unconditional/negative-conditional generation to optimize output quality. In response, we propose a straightforward but versatile and effective approach that involves training a model specifically attuned to negative preferences. This method does not require new training strategies or datasets but rather involves minor modifications to existing techniques. Our approach integrates seamlessly with models such as SD15, SDXL, video diffusion models, and models that have undergone preference optimization, consistently enhancing their ability to produce outputs better aligned with human preferences.

8. Discovery and Expansion of New Domains within Diffusion Models

[openreview] [pdf]

Abstract In this work, we study the generalization properties of diffusion models in a few-shot setup, introduce a novel tuning-free paradigm to synthesize the target out-of-domain (OOD) data, showcase multiple applications of those generalization properties, and demonstrate the advantages compared to existing tuning-based methods in data-sparse scientific scenarios with large domain gaps. Our work rests on the observation and premise that the theoretical formulation of denoising diffusion implicit models (DDIMs), a non-Markovian inference technique, exhibits latent Gaussian priors independent from the parameters of trained denoising diffusion probabilistic models (DDPMs). This brings two practical benefits: the latent Gaussian priors generalize to OOD data domains that have never been used in the training stage; existing DDIMs offer the flexibility to traverse the denoising chain bidirectionally for a pre-trained DDPM. We then demonstrate through theoretical and empirical studies that such established OOD Gaussian priors are practically separable from the originally trained ones after inversion. The above analytical findings allow us to introduce our novel tuning-free paradigm to synthesize new images of the target unseen domain by discovering qualified OOD latent encodings within the inverted noisy latent spaces, which is fundamentally different from most existing paradigms that seek to modify the denoising trajectory to achieve the same goal by tuning the model parameters. Extensive cross-model and domain experiments show that our proposed method can expand the latent space and synthesize images in new domains via frozen DDPMs without impairing the generation quality of their original domains.

9. Iterative DPO with An Improvement Model for Fine-tuning Diffusion Models

[openreview] [pdf]

Abstract Direct Preference Optimization (DPO) has been proven as an effective solution in aligning generative models with human preferences. However, as shown in recent works, DPO could suffer from constraints from the offline preference dataset. This paper introduces a novel improvement approach for online iterative optimization of the diffusion models without introducing extra annotation of the online data. We propose to learn a preference improvement model to extract the implicit preference from the preference dataset. The learned improvement model is then used to generate winning images from the images generated by the current diffusion model. We can construct new pairs of preference data by using images generated by the current diffusion model as losing images, and its corresponding improved images as winning images. The diffusion model can therefore be optimized via iteratively applying online preference datasets. This method enables online improvement beyond offline DPO training without requiring additional human labeling or risking overfitting the reward model. Results demonstrate improvements in preference alignment with higher diversity compared with other fine-tuning methods. Our work bridges the gap between offline preference learning and online improvement, offering a promising direction for enhancing diffusion models in image generation tasks with limited preference data.

10. Adding Conditional Control to Diffusion Models with Reinforcement Learning

[openreview] [pdf]

Abstract Diffusion models are powerful generative models that allow for precise control over the characteristics of the generated samples. While these diffusion models trained on large datasets have achieved success, there is often a need to introduce additional controls in downstream fine-tuning processes, treating these powerful models as pre-trained diffusion models. This work presents a novel method based on reinforcement learning (RL) to add such controls using an offline dataset comprising inputs and labels. We formulate this task as an RL problem, with the classifier learned from the offline dataset and the KL divergence against pre-trained models serving as the reward functions. Our method, CTRL (Conditioning pre-Trained diffusion models with Reinforcement Learning), produces soft-optimal policies that maximize the abovementioned reward functions. We formally demonstrate that our method enables sampling from the conditional distribution with additional controls during inference. Our RL-based approach offers several advantages over existing methods. Compared to classifier-free guidance, it improves sample efficiency and can greatly simplify dataset construction by leveraging conditional independence between the inputs and additional controls. Additionally, unlike classifier guidance, it eliminates the need to train classifiers from intermediate states to additional controls.
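
As a rough illustration of the reward structure described (our assumed form, not the paper's exact objective), the reward combines a classifier log-likelihood for the added control with a KL penalty toward the pre-trained model:

```python
# Toy rendering of a classifier-plus-KL reward; the per-sample KL is
# approximated by a log-density difference, and all callables are placeholders.
import torch

def ctrl_reward(x, y, classifier, logp_finetuned, logp_pretrained, beta=0.1):
    log_lik = classifier(x).log_softmax(-1).gather(-1, y[:, None]).squeeze(-1)
    kl_proxy = logp_finetuned(x) - logp_pretrained(x)  # per-sample KL estimate
    return log_lik - beta * kl_proxy

# toy usage with a linear classifier and Gaussian-like log-densities
cls = torch.nn.Linear(8, 3)
x, y = torch.randn(4, 8), torch.tensor([0, 1, 2, 0])
r = ctrl_reward(x, y, cls,
                lambda x: -x.pow(2).sum(-1),
                lambda x: -(x - 0.1).pow(2).sum(-1))
```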

11. Diffusion Models Meet Contextual Bandits

[openreview] [pdf]

Abstract Efficient exploration in contextual bandits is crucial due to their large action space, where uninformed exploration can lead to computational and statistical inefficiencies. However, the rewards of actions are often correlated, which can be leveraged for more efficient exploration. In this work, we use pre-trained diffusion model priors to capture these correlations and develop diffusion Thompson sampling (dTS). We establish both theoretical and algorithmic foundations for dTS. Specifically, we derive efficient posterior approximations (required by dTS) under a diffusion model prior, which are of independent interest beyond bandits and reinforcement learning. We analyze dTS in linear instances and provide a Bayes regret bound. Our experiments validate our theory and demonstrate dTS’s favorable performance.

12. Influence-Guided Diffusion for Dataset Distillation

[openreview] [pdf]

Abstract Dataset distillation aims to streamline the training process by creating a compact yet effective dataset from a much larger original dataset. However, existing methods often struggle with distilling large, high-resolution datasets due to prohibitive resource costs and limited performance, primarily stemming from sample-wise optimization in pixel space. Motivated by the remarkable capabilities of diffusion generative models in learning target dataset distributions and controllably sampling high-quality data tailored to user needs, we propose framing dataset distillation as a controlled diffusion generation task that generates data specifically tailored for effective training. By establishing a correlation between the overarching objective of dataset distillation and the trajectory influence function, we introduce the Influence-Guided Diffusion (IGD) sampling framework, which generates training-effective data without the need to retrain diffusion models. An efficient guidance function is designed by leveraging the trajectory influence function as an indicator to steer diffusions to produce data with influence promotion and diversity enhancement. Extensive experiments show that the training performance of distilled datasets generated by diffusions can be significantly improved by integrating our IGD method, achieving state-of-the-art performance in distilling ImageNet datasets. In particular, an exceptional result is achieved on ImageNet-1K, reaching 60.3% at IPC=50.

13. Inverse Engineering Diffusion: Deriving Variance Schedules with Rationale

[openreview] [pdf]

Abstract A fundamental aspect of diffusion models is the variance schedule, which governs the evolution of variance throughout the diffusion process. Despite numerous studies exploring variance schedules, little effort has been made to understand the variance distributions implied by sampling from these schedules and how they benefit both training and data generation. We introduce a novel perspective on score-based diffusion models, bridging the gap between the variance schedule and its underlying variance distribution. Specifically, we propose the notion of sampling variance according to a probabilistic rationale, which induces a density. Our approach views the inverse of the variance schedule as a cumulative distribution function (CDF) and its first derivative as a probability density function (PDF) of the variance distribution. This formulation not only offers a unified view of variance schedules but also allows for the direct engineering of a variance schedule from the probabilistic rationale of its inverse function. Additionally, our framework is not limited to CDFs with closed-form inverse solutions, enabling the exploration of variance schedules that are unattainable through conventional methods. We present the tools required to obtain a diverse array of novel variance schedules tailored to specific rationales, such as separability metrics or prior beliefs. These schedules may exhibit varied dynamics, ranging from rapid convergence towards zero to prolonged periods in high-variance regions. Through comprehensive empirical evaluation, we demonstrate the efficacy of enhancing the performance of diffusion models with schedules distinct from those encountered during training. We provide a principled and unified approach to variance schedules in diffusion models, revealing the relationship between variance schedules and their underlying probabilistic rationales, which yields notable improvements in image generation performance, as measured by FID.
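
A small worked example makes the CDF view concrete. Our toy choice of rationale: take the variance density p(v) = a·v^(a-1) on [0, 1], whose CDF is F(v) = v^a; the schedule is then the inverse CDF, v(t) = t^(1/a), so sampling t uniformly yields variances distributed according to p.

```python
# Worked instance of "inverse of the schedule = CDF of the variance
# distribution", with an assumed toy density p(v) = a * v**(a-1).
import numpy as np

a = 3.0                        # a > 1 puts more mass in high-variance regions
t = np.random.rand(100_000)    # uniform timesteps, as in training
v = t ** (1.0 / a)             # variance schedule = inverse CDF

# check: the histogram of sampled variances matches p(v) = a * v**(a-1)
hist, edges = np.histogram(v, bins=50, range=(0.0, 1.0), density=True)
centers = 0.5 * (edges[:-1] + edges[1:])
print(np.abs(hist - a * centers ** (a - 1)).max())   # close to 0
```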

14. Progressive distillation induces an implicit curriculum

[openreview] [pdf]

Abstract Knowledge distillation leverages a teacher model to improve the training of a student model. A persistent challenge is that a better teacher does not always yield a better student, to which a common mitigation is to use additional supervision from several “intermediate” teachers. One empirically validated variant of this principle is progressive distillation, where the student learns from successive intermediate checkpoints of the teacher. Using sparse parity as a sandbox, we identify an implicit curriculum as one mechanism through which progressive distillation accelerates the student’s learning. This curriculum is available only through the intermediate checkpoints but not the final converged one, and imparts both empirical acceleration and a provable sample complexity benefit to the student. We then extend our investigation to Transformers trained on probabilistic context-free grammars (PCFGs) and real-world pre-training datasets (Wikipedia and Books). Through probing the teacher model, we identify an analogous implicit curriculum where the model progressively learns features that capture longer context. Our theoretical and empirical findings on sparse parity, complemented by empirical observations on more complex tasks, highlight the benefit of progressive distillation via implicit curriculum across setups.

15. One Step Diffusion via Shortcut Models

[openreview] [pdf]

Abstract Diffusion models and flow matching models have enabled generating diverse and realistic images by learning to transfer noise to data. However, sampling from these models involves iterative denoising over many neural network passes, making generation slow and expensive. Previous approaches for speeding up sampling require complex training regimes, such as multiple training phases, multiple networks, or fragile scheduling. We introduce Shortcut Models, a family of generative models that use a single network and training phase to produce high-quality samples in a single or multiple sampling steps. Shortcut models condition the network not only on the current noise level but also on the desired step size, allowing the model to skip ahead in the generation process. Across a wide range of sampling step budgets, shortcut models consistently produce higher quality samples than previous approaches, such as consistency models and reflow. Compared to distillation, shortcut models reduce complexity to a single network and training phase and additionally allow varying step budgets at inference time.
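
The mechanism reduces to a single extra conditioning input. The sketch below assumes an interface `net(x, t, dt)` for the trained shortcut network; with it, the same model serves any step budget.

```python
# Minimal shortcut-style sampling loop under an assumed network interface:
# one network, conditioned on noise level t and desired step size dt.
import torch

@torch.no_grad()
def shortcut_sample(net, shape, n_steps):
    x = torch.randn(shape)                 # start from pure noise
    dt = 1.0 / n_steps                     # desired step size is a model input
    t = 0.0
    for _ in range(n_steps):
        x = x + dt * net(x, t, dt)         # jump dt ahead in one evaluation
        t += dt
    return x

# toy stand-in net; a trained model would replace this lambda
net = lambda x, t, dt: -x
one_step = shortcut_sample(net, (4, 8), n_steps=1)     # one-step generation
refined = shortcut_sample(net, (4, 8), n_steps=128)    # larger budget
```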

16. Unveiling Concept Attribution in Diffusion Models

[openreview] [pdf]

Abstract Diffusion models have shown remarkable abilities in generating realistic and high-quality images from text prompts. However, a trained model remains a black box; little do we know about the role of its components in exhibiting a concept such as an object or a style. Recent works employ causal tracing to localize layers storing knowledge in generative models. In this work, we approach the problem from a more general perspective and pose the question: "How do model components work jointly to demonstrate knowledge?". We adapt component attribution to decompose diffusion models, unveiling how each component contributes to a concept. Our framework allows effective model editing; in particular, we can erase a concept from diffusion models by removing positive components while retaining knowledge of other concepts. Surprisingly, we also show that there exist components that contribute negatively to a concept, which have not been discovered by knowledge localization approaches. Experimental results confirm the roles of the positive and negative components pinpointed by our framework, depicting a complete view of interpreting generative models.

[openreview] [pdf]

Abstract Time-series forecasting finds broad applications in real-world scenarios. Due to the dynamic nature of time series data, it is crucial for time-series forecasting models to produce robust predictions under potential distribution shifts. In this paper, we first identify two types of distribution shift in time series: concept drift and temporal shift. We acknowledge that while existing studies primarily focus on addressing temporal shift in time series, designing proper concept drift methods for time series data has received comparatively less attention. Motivated by the need to mitigate potential concept drift in time-series forecasting, this work proposes a novel soft attention mechanism that effectively leverages and ensembles information from the horizon time series. Furthermore, recognizing that concept drift and temporal shift can occur concurrently in time-series forecasting scenarios while an integrated solution remains missing, this paper introduces ShifTS, a model-agnostic framework that seamlessly addresses both concept drift and temporal shift in time-series forecasting. Extensive experiments demonstrate the efficacy of ShifTS in consistently enhancing the forecasting accuracy of agnostic models across multiple datasets, and in consistently outperforming existing concept drift, temporal shift, and combined baselines.

18. Can Diffusion Models Disentangle? A Theoretical Perspective

[openreview] [pdf]

Abstract This paper introduces a novel theoretical framework to understand how diffusion models can learn disentangled representations under the assumption of an $\ell_2$-accurate score approximation. We also provide sufficient conditions under which such representations are beneficial for domain adaptation. Our theory offers new insights into how existing diffusion models disentangle latent variables across general distributions and suggests strategies to enhance their disentanglement capabilities. To validate our theory, we perform experiments using both synthetic data generated from latent subspace models and real speech data for non-parallel voice conversion - a canonical disentanglement problem. Across various classification tasks, we find that voice-conversion-based adaptation methods achieve significant improvements in classification accuracy, demonstrating their effectiveness as domain adaptors. Code will be released upon acceptance.

19. Fast Multi-Mode Adaptive Generative Distillation for Continually Learning Diffusion Models

[openreview] [pdf]

Abstract Diffusion models are powerful generative models, but their computational demands, vulnerability to catastrophic forgetting, and class imbalance in generated data pose significant challenges in continual learning scenarios. In this paper, we introduce Fast Multi-Mode Adaptive Generative Distillation (MAGD), a novel approach designed to address these three core challenges. MAGD combines generative replay and knowledge distillation, enhancing the continual training of diffusion models through three key innovations: (1) Noisy Intermediate Generative Distillation (NIGD), which leverages intermediate noisy images during the reverse diffusion process to improve data utility and preserve image quality without additional computational costs; (2) Class-guided generative distillation (CGGD), which uses classifier guidance to ensure balanced class representation in generated images, addressing the issue of class imbalance in traditional methods; and (3) Signal-Guided Generative Distillation (SGGD), which reduces computational overhead while maintaining image clarity through the reuse of the model’s denoising capabilities across tasks. Our experimental results on Fashion-MNIST, CIFAR-10, and CIFAR-100 demonstrate that MAGD significantly outperforms existing methods in both image quality, measured by Fréchet Inception Distance (FID), and class balance, measured by Kullback-Leibler Divergence (KLD). Moreover, MAGD achieves competitive results with far fewer generation steps compared to traditional methods, making it a practical solution for real-life continual learning applications.

20. A Tailored Framework for Aligning Diffusion Models with Human Preference

[openreview] [pdf]

Abstract The direct preference optimization (DPO) method has shown success in aligning text-to-image diffusion models with human preference. Previous approaches typically assume a consistent preference label between final generated images and their corresponding noisy samples at intermediate steps, and directly apply DPO to these noisy samples for fine-tuning. However, we identify a significant issue with this consistency assumption, as directly applying DPO to noisy samples from different generation trajectories based on final preference order may disrupt the optimization process. We first demonstrate the issues inherent in previous methods from two perspectives: gradient direction and preference order, and then propose a Tailored Preference Optimization (TailorPO) framework for aligning diffusion models with human preference, underpinned by some theoretical insights. Our approach directly ranks the preference order of intermediate noisy samples based on their step-wise reward, and effectively resolves the optimization direction issues through a simple yet efficient design. Additionally, to the best of our knowledge, we are the first to consider the distinct structure of diffusion models and leverage the gradient guidance in preference aligning to enhance the optimization effectiveness. Experimental results demonstrate that our method significantly improves the model’s ability to generate aesthetically pleasing and human-preferred images.

21. Optimal Targets for Concept Erasure in Diffusion Models and Where To Find Them

[openreview] [pdf]

Abstract Concept erasure has emerged as a promising technique for mitigating the risk of harmful content generation in diffusion models by selectively unlearning undesirable concepts. The common principle of previous works to remove a specific concept is to map it to a fixed generic concept, such as a neutral concept or just an empty text prompt. In this paper, we demonstrate that this fixed-target strategy is suboptimal, as it fails to account for the impact of erasing one concept on the others. To address this limitation, we model the concept space as a graph and empirically analyze the effects of erasing one concept on the remaining concepts. Our analysis uncovers intriguing geometric properties of the concept space, where the influence of erasing a concept is confined to a local region. Building on this insight, we propose the Adaptive Guided Erasure (AGE) method, which \emph{dynamically} selects neutral concepts tailored to each undesirable concept, minimizing unintended side effects. Experimental results show that AGE significantly outperforms state-of-the-art erasure methods on preserving unrelated concepts while maintaining effective erasure performance.

22. Protecting Minorities in Diffusion Models via Capacity Allocation

[openreview] [pdf]

Abstract Diffusion models have advanced quickly in image generation. However, their performance declines significantly on the imbalanced data commonly encountered in real-world scenarios. Current research on imbalanced diffusion models focuses on improving the objective function to facilitate knowledge transfer between majorities and minorities, thereby enhancing the generation of minority samples. In this paper, we make the first attempt to address the imbalanced data challenges in diffusion models from the perspective of model capacity. Specifically, majorities occupy most of the model capacity because of their larger representation, consequently restricting the capacity available for minority classes. To tackle this challenge, we propose Protecting Minorities via Capacity ALLocation (CALL). We reserve capacity for minority expertise by low-rank decomposing the model parameters and allocate the corresponding knowledge to the reserved model capacity through a capacity allocation loss function. Extensive experiments demonstrate that our method, which is orthogonal to existing methods, consistently and significantly improves the robustness of diffusion models on imbalanced data.
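
Schematically, the capacity reservation can be pictured as a base weight plus a reserved low-rank term routed toward minority samples. The sketch below is our illustrative reading; the paper's capacity allocation loss and the exact gating mechanism are assumed away.

```python
# Illustrative capacity-reserved layer: shared base weights plus a low-rank
# pathway reserved for minority classes. Gating and loss are assumptions.
import torch
import torch.nn as nn

class CapacityAllocLinear(nn.Module):
    def __init__(self, d_in, d_out, rank=4):
        super().__init__()
        self.base = nn.Linear(d_in, d_out)               # shared capacity
        self.U = nn.Parameter(torch.randn(d_out, rank) * 0.01)
        self.V = nn.Parameter(torch.randn(rank, d_in) * 0.01)

    def forward(self, x, minority_gate):
        # minority_gate in [0, 1]: routes minority samples through the
        # reserved low-rank pathway in addition to the base weights
        return self.base(x) + minority_gate * (x @ self.V.T @ self.U.T)

layer = CapacityAllocLinear(32, 32)
y_minor = layer(torch.randn(4, 32), minority_gate=1.0)
y_major = layer(torch.randn(4, 32), minority_gate=0.0)
```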

23. DC-DPM: A Divide-and-Conquer Approach for Diffusion Reverse Process

[openreview] [pdf]

Abstract Diffusion models have achieved great success in generative tasks. However, previous approaches typically approximate the reversed transition kernel with a Gaussian distribution. This approximation can diverge from real scenarios, necessitating multiple iterative steps for high-quality sample generation and limiting the real-time inference performance of diffusion models. In this paper, we propose a Divide-and-Conquer strategy to improve the traditional single-Gaussian transition kernel representation in each denoising step of Diffusion Probabilistic Models (DC-DPM), thus enhancing generation quality particularly over a limited number of timesteps. By dividing the data into clusters, our DC-DPM learns specific kernels for each partition. We design two merging strategies for these cluster-specific kernels along with corresponding training and sampling methods. We provide theoretical proof of DC-DPM’s convergence to the true data distribution from a novel perspective. Experimental results demonstrate the superior generation quality of our method compared to the traditional single Gaussian kernel. Furthermore, our DC-DPM can synergize with previous kernel optimization methods, enhancing their generation quality, especially with a small number of timesteps.

24. Adaptive Concept Bottleneck for Foundation Models Under Distribution Shifts

[openreview] [pdf]

Abstract Advancements in foundation models (FMs) have led to a paradigm shift in machine learning. The rich, expressive feature representations from these pre-trained, large-scale FMs are leveraged for multiple downstream tasks, usually via lightweight fine-tuning of a shallow fully-connected network following the representation. However, the non-interpretable, black-box nature of this prediction pipeline can be a challenge, especially in critical domains, such as healthcare, finance, and security. In this paper, we explore the potential of Concept Bottleneck Models (CBMs) for transforming complex, non-interpretable foundation models into interpretable decision-making pipelines using high-level concept vectors. Specifically, we focus on the test-time deployment of such an interpretable CBM pipeline “in the wild”, where the distribution of inputs often shifts from the original training distribution. We first identify the potential failure modes of such pipelines under different types of distribution shifts. Then we propose an adaptive concept bottleneck framework to address these failure modes, that dynamically adapts the concept-vector bank and the prediction layer based solely on unlabeled data from the target domain, without access to the source dataset. Empirical evaluations with various real-world distribution shifts show our framework produces concept-based interpretations better aligned with the test data and boosts post-deployment accuracy by up to 28%, aligning CBM performance with that of non-interpretable classification.

25. Latent Weight Diffusion: Generating policies from trajectories

[openreview] [pdf]

Abstract With the increasing availability of open-source robotic data, imitation learning has emerged as a viable approach for both robot manipulation and locomotion. Currently, large generalized policies are trained to predict controls or trajectories using diffusion models, which have the desirable property of learning multimodal action distributions. However, generalizability comes with a cost — namely, larger model size and slower inference. Further, there is a known trade-off between performance and action horizon for Diffusion Policy (i.e., diffusing trajectories): fewer diffusion queries accumulate greater trajectory tracking errors. Thus, it is common practice to run these models at high inference frequency, subject to robot computational constraints. To address these limitations, we propose Latent Weight Diffusion (LWD), a method that uses diffusion to learn a distribution over policies for robotic tasks, rather than over trajectories. Our approach encodes demonstration trajectories into a latent space and then decodes them into policies using a hypernetwork. We employ a diffusion denoising model within this latent space to learn its distribution. We demonstrate that LWD can reconstruct the behaviors of the original policies that generated the trajectory dataset. LWD offers the benefits of considerably smaller policy networks during inference and requires fewer diffusion model queries. When tested on the Metaworld MT10 benchmark, LWD achieves a higher success rate compared to a vanilla multi-task policy, while using models up to ∼18x smaller during inference. Additionally, since LWD generates closed-loop policies, we show that it outperforms Diffusion Policy in long action horizon settings, with reduced diffusion queries during rollout.
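
The hypernetwork decode is the distinctive step. Below is a minimal sketch under an assumed interface: a latent z (which in LWD would come from the latent diffusion model) is mapped to the flattened weights of a small policy MLP.

```python
# Schematic hypernetwork decode: latent vector -> weights of a policy MLP.
# Dimensions and architecture are illustrative assumptions.
import torch
import torch.nn as nn

obs_dim, act_dim, hid = 8, 2, 32
n_params = obs_dim * hid + hid + hid * act_dim + act_dim

hypernet = nn.Sequential(nn.Linear(16, 256), nn.ReLU(), nn.Linear(256, n_params))

def decode_policy(z):
    """Split the hypernetwork output into the policy MLP's weight matrices."""
    w = hypernet(z)
    i = 0
    W1 = w[i:i + obs_dim * hid].view(hid, obs_dim); i += obs_dim * hid
    b1 = w[i:i + hid]; i += hid
    W2 = w[i:i + hid * act_dim].view(act_dim, hid); i += hid * act_dim
    b2 = w[i:i + act_dim]
    return lambda obs: torch.tanh(obs @ W1.T + b1) @ W2.T + b2

policy = decode_policy(torch.randn(16))   # z would be sampled by the diffusion
action = policy(torch.randn(obs_dim))     # closed-loop: cheap per-step inference
```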

26. Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning

[openreview] [pdf]

Abstract Controllable generation through Stable Diffusion (SD) fine-tuning aims to improve fidelity, safety, and alignment with human guidance. Existing reinforcement learning from human feedback methods usually rely on predefined heuristic reward functions or pretrained reward models built on large-scale datasets, limiting their applicability to scenarios where collecting such data is costly or difficult. To effectively and efficiently utilize human feedback, we develop a framework, HERO, which leverages online human feedback collected on the fly during model learning. Specifically, HERO features two key mechanisms: (1) Feedback-Aligned Representation Learning, an online training method that captures human feedback and provides informative learning signals for fine-tuning, and (2) Feedback-Guided Image Generation, which involves generating images from SD’s refined initialization samples, enabling faster convergence towards the evaluator’s intent. We demonstrate that HERO is 4x more efficient in online feedback for body part anomaly correction compared to the best existing method. Additionally, experiments show that HERO can effectively handle tasks like reasoning, counting, personalization, and reducing NSFW content with only 0.5K online feedback.

27. How do diffusion models learn and generalize on abstract rules for reasoning?

[openreview] [pdf]

Abstract Diffusion models excel in generating and completing patterns in images. But how good is their ability to learn hidden rules from samples and to generate and reason according to such rules, or even generalize to similar rules? We trained a wide family of unconditional diffusion models on Raven's progressive matrices to study this precisely. We quantified their capability to generate structurally consistent samples and to complete missing parts according to hidden rules. We found diffusion models can synthesize novel samples consistent with rules without memorizing the training set, much better than GPT2 trained on the same data. They memorized and recombined local parts of the training samples to create new rule-conforming samples. When tasked to complete the missing panel with inpainting techniques, advanced sampling techniques were needed to perform well. Moreover, their pattern completion capability can generalize to rules unseen during training. Finally, through generative training on rule data, a robust rule representation rapidly emerged in the diffusion model, which could linearly classify rules at 99.8% test accuracy. Our results suggest diffusion training is a useful paradigm for reasoning and for learning representations for downstream tasks, even for abstract rule data.

28. O(d/T) Convergence Theory for Diffusion Probabilistic Models under Minimal Assumptions

[openreview] [pdf]

Abstract Score-based diffusion models, which generate new data by learning to reverse a diffusion process that perturbs data from the target distribution into noise, have achieved remarkable success across various generative tasks. Despite their superior empirical performance, existing theoretical guarantees are often constrained by stringent assumptions or suboptimal convergence rates. In this paper, we establish a fast convergence theory for a popular SDE-based sampler under minimal assumptions. Our analysis shows that, provided $\ell_2$-accurate estimates of the score functions, the total variation distance between the target and generated distributions is upper bounded by $O(d/T)$ (ignoring logarithmic factors), where $d$ is the data dimensionality and $T$ is the number of steps. This result holds for any target distribution with a finite first-order moment. To our knowledge, this improves upon existing convergence theory for both the SDE-based sampler and another ODE-based sampler, while imposing minimal assumptions on the target data distribution and score estimates. This is achieved through a novel set of analytical tools that provides a fine-grained characterization of how the error propagates at each step of the reverse process.
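
Schematically (our paraphrase, with constants and logarithmic factors suppressed), the guarantee reads:

```latex
% Schematic paraphrase of the stated result; constants and log factors
% suppressed, $\ell_2$-accurate score estimates assumed.
\mathsf{TV}\left(q_{\mathrm{data}},\, q_{\mathrm{sampler}}\right)
  \;\lesssim\; \frac{d}{T},
\qquad \text{for any } q_{\mathrm{data}} \text{ with finite first moment.}
```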

29. Diffusion Models are Evolutionary Algorithms

[openreview] [pdf]

Abstract In a convergence of machine learning and biology, we reveal that diffusion models are evolutionary algorithms. By considering evolution as a denoising process and reversed evolution as diffusion, we mathematically demonstrate that diffusion models inherently perform evolutionary algorithms, naturally encompassing selection, mutation, and reproductive isolation. Building on this equivalence, we propose the Diffusion Evolution method: an evolutionary algorithm utilizing iterative denoising -- as originally introduced in the context of diffusion models -- to heuristically refine solutions in parameter spaces. Unlike traditional approaches, Diffusion Evolution efficiently identifies multiple optimal solutions and outperforms prominent mainstream evolutionary algorithms. Furthermore, leveraging advanced concepts from diffusion models, namely latent space diffusion and accelerated sampling, we introduce Latent Space Diffusion Evolution, which finds solutions for evolutionary tasks in high-dimensional complex parameter space while significantly reducing computational steps. This parallel between diffusion and evolution not only bridges two different fields but also opens new avenues for mutual enhancement, raising questions about open-ended evolution and potentially utilizing non-Gaussian or discrete diffusion models in the context of Diffusion Evolution.
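
The evolution-as-denoising correspondence can be played out on a toy objective. The following is an illustrative analogue, not the paper's exact update rule: each individual moves toward a fitness- and proximity-weighted average of the population (a denoising-style step) while injected noise anneals away.

```python
# Toy denoising-as-evolution loop on a two-optima objective; the weighting
# scheme and schedules are our assumptions for illustration.
import numpy as np

def fitness(x):                       # toy objective with two optima at +/-2
    return np.exp(-np.sum((x - 2) ** 2, -1)) + np.exp(-np.sum((x + 2) ** 2, -1))

rng = np.random.default_rng(0)
pop = rng.normal(0, 4, size=(256, 2))          # population starts as "noise"
for alpha in np.linspace(0.05, 1.0, 60):
    f = fitness(pop)
    # each individual estimates its denoised target from the population,
    # weighted by fitness and proximity (kernel width shrinks over time)
    d2 = ((pop[:, None, :] - pop[None, :, :]) ** 2).sum(-1)
    w = f[None, :] * np.exp(-d2 / (2 * (4 * (1 - alpha) + 0.1) ** 2))
    target = (w[:, :, None] * pop[None, :, :]).sum(1) / w.sum(1, keepdims=True)
    noise = rng.normal(0, np.sqrt(max(1 - alpha, 0.0)), pop.shape)
    pop = alpha * target + (1 - alpha) * pop + 0.5 * noise

print(np.round(pop[:5], 2))   # individuals concentrate near both optima
```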

30. How and how well do diffusion models improve adversarial robustness?

[openreview] [pdf]

Abstract Recent findings suggest that diffusion models significantly enhance empirical adversarial robustness. While some intuitive explanations have been proposed, the precise mechanisms underlying these improvements remain unclear. In this work, we systematically investigate how and how well diffusion models improve adversarial robustness. First, we observe that diffusion models intriguingly increase—rather than decrease—the $\ell_p$ distances to clean samples; this is the opposite of what was believed previously. Second, we find that the purified images are heavily influenced by the internal randomness of diffusion models. To properly evaluate the robustness of systems with inherent randomness, we introduce the concept of fuzzy adversarial robustness, and find that empirically a substantial fraction of adversarial examples are fuzzy in nature. Finally, by leveraging a hyperspherical cap model of adversarial regions, we show that diffusion models increase robustness by dramatically compressing the image space. Our findings provide novel insights into the mechanisms behind the robustness improvements of diffusion-model-based purification and offer guidance for the development of more efficient adversarial purification systems.

31. Distributionally Robust Policy Learning under Concept Drifts

[openreview] [pdf]

Abstract Distributionally robust policy learning aims to find a policy that performs well under the worst-case distributional shift, yet most existing methods for robust policy learning consider the worst-case joint distribution of the covariate and the outcome. This joint-modeling strategy can be unnecessarily conservative when we have more information on the source of distributional shifts. This paper studies a more nuanced problem --- robust policy learning under concept drift, when only the conditional relationship between the outcome and the covariate changes. To this end, we first provide a doubly-robust estimator for evaluating the worst-case average reward of a given policy under a set of perturbed conditional distributions. We show that the policy value estimator enjoys asymptotic normality even if the nuisance parameters are estimated at a slower-than-root-$n$ rate. We then propose a learning algorithm that outputs the policy maximizing the estimated policy value within a given policy class $\Pi$, and show that the sub-optimality gap of the proposed algorithm is of the order $\kappa(\Pi) n^{-1/2}$, where $\kappa(\Pi)$ is the entropy integral of $\Pi$ under the Hamming distance and $n$ is the sample size. The proposed methods are implemented and evaluated in numerical studies, demonstrating substantial improvement compared with existing benchmarks.

32. Representative Guidance: Diffusion Model Sampling with Consistency

[openreview] [pdf]

Abstract The diffusion sampling process faces a persistent challenge stemming from its incoherence, attributable to varying noise directions across different time steps. Our Representative Guidance (RepG) offers a new perspective to handle this issue by reformulating the sampling process with a coherent direction towards a representative target. In this formulation, while the classic classifier guidance improves feature discernment by steering the model away from ambiguous features, it fails to provide a favorable representative target, since the class label is overly compact and leads to sacrificed diversity and the adversarial generation problem. In contrast, we leverage self-supervised representations as the coherent target and treat sampling as a downstream task, which refines image details and corrects errors rather than settling for simpler samples. Our representative guidance achieves superior performance and also illustrates the potential of pre-trained self-supervised models in image sampling. Our findings demonstrate that RepG not only substantially enhances vanilla diffusion sampling but also surpasses state-of-the-art benchmarks when combined with the classifier-free guidance. Our code will be released.

33. Diffusion Transportation Cost for Domain Adaptation

[openreview] [pdf]

Abstract In recent years, there has been considerable interest in leveraging the Optimal Transport (OT) problem for domain adaptation, a strategy shown to be highly effective. However, a less explored aspect is the choice of the transportation cost function, as most existing methods rely on the pairwise squared Euclidean distances for the transportation cost, potentially overlooking important intra-domain geometries. This paper presents Diffusion-OT, a new transport cost for the OT problem, designed specifically for domain adaptation. By utilizing concepts and tools from the field of manifold learning, specifically diffusion geometry, we derive an operator that accounts for the intra-domain relationships, thereby extending beyond the conventional inter-domain distances. This operator, which quantifies the probability of transporting between source and target samples, forms the basis for our transportation cost. We provide proof that the proposed operator is in fact a diffusion operator, demonstrating that the cost function is defined by an anisotropic diffusion process between the domains. In addition, to enhance performance, we integrate source labels into the operator, thereby guiding the anisotropic diffusion according to the classes. We showcase the effectiveness of Diffusion-OT through comprehensive experiments, demonstrating its superior performance compared to recent methods across various benchmarks and datasets.

34. Improved Convergence Rate for Diffusion Probabilistic Models

[openreview] [pdf]

Abstract Score-based diffusion models have achieved remarkable empirical performance in the field of machine learning and artificial intelligence for their ability to generate high-quality new data instances from complex distributions. Improving our understanding of diffusion models, notably via convergence analysis, has attracted a lot of interest. Despite many theoretical attempts, a significant gap remains between theory and practice. To close this gap, we establish an iteration complexity of order $d^{1/3}\varepsilon^{-2/3}$, which improves on $d^{5/12}\varepsilon^{-1}$, the best known complexity achieved before our work. This convergence analysis is based on a randomized midpoint method, which was first proposed for log-concave sampling \citep{Shen2019TheRandomized} and then extended to diffusion models by \citet{Gupta2024Faster}. Our theory accommodates $\varepsilon$-accurate score estimates and does not require log-concavity of the target distribution. Moreover, the algorithm can also be parallelized to run in only $O(\log^2(d/\varepsilon))$ parallel rounds, in a similar way to prior works.
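
For readers unfamiliar with the randomized midpoint method the analysis builds on, here is a minimal sketch for a generic ODE x' = b(x, t); its adaptation to diffusion samplers (following the cited works) is omitted here.

```python
# Randomized midpoint step for x' = b(x, t): pick a random point inside the
# step, take a cheap predictor there, then use that slope for the full step.
import numpy as np

def randomized_midpoint_step(x, t, h, b, rng):
    u = rng.random()                    # random location within the step
    x_mid = x + u * h * b(x, t)         # cheap predictor to the midpoint
    return x + h * b(x_mid, t + u * h)  # full step using the midpoint slope

rng = np.random.default_rng(0)
x, t, h = 1.0, 0.0, 0.01
for _ in range(100):                    # integrate x' = -x over [0, 1]
    x = randomized_midpoint_step(x, t, h, lambda x, t: -x, rng)
    t += h
print(x, np.exp(-1))                    # close to the exact solution exp(-1)
```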

35. Exploration by Running Away from the Past

[openreview] [pdf]

Abstract The ability to explore efficiently and effectively is a central challenge of reinforcement learning. In this work, we consider exploration through the lens of information theory. Specifically, we cast exploration as a problem of maximizing the Shannon entropy of the state occupation measure. This is done by maximizing a sequence of divergences between distributions representing an agent’s past behavior and its current behavior. Intuitively, this encourages the agent to explore new behaviors that are distinct from past behaviors. Hence, we call our method RAMP, for “Running Away from the Past.” A fundamental question for this method is the quantification of the distribution change over time. We consider both the Kullback-Leibler divergence and the Wasserstein distance to quantify the divergence between successive state occupation measures, and explain why the former might lead to undesirable exploratory behaviors in some tasks. We demonstrate that by encouraging the agent to explore by actively distancing itself from past experiences, it can effectively explore mazes and a wide range of behaviors on robotic manipulation and locomotion tasks.

36. Broadening Target Distributions for Accelerated Diffusion Models via a Novel Analysis Approach

[openreview] [pdf]

Abstract Accelerated diffusion models hold the potential to significantly enhance the efficiency of standard diffusion processes. Theoretically, these models have been shown to achieve faster convergence rates than the standard $\mathcal{O}(1/\epsilon^2)$ rate of vanilla diffusion models, where $\epsilon$ denotes the target accuracy. However, current theoretical studies have established the acceleration advantage only for restrictive target distribution classes, such as those with smoothness conditions imposed along the entire sampling path or with bounded support. In this work, we significantly broaden the target distribution classes with a new accelerated stochastic DDPM sampler. In particular, we show that it achieves accelerated performance for three broad distribution classes not considered before. Our first class relies on a smoothness condition posed only on the target density $q_0$, which is far more relaxed than the existing smoothness conditions posed on all $q_t$ along the entire sampling path. Our second class requires only a finite second moment condition, allowing for a much wider class of target distributions than the existing finite-support condition. Our third class is Gaussian mixtures, for which our result establishes the first acceleration guarantee. Moreover, among accelerated DDPM-type samplers, our results specialized to bounded-support distributions show an improved dependency on the data dimension $d$. Our analysis introduces a novel technique for establishing performance guarantees via constructing a tilting factor representation of the convergence error and utilizing Tweedie’s formula to handle Taylor expansion terms. This new analytical framework may be of independent interest.

37. Unstable Unlearning: The Hidden Risk of Concept Resurgence in Diffusion Models

[openreview] [pdf]

Abstract Text-to-image diffusion models rely on massive, web-scale datasets. Training them from scratch is computationally expensive, and as a result, developers often prefer to make incremental updates to existing models. These updates often compose fine-tuning steps (to learn new concepts or improve model performance) with “unlearning” steps (to “forget” existing concepts, such as copyrighted data or the ability to generate explicit content). In this work, we demonstrate a critical and previously unknown vulnerability that arises in this paradigm: even under benign, non-adversarial conditions, fine-tuning a text-to-image diffusion model on seemingly unrelated images can cause it to “relearn” concepts that were previously “unlearned.” We comprehensively investigate the causes and scope of this phenomenon, which we term concept resurgence, by performing a series of experiments based on fine-tuning Stable Diffusion v1.4 alongside “mass concept erasure”, the current state of the art for unlearning in text-to-image diffusion models (Lu et al., 2024). Our findings underscore the fragility of composing incremental model updates, and raise new serious concerns about current approaches to ensuring the safety and alignment of text-to-image diffusion models.

38. How to Find the Exact Pareto Front for Multi-Objective MDPs?

[openreview] [pdf]

Abstract Multi-objective Markov Decision Processes (MDPs) are receiving increasing attention, as real-world decision-making problems often involve conflicting objectives that cannot be addressed by a single-objective MDP. The Pareto front identifies the set of policies that cannot be dominated, providing a foundation for finding Pareto optimal solutions that can efficiently adapt to various preferences. However, finding the Pareto front is a highly challenging problem. Most existing methods either (i) rely on traversing the continuous preference space, which is impractical and results in approximations that are difficult to evaluate against the true Pareto front, or (ii) focus solely on deterministic Pareto optimal policies, from which there are no known techniques to characterize the full Pareto front. Moreover, the structure of the Pareto front itself remains unclear even in the context of dynamic programming, where the MDP is fully known in advance. In this work, we address the challenge of efficiently discovering the Pareto front. By investigating the geometric structure of the Pareto front in MO-MDPs, we uncover a key property: the Pareto front is on the boundary of a convex polytope whose vertices all correspond to deterministic policies, and neighboring vertices of the Pareto front differ by only one state-action pair of the deterministic policy, almost surely. This insight transforms the global comparison across all policies into a localized search among deterministic policies that differ by only one state-action pair, drastically reducing the complexity of searching for the exact Pareto front. We develop an efficient algorithm that identifies the vertices of the Pareto front by solving a single-objective MDP only once and then traversing the edges of the Pareto front, making it more efficient than existing methods. Furthermore, the entire Pareto front can be found in $V$ iterations, where $V$ is the number of vertices on the Pareto front. Our empirical studies demonstrate the effectiveness of our theoretical strategy in discovering the Pareto front efficiently.

39. APCtrl: Adding Conditional Control to Diffusion Models by Alternative Projection

[openreview] [pdf]

Abstract Enhancing the versatility of pretrained diffusion models through advanced conditioning techniques is crucial for improving their applicability. We present APCtrl, a novel conditional image generation approach that formulates the latent $z_t$ at timestep $t$ as the projection $z_t = \text{Proj}_{\mathfrak{D}_t}(z_{t+1})$ onto the denoising set $\mathfrak{D}_t$. For conditional control, APCtrl integrates the condition set $\mathfrak{C}_t$, defined by a latent control network $\mathcal{A}_{\theta}(\cdot, \cdot)$. Our method simplifies conditional sampling to the recursive projection $z_t = \text{Proj}_{\mathfrak{I}_t} \circ \text{Proj}_{\mathfrak{D}_t}(z_{t+1})$, where each projection step integrates both the diffusion and condition priors. By employing Alternative Projection, our approach offers several key advantages: 1. Multi-Condition Generation: easily expandable with additional conditional sets; 2. Model and Sampling Agnosticism: works with any model or sampling method; 3. Unified Control Loss: simplifies the management of diverse control applications; 4. Efficiency: delivers comparable control with reduced training and sampling times. Extensive experiments demonstrate the superior performance of our method.

40Choose Your Anchor Wisely: Effective Unlearning Diffusion Models via Concept Reconditioning

[openreview] [pdf]

Abstract Large-scale conditional diffusion models (DMs) have demonstrated exceptional ability in generating high-quality images from textual descriptions, gaining widespread use across various domains. However, these models also carry the risk of producing harmful, sensitive, or copyrighted content, creating a pressing need to remove such information from their generation capabilities. While retraining from scratch is prohibitively expensive, machine unlearning provides a more efficient solution by selectively removing undesirable knowledge while preserving utility. In this paper, we introduce COncept REconditioning (CORE), a simple yet effective approach for unlearning diffusion models. Similar to some existing approaches, CORE guides the noise predictor conditioned on forget concepts towards an anchor generated from alternative concepts. However, CORE introduces key differences in the choice of anchor and retain loss, which contribute to its enhanced performance. We evaluate the unlearning effectiveness and retainability of CORE on UnlearnCanvas. Extensive experiments demonstrate that CORE surpasses state-of-the-art methods, including its close variants, and achieves near-perfect performance, especially when we aim to forget multiple concepts. Further ablation studies show that CORE’s careful selection of the anchor and retain loss is critical to its superior performance.

41Enhancing Dataset Distillation with Concurrent Learning: Addressing Negative Correlations and Catastrophic Forgetting in Trajectory Matching

[openreview] [pdf]

Abstract Dataset distillation generates a small synthetic dataset on which a model is trained to achieve performance comparable to that obtained on a complete dataset. Current state-of-the-art methods primarily focus on Trajectory Matching (TM), which optimizes the synthetic dataset by matching its training trajectory with that from the real dataset. Due to convergence issues and numerical instability, it is impractical to match the entire trajectory in one go; typically, a segment is sampled for matching at each iteration. However, previous TM-based methods overlook the potential interactions between matching different segments, particularly the presence of negative correlations. To study this problem, we conduct a quantitative analysis of the correlation between matching different segments and discover varying degrees of negative correlation depending on the images per class (IPC). Such negative correlation can increase the accumulated trajectory error and turn trajectory matching into a continual learning paradigm, potentially causing catastrophic forgetting. To tackle this issue, we propose a concurrent learning-based trajectory matching that simultaneously matches multiple segments. Extensive experiments demonstrate that our method consistently surpasses previous TM-based methods on CIFAR-10, CIFAR-100, Tiny ImageNet, and ImageNet-1K.
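A minimal sketch of the concurrent idea, assuming a hypothetical match_segment(seg) that returns the trajectory-matching loss for one sampled segment: rather than optimizing one segment per iteration, several segments are matched in a single update, so negatively correlated segments are reconciled by one gradient step.

```python
def concurrent_matching_loss(match_segment, segments):
    # Sum the matching losses of several trajectory segments; optimizing
    # the summed loss avoids the sequential, continual-learning-style
    # updates that can cause catastrophic forgetting across segments.
    return sum(match_segment(seg) for seg in segments)
```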

42Discrete Distribution Networks

[openreview] [pdf]

Abstract We introduce a novel generative model, the Discrete Distribution Networks (DDN), that approximates the data distribution using hierarchical discrete distributions. We posit that since the features within a network inherently capture distributional information, enabling the network to generate multiple samples simultaneously, rather than a single output, may offer an effective way to represent distributions. Therefore, DDN fits the target distribution, including continuous ones, by generating multiple discrete sample points. To capture finer details of the target data, DDN selects the output that is closest to the Ground Truth (GT) from the coarse results generated in the first layer. This selected output is then fed back into the network as a condition for the second layer, thereby generating new outputs more similar to the GT. As the number of DDN layers increases, the representational space of the outputs expands exponentially, and the generated samples become increasingly similar to the GT. This hierarchical output pattern of discrete distributions endows DDN with a unique property: more general zero-shot conditional generation. We demonstrate the efficacy of DDN and its intriguing properties through experiments on CIFAR-10 and FFHQ.
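The layer-wise selection step is easy to picture in code. A minimal sketch, assuming each layer emits a tensor of K candidate samples and selection is by L2 distance to the ground truth (distance metric and shapes are illustrative assumptions):

```python
import torch

def select_closest(candidates, gt):
    # candidates: (K, ...) samples emitted by one DDN layer; gt: (...) target.
    # Pick the candidate nearest to the ground truth; it is fed back as the
    # condition for the next, finer layer of the hierarchy.
    dists = ((candidates - gt.unsqueeze(0)) ** 2).flatten(1).sum(dim=1)
    return candidates[dists.argmin()]
```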

43Anti-Exposure Bias in Diffusion Models via Prompt Learning

[openreview] [pdf]

Abstract Diffusion models (DMs) have achieved record-breaking performance in image generation tasks. Nevertheless, in practice, the training-sampling discrepancy, caused by score estimation error and discretization error, limits the modeling ability of DMs, a phenomenon known as exposure bias. To alleviate such exposure bias and further improve the generative performance, we put forward a prompt learning framework built upon a lightweight prompt prediction model. Concretely, our model learns an anti-bias prompt for the generated sample at each sampling step, aiming to compensate for the exposure bias that arises. Following this design philosophy, our framework rectifies the sampling trajectory to match the training trajectory, thereby reducing the divergence between the target data distribution and the modeling distribution. To train the prompt prediction model, we simulate exposure bias by constructing training data and introduce a time-dependent weighting function for optimization. Empirical results on various DMs demonstrate the superiority of our prompt learning framework across three benchmark datasets. Importantly, the optimized prompt prediction model effectively improves image quality with only a 5% increase in sampling overhead, which remains negligible.

44Direct Distributional Optimization for Provable Alignment of Diffusion Models

[openreview] [pdf]

Abstract We introduce a novel alignment method for diffusion models from distribution optimization perspectives while providing rigorous convergence guarantees. We first formulate the problem as a generic regularized loss minimization over probability distributions and directly optimize the distribution using the Dual Averaging method. Next, we enable sampling from the learned distribution by approximating its score function via Doob’s h-transform technique. The proposed framework is supported by rigorous convergence guarantees and an end-to-end bound on the sampling error, which imply that when the original distribution’s score is known accurately, the complexity of sampling from shifted distributions is independent of isoperimetric conditions. This framework is broadly applicable to general distribution optimization problems, including alignment tasks in Reinforcement Learning with Human Feedback (RLHF), Direct Preference Optimization (DPO), and Kahneman-Tversky Optimization (KTO). We empirically validate its performance on synthetic and image datasets using the DPO objective.

45Stabilizing the Kumaraswamy Distribution

[openreview] [pdf]

Abstract Large-scale latent variable models require expressive continuous distributions that support efficient sampling and low-variance differentiation, achievable through the reparameterization trick. The Kumaraswamy (KS) distribution is both expressive and supports the reparameterization trick with a simple closed-form inverse CDF. Yet, its adoption remains limited. We identify and resolve numerical instabilities in the inverse CDF and log-pdf, exposing issues in libraries like PyTorch and TensorFlow. We then introduce simple and scalable latent variable models based on the KS, improving exploration-exploitation trade-offs in contextual multi-armed bandits and enhancing uncertainty quantification for link prediction with graph neural networks. Our results support the stabilized KS distribution as a core component in scalable variational models for bounded latent variables.
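To make the instability concrete: the KS inverse CDF is $F^{-1}(u) = (1-(1-u)^{1/b})^{1/a}$, and the subtraction $1-(1-u)^{1/b}$ cancels catastrophically when $(1-u)^{1/b}$ is close to 1. A minimal sketch of the kind of log-space stabilization the abstract alludes to (the paper's exact fix may differ):

```python
import torch

def kumaraswamy_icdf_stable(u, a, b):
    # F^{-1}(u) = (1 - (1 - u)^(1/b))^(1/a), evaluated in log space so the
    # inner subtraction never suffers catastrophic cancellation.
    t = torch.log1p(-u) / b                  # log((1 - u)^(1/b)), always <= 0
    log_inner = torch.log(-torch.expm1(t))   # log(1 - (1 - u)^(1/b))
    return torch.exp(log_inner / a)
```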

46Dynamic Negative Guidance of Diffusion Models

[openreview] [pdf]

Abstract Negative Prompting (NP) is widely utilized in diffusion models, particularly in text-to-image applications, to prevent the generation of undesired features. In this paper, we show that conventional NP is limited by the assumption of a constant guidance scale, which may lead to highly suboptimal results, or even complete failure, due to the non-stationarity and state-dependence of the reverse process. Based on this analysis, we derive a principled technique called Dynamic Negative Guidance (DNG), which relies on a near-optimal time- and state-dependent modulation of the guidance without requiring additional training. Unlike NP, negative guidance requires estimating the posterior class probability during the denoising process, which is achieved with limited additional computational overhead by tracking the discrete Markov Chain during the generative process. We evaluate the performance of DNG on class removal on MNIST and CIFAR10, where we show that DNG leads to higher safety, better preservation of class balance, and higher image quality than baseline methods. Furthermore, we show that it is possible to use DNG with Stable Diffusion to obtain more accurate and less invasive guidance than NP.

47One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation

[openreview] [pdf]

Abstract Diffusion models, praised for their success in generative tasks, are increasingly being applied to robotics, demonstrating exceptional performance in behavior cloning. However, their slow generation process stemming from iterative denoising steps poses a challenge for real-time applications in resource-constrained robotics setups and dynamically changing environments. In this paper, we introduce the One-Step Diffusion Policy (OneDP), a novel approach that distills knowledge from pre-trained diffusion policies into a single-step action generator, significantly accelerating response times for robotic control tasks. We ensure the distilled generator closely aligns with the original policy distribution by minimizing the Kullback-Leibler (KL) divergence along the diffusion chain, requiring only 2%-10% additional pre-training cost for convergence. We evaluated OneDP on 6 challenging simulation tasks as well as 4 self-designed real-world tasks using the Franka robot. The results demonstrate that OneDP not only achieves state-of-the-art success rates but also delivers an order-of-magnitude improvement in inference speed, boosting action prediction frequency from 1.5 Hz to 62 Hz, establishing its potential for dynamic and computationally constrained robotic applications. A video demo is provided at https://drive.google.com/file/d/1eIa11gw6DwYKG9CKERy41bjE1ruklRtT/view?usp=sharing, and the code will be publicly available soon.

48Dynamic Diffusion Transformer

[openreview] [pdf]

Abstract Diffusion Transformer (DiT), an emerging diffusion model for image generation, has demonstrated superior performance but suffers from substantial computational costs. Our investigations reveal that these costs stem from the static inference paradigm, which inevitably introduces redundant computation in certain diffusion timesteps and spatial regions. To address this inefficiency, we propose Dynamic Diffusion Transformer (DyDiT), an architecture that dynamically adjusts its computation along both timestep and spatial dimensions during generation. Specifically, we introduce a Timestep-wise Dynamic Width (TDW) approach that adapts model width conditioned on the generation timesteps. In addition, we design a Spatial-wise Dynamic Token (SDT) strategy to avoid redundant computation at unnecessary spatial locations. Extensive experiments on various datasets and different-sized models verify the superiority of DyDiT. Notably, with <3% additional fine-tuning iterations, our method reduces the FLOPs of DiT-XL by 51%, accelerates generation by 1.73×, and achieves a competitive FID score of 2.07 on ImageNet.

49Data Unlearning in Diffusion Models

[openreview] [pdf]

Abstract Recent work has shown that diffusion models memorize and reproduce training data examples. At the same time, large copyright lawsuits and legislation such as GDPR have highlighted the need for erasing datapoints from diffusion models. However, retraining from scratch is often too expensive. This motivates the setting of data unlearning, i.e., the study of efficient techniques for unlearning specific datapoints from the training set. Existing concept unlearning techniques require an anchor prompt/class/distribution to guide unlearning, which is not available in the data unlearning setting. General-purpose machine unlearning techniques were found to be either unstable or ineffective at unlearning data. We therefore propose a family of new loss functions called Subtracted Importance Sampled Scores (SISS) that utilize importance sampling and are the first method to unlearn data with theoretical guarantees. SISS is constructed as a weighted combination between simpler objectives that are responsible for preserving model quality and unlearning the targeted datapoints. When evaluated on CelebA-HQ and MNIST, SISS achieved Pareto optimality along the quality and unlearning strength dimensions. On Stable Diffusion, SISS successfully mitigated memorization on nearly 90% of the prompts we tested. We release our code online.

50Towards a Theoretical Understanding of Memorization in Diffusion Models

[openreview] [pdf]

Abstract As diffusion probabilistic models (DPMs) are being employed as mainstream models for Generative Artificial Intelligence (GenAI), the study of their memorization of training data has attracted growing attention. Existing works in this direction aim to establish an understanding of whether or to what extent DPMs learn via memorization. Such an understanding is crucial for identifying potential risks of data leakage and copyright infringement in diffusion models and, more importantly, for trustworthy application of GenAI. Existing works revealed that conditional DPMs are more prone to training data memorization than unconditional DPMs, and the data extraction methods motivated by these findings mostly target conditional DPMs. However, these understandings are primarily empirical, and extracting training data from unconditional models has been found to be extremely challenging. In this work, we provide a theoretical understanding of memorization in both conditional and unconditional DPMs under the assumption of model convergence. Our theoretical analysis indicates that extracting data from unconditional models can also be effective by constructing a proper surrogate condition. Based on this result, we propose a novel data extraction method named Surrogate condItional Data Extraction (SIDE) that leverages a time-dependent classifier trained on the generated data as a surrogate condition to extract training data from unconditional DPMs. Empirical results demonstrate that our SIDE can extract training data in challenging scenarios where previous methods fail, and it is, on average, over 50% more effective across different scales of the CelebA dataset.

51RETHINK MAXIMUM STATE ENTROPY

[openreview] [pdf]

Abstract In the absence of specific tasks or extrinsic reward signals, a key objective for an agent is the efficient exploration of its environment. A widely adopted strategy to achieve this is maximizing state entropy, which encourages the agent to uniformly explore the entire state space. Most existing approaches for maximum state entropy (MaxEnt) are rooted in two foundational approaches, which were proposed by Hazan and Liu & Abbeel, respectively. However, a unified perspective on these methods is lacking within the community. In this paper, we analyze these two foundational approaches within a unified framework and demonstrate that both methods share the same reward function when employing the kNN density estimator. We also show that the η-based policy sampling method proposed by Hazan is unnecessary and that the primary distinction between the two lies in the frequency with which the locally stationary reward function is updated. Building on this analysis, we introduce MaxEnt-(V)eritas, which combines the most effective components of both methods: iteratively updating the reward function as defined by Liu & Abbeel, and training the agent until convergence before updating the reward functions, akin to the procedure used by Hazan. We prove that MaxEnt-V is an efficient ε-optimal algorithm for maximizing state entropy, where the tolerance ε decreases as the number of iterations increases. Empirical validation in three Mujoco environments shows that MaxEnt-Veritas significantly outperforms the two MaxEnt frameworks in terms of both state coverage and state entropy maximization, with sound explanations for these results.
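Under the kNN density estimator, the shared reward the abstract mentions is, up to constants, the log distance to a state's k-th nearest visited neighbor. A minimal sketch of such an intrinsic reward (a standard construction, not the paper's code):

```python
import numpy as np
from scipy.spatial import cKDTree

def knn_state_entropy_reward(states, k=5):
    # Log distance to each state's k-th nearest neighbor among visited
    # states: larger in sparsely visited regions, so maximizing it pushes
    # the agent toward uniform coverage of the state space.
    tree = cKDTree(states)
    d, _ = tree.query(states, k=k + 1)  # column 0 is the point itself
    return np.log(d[:, -1] + 1e-8)
```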

52Conditional Information Bottleneck Approach for Out-of-Distribution Sequential Recommendation

[openreview] [pdf]

Abstract Sequential recommendation (SR) aims to suggest items users are most likely to engage with next based on their past interactions. However, in practice, SR systems often face the out-of-distribution (OOD) problem due to dynamic environmental factors (e.g., seasonal changes), leading to significant performance degradation in the testing phase. Some methods incorporate distributionally robust optimization (DRO) into SR to alleviate OOD, but the sparsity of SR data challenges this. Other approaches use random data augmentations to explore the OOD, potentially distorting important information, as user behavior is personalized rather than random. Additionally, they often overlook users’ varying sensitivity to distribution shifts during the exploration, which is crucial for capturing the evolution of user preferences in OOD contexts. In this work, inspired by information bottleneck theory (IB), we propose the Conditional Distribution Information Bottleneck (CDIB), a novel objective that creates diverse OOD distributions while preserving minimal sufficient information regarding the origin distribution conditioned on the user. Building on this, we introduce a framework with a learnable, personalized data augmentation method using a mask-then-generate paradigm to craft diverse and reliable OOD distributions optimized with CDIB. Experiments on four real-world datasets show our model consistently outperforms baselines. The code is available at https://anonymous.4open.science/r/CDIB-51C8.

53Backtracking Improves Generation Safety

[openreview] [pdf]

Abstract Text generation has a fundamental limitation almost by definition: there is no taking back tokens that have been generated, even when they are clearly problematic. In the context of language model safety, when a partial unsafe generation is produced, language models by their nature tend to happily keep on generating similarly unsafe additional text. This is in fact how safety alignment of frontier models gets circumvented in the wild, despite great efforts in improving their safety. Deviating from the paradigm of approaching safety alignment as prevention (decreasing the probability of harmful responses), we propose backtracking, a technique that allows language models to “undo” and recover from their own unsafe generation through the introduction of a special [RESET] token. Our method can be incorporated into either SFT or DPO training to optimize helpfulness and harmlessness. We show that models trained to backtrack are consistently safer than baseline models: backtracking Llama-3-8B is four times safer than the baseline model (6.1% → 1.5%) in our evaluations, without regression in helpfulness. Our method additionally provides protection against four adversarial attacks including an adaptive attack, despite not being trained to do so.
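At inference time, the mechanism is simple to consume: everything before a [RESET] token is a draft the model retracted. A minimal post-processing sketch under that assumption (the paper's exact decoding pipeline may differ):

```python
def extract_final_response(text, reset_token="[RESET]"):
    # Anything preceding the last reset token is an unsafe draft the model
    # chose to retract; only the text after it is the final response.
    return text.split(reset_token)[-1].strip()
```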

54Optimizing Latent Goal by Learning from Trajectory Preference

[openreview] [pdf]

Abstract A growing body of work has emerged focusing on instruction-following policies for open-world agents, aiming to better align the agent’s behavior with human intentions. However, the performance of these policies is highly susceptible to the initial prompt, which leads to extra effort in selecting the best instructions. We propose a framework named Preference Goal Tuning (PGT). PGT allows policies to interact with the environment to collect several trajectories, which are categorized into positive and negative examples based on preference. A preference optimization algorithm is used to fine-tune the initial goal latent representation using the collected trajectories while keeping the policy backbone frozen. Experimental results show that with minimal data and training, PGT achieves average relative improvements of 72.0% and 81.6% over 17 tasks for 2 different foundation policies, and outperforms the best human-selected instructions. Moreover, PGT surpasses full fine-tuning in out-of-distribution (OOD) task-execution environments by 13.4%, indicating that our approach retains strong generalization capabilities. Since our approach stores a single latent representation for each task independently, it can be viewed as an efficient method for Continual Learning, without the risk of catastrophic forgetting or task interference. In short, PGT enhances the performance of agents across nearly all tasks in the Minecraft Skillforge benchmark and demonstrates robustness to the execution environment.

55Domain Guidance: A Simple Transfer Approach for a Pre-trained Diffusion Model

[openreview] [pdf]

Abstract Recent advancements in diffusion models have revolutionized generative modeling. However, the impressive and vivid outputs they produce often come at the cost of significant model scaling and increased computational demands. Consequently, building personalized diffusion models based on off-the-shelf models has emerged as an appealing alternative. In this paper, we introduce a novel perspective on conditional generation for transferring a pre-trained model. From this viewpoint, we propose Domain Guidance, a straightforward transfer approach that leverages pre-trained knowledge to guide the sampling process toward the target domain. Domain Guidance shares a formulation similar to advanced classifier-free guidance, facilitating better domain alignment and higher-quality generations. We provide both empirical and theoretical analyses of the mechanisms behind Domain Guidance. Our experimental results demonstrate its substantial effectiveness across various transfer benchmarks, achieving over a 19.6% improvement in FID and a 20.6% improvement in FD$_\text{DINOv2}$ compared to standard fine-tuning. Notably, existing fine-tuned models can seamlessly integrate Domain Guidance to leverage these benefits, without additional training.
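One plausible reading of "a formulation similar to classifier-free guidance" is to let the pre-trained model play the unconditional branch and the fine-tuned model the conditional one. A hedged sketch of that combination (not the paper's verified equation):

```python
def domain_guidance_eps(eps_pretrained, eps_finetuned, w):
    # CFG-style combination that pushes samples toward the fine-tuned
    # (target) domain; w = 1 recovers plain fine-tuned sampling, and
    # w > 1 strengthens the pull toward the target domain.
    return eps_pretrained + w * (eps_finetuned - eps_pretrained)
```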

56Distilled Diffusion Language Models

[openreview] [pdf]

Abstract Transformer-based Large Language Models (LLMs) have demonstrated remarkable capabilities, yet their autoregressive nature forces sequential token-by-token decoding, leading to inefficiencies during inference. Furthermore, autoregressive language models lack inherent self-correction abilities, which hinders their capacity to refine and improve generated content without relying on external prompting or retraining techniques. In contrast, diffusion-based models offer the advantage of fast parallel generation through iterative refinement, while leveraging bi-directional attention to utilize full context at once. However, diffusion models have yet to match the performance of their autoregressive counterparts. This motivates us to explore the possibility of distilling a pre-trained autoregressive (AR) language model (teacher) into a non-autoregressive diffusion (non-AR) language model (student), combining the best of both worlds. In this work, we present Target Concrete Score (TCS) distillation, a theoretically grounded framework that bridges autoregressive and diffusion paradigms. TCS distillation is broadly applicable to both discrete and continuous diffusion models, with any pre-trained autoregressive teacher model. We propose techniques to make TCS distillation scalable and efficient for transformer-based models, and show how it can both improve pre-trained diffusion language models and also train new models from scratch. Through comprehensive experiments on language modeling tasks, we demonstrate the effectiveness of our proposed methods.

57Revamping Diffusion Guidance for Conditional and Unconditional Generation

[openreview] [pdf]

Abstract Classifier-free guidance (CFG) has become the standard method for enhancing the quality of conditional diffusion models. However, employing CFG requires either training an unconditional model alongside the main diffusion model or modifying the training procedure by periodically inserting a null condition. There is also no clear extension of CFG to unconditional models. In this paper, we revisit the core principles of CFG and introduce a new method, independent condition guidance (ICG), which provides the benefits of CFG without the need for any special training procedures. Our approach streamlines the training process of conditional diffusion models and can also be applied during inference on any pre-trained conditional model. Additionally, by leveraging the time-step information encoded in all diffusion networks, we propose an extension of CFG, called time-step guidance (TSG), which can be applied to any diffusion model, including unconditional ones. Our guidance techniques are easy to implement and have the same sampling cost as CFG. Through extensive experiments, we demonstrate that ICG matches the performance of standard CFG across various conditional diffusion models. Moreover, we show that TSG improves generation quality in a manner similar to CFG, without relying on any conditional information.
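Based only on the abstract, a natural way to obtain CFG's benefits without a learned null condition is to substitute a condition drawn independently of the current sample for the unconditional branch. A hedged sketch of that idea (the paper's exact ICG rule may differ):

```python
def icg_eps(model, x, t, cond, indep_cond, w):
    # indep_cond: a condition sampled independently of x (e.g., a randomly
    # drawn label or prompt), standing in for the null condition of CFG.
    eps_c = model(x, t, cond)
    eps_i = model(x, t, indep_cond)
    return eps_i + w * (eps_c - eps_i)
```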

58Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner

[openreview] [pdf]

Abstract Diffusion models have demonstrated their capabilities in modeling multi-task trajectories. However, existing multi-task planners or policies typically rely on task-specific demonstrations via multi-task imitation, or require task-specific reward labels to facilitate policy optimization via Reinforcement Learning (RL). They heavily rely on task-specific labeled data, which can be difficult to acquire. To address these challenges, we aim to develop a versatile diffusion planner that can leverage large-scale inferior data containing task-agnostic sub-optimal trajectories, with the ability to adapt quickly to specific tasks. In this paper, we propose SODP, a two-stage framework that leverages Sub-Optimal data to learn a Diffusion Planner, which is generalizable to various downstream tasks. Specifically, in the pre-training stage, we train a foundation diffusion planner that extracts general planning capabilities by modeling the versatile distribution of multi-task trajectories, which may be sub-optimal but provide wide data coverage. Then, for downstream tasks, we adopt RL-based fine-tuning with task-specific rewards to quickly refine the diffusion planner, aiming to generate action sequences with higher task-specific returns. Experimental results from multi-task domains including Meta-World and Adroit demonstrate that SODP outperforms state-of-the-art methods with only a small amount of data for reward-guided fine-tuning.

59Can the Training Loss be Predictive for Out-of-Distribution Generalization?

[openreview] [pdf]

Abstract Traditional model selection in deep learning relies on carefully tuning several hyper-parameters (HPs) controlling regularization strength on held-out validation data, which can be challenging to obtain in scarce-data scenarios or may not accurately reflect real-world deployment conditions due to distribution shifts. Motivated by such issues, this paper investigates the potential of using solely the training loss to predict the generalization performance of neural networks on out-of-distribution (OOD) test scenarios. Our analysis reveals that preserving consistent prediction variance across training and testing distributions is essential for establishing a correlation between training loss and OOD generalization. We propose architectural adjustments to ensure variance preservation, enabling reliable model selection based on training loss alone, even in over-parameterized settings with a sample-to-parameter ratio exceeding four orders of magnitude. We extensively assess the model-selection capabilities of variance-preserving architectures on several scarce data, domain-shift, and corruption benchmarks by optimizing HPs such as learning rate, weight decay, batch size, and data augmentation strength.

60Balancing Domain-Invariant and Domain-Specific Knowledge for Domain Generalization with Online Knowledge Distillation

[openreview] [pdf]

Abstract Deep learning models often experience performance degradation when the distribution of testing data differs from that of training data. Domain generalization addresses this problem by leveraging knowledge from multiple source domains to enhance model generalizability. Recent studies have shown that distilling knowledge from large pretrained models effectively improves a model’s ability to generalize to unseen domains. However, current knowledge distillation-based domain generalization approaches overlook the importance of domain-specific knowledge and rely on a two-stage training process, which limits the effectiveness of knowledge transfer. To overcome these limitations, we propose the Balanced Online knowLedge Distillation (BOLD) framework for domain generalization. BOLD employs a multi-domain expert teacher model, with each expert specializing in specific source domains to preserve domain-specific knowledge. This approach enables the student to distill both domain-invariant and domain-specific knowledge from the teacher. Additionally, BOLD adopts an online knowledge distillation strategy where the teacher and students learn simultaneously, allowing the teacher to adapt based on the student’s feedback, thereby enhancing knowledge transfer and improving the student’s generalizability. Extensive experiments conducted with state-of-the-art baselines on seven domain generalization benchmarks demonstrate the effectiveness of the BOLD framework. We also provide a theoretical analysis that underscores the effectiveness of domain-specific knowledge and the online knowledge distillation strategy in domain generalization.

61Heavy-Tailed Diffusion Models

[openreview] [pdf]

Abstract Diffusion models achieve state-of-the-art generation quality across many applications, but their ability to capture rare or extreme events in heavy-tailed distributions remains unclear. In this work, we show that traditional diffusion and flow-matching models with standard Gaussian priors fail to accurately capture heavy-tailed behavior. We address this by repurposing the diffusion framework for heavy-tail estimation using multivariate Student-t distributions. We develop a tailored perturbation kernel and derive the denoising posterior based on the conditional Student-t distribution for the backward process. Inspired by γ-divergence for heavy-tailed distributions, we derive a training objective for heavy-tailed denoisers. The resulting framework introduces controllable tail generation using only a single scalar hyperparameter, making it easily tunable for diverse real-world distributions. As specific instantiations of our framework, we introduce t-EDM and t-Flow, extensions of existing diffusion and flow models that employ a Student-t prior. Remarkably, our approach is readily compatible with standard Gaussian diffusion models and requires only minimal code changes. Empirically, we show that our t-EDM and t-Flow outperform standard diffusion models in heavy-tail estimation on high-resolution weather datasets in which generating rare and extreme events is crucial.
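Swapping the Gaussian prior for a Student-t one only requires changing how noise is drawn. A minimal sketch using the standard Gaussian scale-mixture construction (a sketch of the heavy-tailed prior only, not the paper's full perturbation kernel):

```python
import torch

def student_t_noise(batch, dim, nu):
    # Multivariate Student-t via a Gaussian scale mixture:
    # z = g / sqrt(k / nu), with g ~ N(0, I) and k ~ Chi2(nu), one k per
    # sample. Tails grow heavier as nu decreases; nu -> inf is Gaussian.
    g = torch.randn(batch, dim)
    k = torch.distributions.Chi2(torch.tensor(float(nu))).sample((batch, 1))
    return g / torch.sqrt(k / nu)
```

The single scalar nu is the kind of "single scalar hyperparameter" knob the abstract describes for controlling tail heaviness.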

62Variational Search Distributions

[openreview] [pdf]

Abstract We develop variational search distributions (VSD), a method for finding discrete, combinatorial designs of a rare desired class in a batch sequential manner with a fixed experimental budget. We formalize the requirements and desiderata for this problem and formulate a solution via variational inference. In particular, VSD uses off-the-shelf gradient based optimization routines, can learn powerful generative models for designs, and can take advantage of scalable predictive models. We derive asymptotic convergence rates for learning the true conditional generative distribution of designs with certain configurations of our method. After illustrating the generative model on images, we empirically demonstrate that VSD can outperform existing baseline methods on a set of real sequence-design problems in various biological systems.

63Diffusion Modulation via Environment Mechanism Modeling for Planning

[openreview] [pdf]

Abstract Diffusion models have shown promising capabilities in trajectory generation for planning in offline reinforcement learning (RL). However, conventional diffusion-based planning methods often fail to account for the fact that generating trajectories in RL requires unique consistency between transitions to ensure coherence in real environments. This oversight can result in considerable discrepancies between the generated trajectories and the underlying mechanisms of a real environment. To address this problem, we propose a novel diffusion-based planning method, termed as Diffusion Modulation via Environment Mechanism Modeling (DMEMM). DMEMM modulates diffusion model training by incorporating key RL environment mechanisms, particularly transition dynamics and reward functions. Experimental results demonstrate that DMEMM achieves state-of-the-art performance for planning with offline reinforcement learning.

64Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis

[openreview] [pdf]

Abstract Diffusion models have achieved great success in generating high-dimensional samples across various applications. While the theoretical guarantees for continuous-state diffusion models have been extensively studied, the convergence analysis of the discrete-state counterparts remains under-explored. In this paper, we study the theoretical aspects of score-based discrete diffusion models under the Continuous Time Markov Chain (CTMC) framework. We introduce a discrete-time sampling algorithm in the general state space [S]^d that utilizes score estimators at predefined time points. We derive convergence bounds for the Kullback-Leibler (KL) divergence and total variation (TV) distance between the generated sample distribution and the data distribution, considering both scenarios with and without early stopping under specific assumptions. Notably, our KL divergence bounds are nearly linear in the dimension d, aligning with state-of-the-art results for diffusion models. Our convergence analysis employs a Girsanov-based method and establishes key properties of the discrete score function, which are essential for characterizing the discrete-time sampling process.

65Energy-Based Conceptual Diffusion Model

[openreview] [pdf]

Abstract Diffusion models have shown impressive sample generation capabilities across various domains. However, current methods are still lacking in human-understandable explanations and interpretable control: (1) they do not provide a probabilistic framework for systematic interpretation. For example, when tasked with generating an image of a “Nighthawk”, they cannot quantify the probability of specific concepts (e.g., “black bill” and “brown crown” usually seen in Nighthawks) or verify whether the generated concepts align with the instruction. This limits explanations of the generative process; (2) they do not naturally support control mechanisms based on concept probabilities, such as correcting errors (e.g., correcting “black crown” to “brown crown” in a generated “Nighthawk” image) or performing imputations using these concepts, therefore falling short in interpretable editing capabilities. To address these limitations, we propose Energy-based Conceptual Diffusion Models (ECDMs). ECDMs integrate diffusion models and Concept Bottleneck Models (CBMs) within the framework of Energy-Based Models to provide unified interpretations. Unlike conventional CBMs, which are typically discriminative, our approach extends CBMs to the generative process. ECDMs use a set of energy networks and pretrained diffusion models to define the joint energy estimation of the input instructions, concept vectors, and generated images. This unified framework enables concept-based generation, interpretation, debugging, intervention, and imputation through conditional probabilities derived from energy estimates. Our experiments on various real-world datasets demonstrate that ECDMs offer both strong generative performance and rich concept-based interpretability.

66Diversity-Rewarded CFG Distillation

[openreview] [pdf]

Abstract Generative models are transforming creative domains such as music generation, with inference-time strategies like Classifier-Free Guidance (CFG) playing a crucial role. However, CFG doubles inference cost while limiting originality and diversity across generated contents. In this paper, we introduce diversity-rewarded CFG distillation, a novel finetuning procedure that distills the strengths of CFG while addressing its limitations. Our approach optimises two training objectives: (1) a distillation objective, encouraging the model alone (without CFG) to imitate the CFG-augmented predictions, and (2) an RL objective with a diversity reward, promoting the generation of diverse outputs for a given prompt. By finetuning, we learn model weights with the ability to generate high-quality and diverse outputs, without any inference overhead. This also unlocks the potential of weight-based model merging strategies: by interpolating between the weights of two models (the first focusing on quality, the second on diversity), we can control the quality-diversity trade-off at deployment time, and even further boost performance. We conduct extensive experiments on the MusicLM text-to-music generative model, where our approach surpasses CFG in terms of quality-diversity Pareto optimality. According to human evaluators, our finetuned-then-merged model generates samples with higher quality-diversity than the base model augmented with CFG. Explore our generations at https://musicdiversity.github.io/.
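The deployment-time merging the abstract describes is plain weight interpolation. A minimal sketch over two checkpoints' state dicts (names are hypothetical):

```python
def merge_checkpoints(sd_quality, sd_diversity, lam):
    # Linear interpolation between a quality-focused and a diversity-focused
    # checkpoint; lam in [0, 1] sets the quality-diversity trade-off
    # at deployment time without any retraining.
    return {k: (1 - lam) * sd_quality[k] + lam * sd_diversity[k]
            for k in sd_quality}
```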

67EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing

[openreview] [pdf]

Abstract Diffusion transformers have been widely adopted for text-to-image synthesis. While scaling these models up to billions of parameters shows promise, the effectiveness of scaling beyond current sizes remains underexplored and challenging. By explicitly exploiting the computational heterogeneity of image generations, we develop a new family of Mixture-of-Experts (MoE) models (EC-DIT) for diffusion transformers with expert-choice routing. EC-DIT learns to adaptively optimize the compute allocated to understand the input texts and generate the respective image patches, enabling heterogeneous computation aligned with varying text-image complexities. This heterogeneity provides an efficient way of scaling EC-DIT up to 97 billion parameters and achieving significant improvements in training convergence, text-to-image alignment, and overall generation quality over dense models and conventional MoE models. Through extensive ablations, we show that EC-DIT demonstrates superior scalability and adaptive compute allocation by recognizing varying textual importance through end-to-end training. Notably, in text-to-image alignment evaluation, our largest models achieve a state-of-the-art GenEval score of 71.68% and still maintain competitive inference speed with intuitive interpretability.

68Diffusion-Based Planning for Autonomous Driving with Flexible Guidance

[openreview] [pdf]

Abstract Achieving human-like driving behaviors in complex open-world environments is a critical challenge in autonomous driving. Contemporary learning-based planning approaches such as imitation learning methods often struggle to balance competing objectives and lack safety assurance, due to limited adaptability and inadequacy in learning complex multi-modal behaviors commonly exhibited in human planning, not to mention their strong reliance on fallback strategies with predefined rules. We propose a novel transformer-based Diffusion Planner for closed-loop planning, which can effectively model multi-modal driving behavior and ensure trajectory quality without any rule-based refinement. Our model supports joint modeling of both prediction and planning tasks under the same architecture, enabling cooperative behaviors between vehicles. Moreover, by learning the gradient of the trajectory score function and employing a flexible classifier guidance mechanism, Diffusion Planner effectively achieves safe and adaptable planning behaviors. Evaluations on the large-scale real-world autonomous planning benchmark nuPlan and our newly collected 200-hour delivery-vehicle driving dataset demonstrate that Diffusion Planner achieves state-of-the-art closed-loop performance with robust transferability in diverse driving styles.

69Satisficing Exploration in Bandit Optimization

[openreview] [pdf]

Abstract Motivated by the concept of satisficing in decision-making, we consider the problem of satisficing exploration in bandit optimization. In this setting, the learner aims at finding a satisficing arm whose mean reward exceeds a certain threshold. The performance is measured by satisficing regret, which is the cumulative deficit of the chosen arm’s mean reward compared to the threshold. We propose SELECT, a general algorithmic template for Satisficing Exploration via LowEr Confidence bound Testing, that attains constant satisficing regret for a wide variety of bandit optimization problems in the realizable case (i.e., whenever a satisficing arm exists). Specifically, given a class of bandit optimization problems and a corresponding learning oracle with sub-linear (standard) regret upper bound, SELECT iteratively makes use of the oracle to identify a potential satisficing arm. Then, it collects data samples from this arm, and continuously compares the lower confidence bound of the identified arm’s mean reward against the threshold value to determine if it is a satisficing arm. As a complement, SELECT also enjoys the same (standard) regret guarantee as the oracle in the non-realizable case. Finally, we conduct numerical experiments to validate the performance of SELECT for several popular bandit optimization settings.

70Longitudinal Latent Diffusion Models

[openreview] [pdf]

Abstract Longitudinal data are crucial in several fields, but collecting them is a challenging process, often hindered by concerns such as individual privacy. Extrapolating initial trajectories in time or generating fully synthetic sequences could address these issues and prove valuable in clinical trials, drug design, and even public policy evaluation. We propose a generative statistical model for longitudinal data that links the temporal dependence of a sequence to a latent diffusion model and leverages the geometry of the autoencoder latent space. This versatile method can be used for several tasks - prediction, generation, oversampling - effectively handling high-dimensional data such as images and irregularly-measured sequences, needing only relatively few training samples. Thanks to its ability to generate sequences with controlled variability, it outperforms previously proposed methods on datasets of varying complexity, while remaining interpretable.

71Understanding and Mitigating Memorization in Diffusion Models for Tabular Data

[openreview] [pdf]

Abstract Tabular data generation has attracted significant research interest in recent years, with tabular diffusion models greatly improving the quality of synthetic data. However, while memorization—where models inadvertently replicate exact or near-identical training data—has been thoroughly investigated in image and text generation, its effects on tabular data remain largely unexplored. In this paper, we conduct the first comprehensive investigation of memorization phenomena in diffusion models for tabular data. Our empirical analysis reveals that memorization appears in tabular diffusion models and increases with the number of training epochs. We further examine the influence of factors such as dataset sizes, feature dimensions, and different diffusion models on memorization. Additionally, we provide a theoretical explanation for why memorization occurs in tabular diffusion models. To address this issue, we propose TabCutMix, a simple yet effective data augmentation technique that exchanges randomly selected feature segments between random training sample pairs. Experimental results across various datasets and diffusion models demonstrate that TabCutMix effectively mitigates memorization while maintaining high-quality data generation. Our code is available at https://anonymous.4open.science/r/TabCutMix-3F7B.
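The augmentation as described is a one-function transform. A minimal sketch, under the assumptions that X is a numeric (n, d) feature matrix and a fixed fraction of columns is swapped per pair (the swap fraction and pairing scheme are illustrative):

```python
import numpy as np

def tabcutmix(X, frac=0.3, seed=None):
    # Exchange a random subset of feature columns between each sample and a
    # randomly chosen partner row, so no synthetic-training target is an
    # exact copy of any single original row.
    rng = np.random.default_rng(seed)
    src, out = X.copy(), X.copy()
    n, d = X.shape
    partners = rng.permutation(n)
    for i in range(n):
        cols = rng.choice(d, size=max(1, int(frac * d)), replace=False)
        out[i, cols] = src[partners[i], cols]
    return out
```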

72Principal Counterfactual Fairness

[openreview] [pdf]

Abstract Fairness in human and algorithmic decision-making is crucial in areas such as criminal justice, education, and social welfare. Recently, counterfactual fairness has drawn increasing research interest, suggesting that decision-making for individuals should remain the same when intervening with different values on the protected attributes. Nevertheless, the question of “which attributes and individuals should be protected” is rarely discussed in the existing counterfactual fairness literature. For example, when considering leg disability as a protected attribute, algorithms should not treat individuals with leg disabilities differently in college admissions, but one may naturally take this factor into account when selecting runner athletes. In other words, when and how to enforce fairness is expected to depend on the causal relation between the protected attribute and the outcome of interest. Formally, this paper proposes principal counterfactual fairness using the concept of principal stratification from the causal inference literature, focusing on whether an algorithm is counterfactually fair for individuals whose protected attribute has no individual causal effect on the outcome of interest. To examine whether an algorithm satisfies principal counterfactual fairness, we derive statistical bounds and propose a post-processing approach to achieve principal counterfactual fairness with minimal individual decision changes. Experiments are conducted using synthetic and real-world datasets to verify the effectiveness of our methods.

73Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers

[openreview] [pdf]

Abstract Score-based diffusion models have emerged as powerful techniques for generating samples from high-dimensional data distributions. These models involve a two-phase process: first, injecting noise to transform the data distribution into a known prior distribution, and second, sampling to recover the original data distribution from noises. Among the various sampling methods, deterministic samplers stand out for their enhanced efficiency. However, analyzing these deterministic samplers presents unique challenges, as they preclude the use of established techniques such as Girsanov’s theorem, which are only applicable to stochastic samplers. Furthermore, existing analysis for deterministic samplers usually focuses on specific examples, lacking a generalized approach for general forward processes and various deterministic samplers. Our paper addresses these limitations by introducing a unified convergence analysis framework. To demonstrate the power of our framework, we analyze the variance-preserving (VP) forward process with the exponential integrator (EI) scheme, achieving an iteration complexity of $\tilde{O}(d^2/\epsilon)$. Additionally, we provide a detailed analysis of DDIM-type samplers, which have been underexplored in previous research, achieving polynomial iteration complexity.

74Improving Discrete Diffusion with Schedule-Conditioning

[openreview] [pdf]

Abstract In research on discrete diffusion generative models, one long-standing mystery is the dominance of the masking state corruption process. In masking diffusion, all data points collapse to a sequence of mask tokens without any transitions between non-mask tokens, ruling out small edits from one unmasked token to another. By contrast, in image modeling, the dominant corruption process is Gaussian noise, which encourages gradual movements in pixel space. In this paper, we propose that masking diffusion dominates due to knowledge of when corruptions occurred. When it makes predictions, it does so conditional on the schedule of previous corruptions; this allows it to devote less capacity to inferring whether a corruption has occurred and more capacity to modeling relationships between tokens. We use this insight to build knowledge of corruptions into other discrete diffusion models; we call our method schedule-conditioned diffusion (SCUD). We show that SCUD generalizes classical discrete diffusion and masking diffusion. We show that applying SCUD to models with different corruption processes leads to improved perplexities on images, text, and protein sequences. Finally, by applying SCUD to models with corruption processes with “gradual” structure, we build diffusion models that outperform masking.

75Preference Diffusion for Recommendation

[openreview] [pdf]

Abstract Recommender systems predict personalized item rankings based on user preference distributions derived from historical behavior data. Recently, diffusion models (DMs) have gained attention in recommendation for their ability to model complex distributions, yet current DM-based recommenders often rely on traditional objectives like mean squared error (MSE) or recommendation objectives, which are not optimized for personalized ranking tasks or fail to fully leverage DMs’ generative potential. To address this, we propose PreferDiff, a tailored optimization objective for DM-based recommenders. PreferDiff transforms BPR into a log-likelihood ranking objective and integrates multiple negative samples to better capture user preferences. Specifically, we employ variational inference to handle the intractability by minimizing the variational upper bound, and we replace MSE with cosine error to improve alignment with recommendation tasks. Finally, we balance learning generation and preference to enhance the training stability of DMs. PreferDiff offers three key benefits: it is the first personalized ranking loss designed specifically for DM-based recommenders; it improves ranking and achieves faster convergence by addressing hard negatives; and, as we prove, it is theoretically connected to Direct Preference Optimization, indicating its potential to align user preferences in DM-based recommenders via generative modeling. Extensive experiments across three benchmarks validate its superior recommendation performance and commendable general sequential recommendation capabilities. Our codes are available at https://anonymous.4open.science/r/PreferDiff.

76Mitigating Shortcut Learning with Diffusion Counterfactuals and Diverse Ensembles

[openreview] [pdf]

Abstract Spurious correlations in the data, where multiple cues are predictive of the target labels, often lead to a phenomenon known as shortcut learning, where a model relies on erroneous, easy-to-learn cues while ignoring reliable ones. In this work, we propose DiffDiv, an ensemble diversification framework exploiting Diffusion Probabilistic Models (DPMs) to mitigate this form of bias. We show that at particular training intervals, DPMs can generate images with novel feature combinations, even when trained on samples displaying correlated input features. We leverage this crucial property to generate synthetic counterfactuals to increase model diversity via ensemble disagreement. We show that DPM-guided diversification is sufficient to remove dependence on shortcut cues, without a need for additional supervised signals. We further empirically quantify its efficacy on several diversification objectives, and finally show improved generalization and diversification on par with prior work that relies on auxiliary data collection.

77HoTPP Benchmark: Are We Good at the Long Horizon Events Forecasting?

[openreview] [pdf]

Abstract Accurately forecasting multiple future events within a given time horizon is crucial for applications in finance, retail, social networks, and healthcare. Event timing and labels are typically modeled using Marked Temporal Point Processes (MTPP), with evaluations often focused on next-event prediction quality. While some studies have extended evaluations to a fixed number of future events, we demonstrate that this approach leads to inaccuracies in handling false positives and false negatives. To address these issues, we propose a novel evaluation method inspired by object detection techniques from computer vision. Specifically, we introduce Temporal mean Average Precision (T-mAP), a temporal variant of mAP, which overcomes the limitations of existing long-horizon evaluation metrics. Our extensive experiments demonstrate that models with strong next-event prediction accuracy can yield poor long-horizon forecasts, and vice versa, indicating that specialized methods are needed for each task. To support further research, we release HoTPP, the first benchmark specifically designed for evaluating long-horizon MTPP predictions. HoTPP includes large-scale datasets with up to 43 million events and provides optimized procedures for both autoregressive and parallel inference, paving the way for future advancements in the field.

78Zigzag Diffusion Sampling: The Path to Success Is Zigzag

[openreview] [pdf]

Abstract Diffusion models, the most popular generative paradigm so far, can inject conditional information into the generation path to guide the latent towards desired directions. However, existing text-to-image diffusion models often fail to maintain high image quality and high prompt-image alignment for challenging prompts. To mitigate this issue and enhance existing pretrained diffusion models, we make three main contributions in this paper. First, we theoretically and empirically demonstrate that the conditional guidance gap between the denoising and inversion processes captures prompt-related semantic information. Second, motivated by this theoretical analysis, we derive Zigzag Diffusion Sampling (Z-Sampling), a novel sampling method that leverages the guidance gap to accumulate semantic information step by step throughout the entire generation process, leading to improved sampling results. Moreover, as a plug-and-play method, Z-Sampling can be generally applied to various diffusion models (e.g., accelerated ones and Transformer-based ones) with very limited coding costs. Third, extensive experiments demonstrate that Z-Sampling can generally and significantly enhance generation quality across various benchmark datasets, diffusion models, and performance evaluation metrics. Particularly, Z-Sampling is good at handling challenging fine-grained prompts, such as style, position, counting, and multiple objects, due to its guidance-gap-based information gain. Moreover, Z-Sampling can even further enhance existing diffusion models combined with other orthogonal methods, including Diffusion-DPO.
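A minimal sketch of one zigzag iteration, assuming hypothetical denoise and invert callables (e.g., a guided DDIM step and its inversion): the latent is denoised, inverted back to the same timestep, and denoised again, so each round trip accumulates the prompt-related semantics carried by the guidance gap.

```python
def zigzag_step(z_t, t, denoise, invert):
    # Forward-backward-forward: if the inversion uses weaker (or no)
    # guidance, the round trip injects the conditional-guidance gap into
    # the latent before the next denoising pass.
    z_prev = denoise(z_t, t)        # t -> t-1 with the desired condition
    z_back = invert(z_prev, t - 1)  # t-1 -> t
    return denoise(z_back, t)
```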

79RTDiff: Reverse Trajectory Synthesis via Diffusion for Offline Reinforcement Learning

[openreview] [pdf]

Abstract In offline reinforcement learning (RL), managing the distribution shift between the learned policy and the static offline dataset is a persistent challenge that can result in overestimated values and suboptimal policies. Traditional offline RL methods address this by introducing conservative biases that limit exploration to well-understood regions, but they often overly restrict the agent’s generalization capabilities. Recent work has sought to generate trajectories using generative models to augment the offline dataset, yet these methods still struggle with overestimating synthesized data, especially when out-of-distribution samples are produced. To overcome this issue, we propose RTDiff, a novel diffusion-based data augmentation technique that synthesizes trajectories in reverse, moving from unknown to known states. Such reverse generation naturally mitigates the risk of overestimation by ensuring that the agent avoids planning through unknown states. Additionally, reverse trajectory synthesis allows us to generate longer, more informative trajectories that take full advantage of diffusion models’ generative strengths while ensuring reliability. We further enhance RTDiff by introducing flexible trajectory length control and improving the efficiency of the generation process through noise management. Our empirical results show that RTDiff significantly improves the performance of several state-of-the-art offline RL algorithms across diverse environments, achieving consistent and superior results by effectively overcoming distribution shift.

80Revealing the Unseen: Guiding Personalized Diffusion Models to Expose Training Data

[openreview] [pdf]

Abstract Diffusion Models (DMs) have evolved into advanced image generation tools, especially for few-shot fine-tuning where a pretrained DM is fine-tuned on a small set of images to capture specific styles or objects. Many people upload these personalized checkpoints online, fostering communities such as Civitai and HuggingFace. However, model owners may overlook the potential risks of data leakage by releasing their fine-tuned checkpoints. Moreover, concerns regarding copyright violations arise when unauthorized data is used during fine-tuning. In this paper, we ask: “Can training data be extracted from these fine-tuned DMs shared online?” A successful extraction would present not only data leakage threats but also offer tangible evidence of copyright infringement. To answer this, we propose FineXtract, a framework for extracting fine-tuning data. Our method approximates fine-tuning as a gradual shift in the model’s learned distribution, from the original pretrained DM toward the fine-tuning data. By extrapolating the models before and after fine-tuning, we guide the generation toward high-probability regions within the fine-tuned data distribution. We then apply a clustering algorithm to extract the most probable images from those generated using this extrapolated guidance. Experiments on DMs fine-tuned with datasets such as WikiArt, DreamBooth, and real-world checkpoints posted online validate the effectiveness of our method, extracting approximately 20% of fine-tuning data in most cases, significantly surpassing baseline performance. The code is available at an anonymous link.
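
One plausible way to realize "extrapolating the models before and after fine-tuning" for diffusion models is to extrapolate their noise predictions at each sampling step; the rule below is an assumed illustration of that guidance, not necessarily the paper's exact formula.

```python
import torch

@torch.no_grad()
def extrapolated_noise(eps_pretrained, eps_finetuned, w=2.0):
    """Guidance by extrapolation: w = 1 recovers the fine-tuned prediction,
    while w > 1 pushes sampling further toward regions whose likelihood
    increased during fine-tuning (the presumed fine-tuning data)."""
    return eps_pretrained + w * (eps_finetuned - eps_pretrained)
```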

81CONCORD: Concept-informed Diffusion for Dataset Distillation

[openreview] [pdf]

Abstract Dataset distillation has witnessed significant progress in synthesizing small-scale datasets that encapsulate rich information from large-scale original ones. Particularly, methods based on generative priors show promising performance, while maintaining computational efficiency and cross-architecture generalization. However, the generation process lacks explicit controllability for each sample. Previous distillation methods primarily match the real distribution from the perspective of the entire dataset, while overlooking conceptual completeness at the instance level. This oversight can result in missing or incorrectly represented object details and compromised dataset quality. To this end, we propose to incorporate the conceptual understanding of large language models (LLMs) to perform a CONCept-infORmed Diffusion process for dataset distillation, CONCORD for short. Specifically, distinguishable and fine-grained concepts are retrieved based on category labels to explicitly inform the denoising process and refine essential object details. By integrating these concepts, the proposed method significantly enhances both the controllability and interpretability of the distilled image generation, without relying on pre-trained classifiers. We demonstrate the efficacy of CONCORD by achieving state-of-the-art performance on ImageNet-1K and its subsets. It further advances the practical application of dataset distillation methods. The code implementation is attached in the supplementary material.

82Cohesion: Coherence-Based Diffusion for Long-Range Dynamics Forecasting

[openreview] [pdf]

Abstract We recast existing works on probabilistic dynamics forecasting through a unified framework connecting turbulence and diffusion principles: Cohesion. Specifically, we treat the coherent part of nonlinear dynamics as a conditioning prior in a denoising process, which can be efficiently estimated using reduced-order models. This fast generation of long prior sequences allows us to reframe forecasting as trajectory planning, a common task in RL. This reformulation is beneficial because we can perform a single conditional denoising pass for an entire sequence, rather than autoregressively over a long lead time, gaining orders-of-magnitude speedups with little performance loss. Nonetheless, Cohesion supports flexibility through temporal composition, which allows iterations to be performed over smaller subsequences, with autoregression as a special case. To ensure temporal consistency within and between subsequences, we incorporate a model-free, small receptive window via temporal convolution that leverages large NFEs during denoising. Finally, we perform our guidance in a classifier-free manner to handle a broad range of conditioning scenarios for zero-shot forecasts. Our experiments demonstrate that Cohesion outperforms state-of-the-art probabilistic emulators for chaotic systems over long lead times, including Kolmogorov Flow and the Shallow Water Equation. Its low spectral divergence highlights Cohesion’s ability to resolve multi-scale physical structures, even in partially-observed cases, which is essential for long-range, high-fidelity, physically-realistic emulation.

[openreview] [pdf]

Abstract Machine Learning (ML) has advanced Combinatorial Optimization (CO), especially for one of its most intensively studied problems, the Travelling Salesman Problem (TSP). While certain methods demonstrate promising performance, they still fall short compared to mathematical solvers. This study uses TSP as a case study, dissecting established mainstream learning-based solvers to outline a comprehensive design space. It advances a unified modular streamline incorporating existing technologies in both learning and search for transparent ablation, aiming to reassess the role of learning and to discern which parts of existing techniques are genuinely beneficial and which are not. This further leads to an investigation of desirable principles for learning designs and an exploration of concepts guiding method design. We demonstrate the desirability of principles such as joint probability estimation, symmetric solution representation, and online optimization for learning-based designs. Leveraging these findings, we propose enhancements to existing methods to compensate for their missing attributes, thereby advancing performance and enriching the technique library. From a higher viewpoint, we also uncover a performance advantage of non-autoregressive and supervised paradigms over their counterparts. The strategic decoupling and organic recomposition yield a factory of new TSP solvers, where we investigate synergies across various method combinations and pinpoint the optimal design choices to create more powerful ML4TSP solvers, thereby facilitating and offering a reference for future research and engineering endeavors. Source code will be made publicly available.

84Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF

[openreview] [pdf]

Abstract Large Language Models (LLMs) have achieved remarkable success at tasks like summarization that involve a single turn of interaction. However, they can still struggle with multi-turn tasks like dialogue that require long-term planning. Previous works on multi-turn dialogue extend single-turn reinforcement learning from human feedback (RLHF) methods to the multi-turn setting by treating all prior dialogue turns as a long context. Such approaches suffer from covariate shift: the conversations in the training set have previous turns generated by some reference policy, which means that low training error may not necessarily correspond to good performance when the learner is actually in the conversation loop. In response, we introduce REgressing the RELative FUture (REFUEL), an efficient policy optimization approach designed to address multi-turn RLHF in LLMs. REFUEL employs a single model to estimate Q-values and trains on self-generated data, addressing the covariate shift issue. REFUEL frames the multi-turn RLHF problem as a sequence of regression tasks on iteratively collected datasets, enabling ease of implementation. Theoretically, we prove that REFUEL can match the performance of any policy covered by the training set. Empirically, we evaluate our algorithm by using Llama-3.1-70B-it to simulate a user in conversation with our model. REFUEL consistently outperforms state-of-the-art methods such as DPO and REBEL across various settings. Furthermore, despite having only 8 billion parameters, Llama-3-8B-it fine-tuned with REFUEL outperforms Llama-3.1-70B-it on long multi-turn dialogues.

85Beyond Predefined Depots: A Dual-Mode Generative DRL Framework for Proactive Depot Generation in Location-Routing Problem

[openreview] [pdf]

Abstract The Location-Routing Problem (LRP), which combines the challenges of facility (depot) locating and vehicle route planning, is critically constrained by the reliance on predefined depot candidates, limiting the solution space and potentially leading to suboptimal outcomes. Previous research on LRP without predefined depots is scant and predominantly relies on heuristic algorithms that iteratively attempt depot placements across a planar area. Such approaches lack the ability to proactively generate depot locations that meet specific geographic requirements, revealing a notable gap in the current research landscape. To bridge this gap, we propose a data-driven generative DRL framework, designed to proactively generate depots for LRP without predefined depot candidates, based solely on customer request data that include geographic and demand information. It can operate in two distinct modes: direct generation of exact depot locations, and the creation of a multivariate Gaussian distribution for flexible depot sampling. By extracting depots’ geographic pattern from customer request data, our approach can dynamically respond to logistical needs, identifying high-quality depot locations that further reduce total routing costs compared to traditional methods. Extensive experiments demonstrate that, for the same group of customer requests, compared with depots identified through random attempts, our framework can proactively generate depots that lead to superior solution routes with lower routing cost. The implications of our framework potentially extend into real-world applications, particularly in emergency medical rescue and disaster relief logistics, where rapid establishment and adjustment of depot locations are paramount, showcasing its potential in addressing LRP in dynamic and unpredictable environments.

86Distributional Reinforcement Learning Based On Historical Information For Option Hedging

[openreview] [pdf]

Abstract Options are widely used financial derivatives for risk management and corporate operations. Option hedging aims to mitigate investment risks from asset price fluctuations by buying and selling other financial products. Traditional hedging strategies based on the Black-Scholes model face practical limitations due to the assumptions of constant volatility and the neglect of transaction costs. Recently, reinforcement learning (RL) has gained attention in the study of option hedging strategies, but several challenges remain: current methods rely on real-time market data (e.g., underlying asset prices, holdings, remaining option term) to determine optimal positions, underutilizing the potential value of historical data; existing approaches focus on the expected hedging cost, overlooking the comprehensive distribution of costs; and, in terms of training data generation, commonly used single-simulation methods perform well under specific conditions but struggle to ensure the robustness of the model across diverse datasets. To address these issues, we propose a novel distributional RL option hedging method that incorporates historical information. Historical states are included in the state variables, with a gated recurrent unit (GRU) network layer extracting historical information. This is then combined with current information from fully connected layers to inform subsequent network layers, ensuring the agent considers both current and historical market information when learning hedging strategies. The output of the value network is set as a series of quantiles, with the quantile Huber loss function fitting their distribution to evaluate strategies based on the distribution rather than the expected value. To diversify data sources, we use a combination of the Black-Scholes model, the Binomial model, and the Heston model to simulate a large volume of option data. Experimental results show that our method significantly reduces hedging costs and demonstrates strong adaptability and practicality under various market conditions.
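
For reference, the quantile Huber loss mentioned here is the standard quantile-regression objective from distributional RL (Dabney et al.); a minimal PyTorch version, with shapes assumed as annotated, might look like:

```python
import torch
import torch.nn.functional as F

def quantile_huber_loss(pred_quantiles, targets, kappa=1.0):
    """pred_quantiles: (batch, N) quantile estimates of the cost distribution;
    targets: (batch, M) target samples (e.g. from a distributional backup)."""
    n = pred_quantiles.shape[1]
    taus = (torch.arange(n, dtype=pred_quantiles.dtype,
                         device=pred_quantiles.device) + 0.5) / n
    # pairwise TD errors between every target and every quantile estimate
    u = targets.unsqueeze(1) - pred_quantiles.unsqueeze(2)       # (batch, N, M)
    huber = F.huber_loss(pred_quantiles.unsqueeze(2).expand_as(u),
                         targets.unsqueeze(1).expand_as(u),
                         reduction="none", delta=kappa)
    # asymmetric weighting by |tau - 1{u < 0}| turns Huber into quantile loss
    loss = torch.abs(taus.view(1, -1, 1) - (u.detach() < 0).float()) * huber / kappa
    return loss.sum(dim=1).mean()
```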

87Dual-Head Knowledge Distillation: Enhancing Logits Utilization with an Auxiliary Head

[openreview] [pdf]

Abstract Traditional knowledge distillation focuses on aligning the student’s predicted probabilities with both ground-truth labels and the teacher’s predicted probabilities. However, the transition from logits to predicted probabilities obscures certain indispensable information. To address this issue, it is intuitive to additionally introduce a logit-level loss function as a supplement to the widely used probability-level loss function, for exploiting the latent information of logits. Unfortunately, we empirically find that the combination of the newly introduced logit-level loss and the previous probability-level loss leads to performance degradation, even falling behind the performance of employing either loss in isolation. We attribute this phenomenon to the collapse of the classification head, which is verified by our theoretical analysis based on neural collapse theory. Specifically, the gradients of the two loss functions exhibit contradictions in the linear classifier yet display no such conflict within the backbone. Drawing from this theoretical analysis, we propose a novel method called dual-head knowledge distillation, which partitions the linear classifier into two classification heads responsible for different losses, thereby preserving the beneficial effects of both losses on the backbone while eliminating adverse influences on the classification head. Extensive experiments validate that our method can effectively exploit the information inside the logits and achieve superior performance against state-of-the-art counterparts.
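
A minimal sketch of the dual-head idea, under the assumption that one linear head receives the probability-level losses (cross-entropy plus temperature-scaled KL to the teacher) while the other receives a logit-level loss (MSE here, as a placeholder), with both sharing the backbone:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DualHeadStudent(nn.Module):
    """Shared backbone with two linear heads, so the conflicting gradients
    land on separate classifiers while the backbone gets both signals
    (a sketch of the idea, not the paper's exact recipe)."""
    def __init__(self, backbone, feat_dim, num_classes):
        super().__init__()
        self.backbone = backbone
        self.head_prob = nn.Linear(feat_dim, num_classes)   # CE + KD losses
        self.head_logit = nn.Linear(feat_dim, num_classes)  # logit matching

    def forward(self, x):
        f = self.backbone(x)
        return self.head_prob(f), self.head_logit(f)

def dual_head_loss(z_prob, z_logit, z_teacher, y, T=4.0, alpha=1.0, beta=1.0):
    ce = F.cross_entropy(z_prob, y)
    kd = F.kl_div(F.log_softmax(z_prob / T, dim=1),
                  F.softmax(z_teacher / T, dim=1),
                  reduction="batchmean") * T * T
    logit_match = F.mse_loss(z_logit, z_teacher)   # placeholder logit-level loss
    return ce + alpha * kd + beta * logit_match
```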

88Generalizing to any diverse distribution: uniformity, gentle finetuning & rebalancing

[openreview] [pdf]

Abstract As training datasets grow larger, we aspire to develop models that generalize well to any diverse test distribution, even if the latter deviates significantly from the training data. Various approaches like domain adaptation, domain generalization, and robust optimization attempt to address the out-of-distribution challenge by posing assumptions about the relation between the training and test distributions. In contrast, we adopt a more conservative perspective by accounting for the worst-case error across all sufficiently diverse test distributions within a known domain. Our first finding is that training on a uniform distribution over this domain is optimal. We also examine practical remedies when uniform samples are unavailable, considering methods for mitigating non-uniformity through finetuning and rebalancing. Our theory provides a mathematical grounding for previous observations on the role of entropy and rebalancing for o.o.d. generalization and foundation model training. We also provide new empirical evidence across tasks involving o.o.d. shifts which illustrates the broad applicability of our perspective.

89Counterfactual Techniques for Enhancing Customer Retention

[openreview] [pdf]

Abstract In this paper, we introduce a novel counterfactual reasoning method using eBERT embeddings to convert customers from an e-commerce company who frequently add items to their cart but don’t proceed to checkout. We demonstrate that our method i) outperforms existing techniques such as DiCE, GANs, and CFRL in key metrics such as coverage, while also maintaining a low latency; ii) balances high coverage and low latency by adjusting the number of nearest unlike neighbors, highlighting a trade-off between these competing goals; and iii) allows customization of mutable features, improving the practical applicability of our counterfactual explanations.

90Effectively Steer LLM To Follow Preference via Building Confident Directions

[openreview] [pdf]

Abstract Having an LLM that aligns with human preference is essential for accommodating individual needs, such as maintaining writing style or generating specific topics of interest. The majority of current alignment methods rely on fine-tuning or prompting, which can be either costly or difficult to control. Model steering algorithms, which construct steering directions used to modify the model output, are typically easy to implement and optimization-free. However, their capabilities are typically limited to steering the model in one of two directions (i.e., bidirectional steering), and there has been no theoretical understanding guaranteeing their performance. In this work, we propose a theoretical framework to understand and quantify model steering methods. Inspired by this framework, we propose a confident direction steering method (CONFST) that steers LLMs by modifying their activations at inference time. More specifically, CONFST builds a confident direction that is closely aligned with users’ preferences; this direction is then added to the activations of the LLM to effectively steer the model output. Our approach offers three key advantages over popular bidirectional model steering methods: 1) it is more powerful, since multiple (i.e., more than two) users’ preferences can be aligned simultaneously; 2) it is very simple to implement, since there is no need to determine which layer the steering vector should be added to; 3) no explicit user instruction is required. We validate our method on GPT-2 XL (1.5B), Mistral (7B) and Gemma-it (9B) models for tasks that require shifting the output of LLMs across a number of different topics and styles.
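
Generic activation steering, the family CONFST belongs to, can be sketched as follows: a direction is built from activation statistics and added to a layer's hidden states at inference via a forward hook. The direction construction shown here (a mean difference between preference-aligned and generic prompts) is only illustrative; CONFST's confident-direction construction is its own contribution.

```python
import torch

def build_direction(acts_pref, acts_base):
    """Illustrative 'direction': mean activation difference at one layer
    between preference-aligned prompts and generic prompts."""
    d = acts_pref.mean(dim=0) - acts_base.mean(dim=0)
    return d / d.norm()

def add_steering_hook(layer, direction, strength=8.0):
    """Add `strength * direction` to the layer's hidden states at inference."""
    def hook(module, inputs, output):
        hidden = output[0] if isinstance(output, tuple) else output
        hidden = hidden + strength * direction.to(hidden)
        return (hidden, *output[1:]) if isinstance(output, tuple) else hidden
    return layer.register_forward_hook(hook)
```

For a GPT-2-style model the steered layer might be something like `model.transformer.h[20]` (module paths vary by implementation); the returned handle's `.remove()` restores the unsteered model.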

91Learning mirror maps in policy mirror descent

[openreview] [pdf]

Abstract Policy Mirror Descent (PMD) is a popular framework in reinforcement learning, serving as a unifying perspective that encompasses numerous algorithms. These algorithms are derived through the selection of a mirror map and enjoy finite-time convergence guarantees. Despite its popularity, the exploration of PMD’s full potential is limited, with the majority of research focusing on a particular mirror map---namely, the negative entropy---which gives rise to the renowned Natural Policy Gradient (NPG) method. It remains uncertain from existing theoretical studies whether the choice of mirror map significantly influences PMD’s efficacy. In our work, we conduct empirical investigations to show that the conventional mirror map choice (NPG) often yields less-than-optimal outcomes across several standard benchmark environments. Using evolutionary strategies, we identify more efficient mirror maps that enhance the performance of PMD. We first focus on a tabular environment, i.e., Grid-World, where we relate existing theoretical bounds with the performance of PMD for a few standard mirror maps and the learned one. We then show that it is possible to learn a mirror map that outperforms the negative entropy in more complex environments, such as the MinAtar suite. Additionally, we demonstrate that the learned mirror maps generalize effectively to different tasks by testing each map across various other environments.

92Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts

[openreview] [pdf]

Abstract With the rapid progress of diffusion-based content generation, significant efforts are being made to unlearn harmful or copyrighted concepts from pretrained diffusion models (DMs) to prevent potential model misuse. However, it is observed that even when DMs are properly unlearned before release, malicious finetuning can compromise this process, causing DMs to relearn the unlearned concepts. This occurs partly because certain benign concepts (e.g., “skin”) retained in DMs are related to the unlearned ones (e.g., “nudity”), facilitating their relearning via finetuning. To address this, we propose meta-unlearning on DMs. Intuitively, a meta-unlearned DM should behave like an unlearned DM when used as is; moreover, if the meta-unlearned DM undergoes malicious finetuning on unlearned concepts, the related benign concepts retained within it will be triggered to self-destruct, hindering the relearning of unlearned concepts. Our meta-unlearning framework is compatible with most existing unlearning methods, requiring only the addition of an easy-to-implement meta objective. We validate our approach through empirical experiments on meta-unlearning concepts from Stable Diffusion models (SD-v1-4 and SDXL), supported by extensive ablation studies.

93On Statistical Rates of Conditional Diffusion Transformer: Approximation and Estimation

[openreview] [pdf]

Abstract We investigate the approximation and estimation rates of conditional diffusion transformers (DiTs) with classifier-free guidance. We present a comprehensive analysis of “in-context” conditional DiTs under four common data assumptions. We show that both conditional DiTs and their latent variants lead to the minimax optimality of unconditional DiTs under identified settings. Specifically, we discretize the input domains into infinitesimal grids and then perform a term-by-term Taylor expansion on the conditional diffusion score function under the Hölder smooth data assumption. This enables fine-grained use of transformers’ universal approximation through a more detailed piecewise constant approximation, and hence obtains tighter bounds. Additionally, we extend our analysis to the latent setting under the linear latent subspace assumption. We not only show that latent conditional DiTs achieve lower bounds than conditional DiTs both in approximation and estimation, but also show the minimax optimality of latent unconditional DiTs. Our findings establish statistical limits for conditional and unconditional DiTs, and offer practical guidance toward developing more efficient and accurate DiT models.

94Counterfactual Concept Bottleneck Models

[openreview] [pdf]

Abstract Current deep learning models are not designed to simultaneously address three fundamental questions: predict class labels to solve a given classification task (the “What?”), simulate changes in the situation to evaluate how this impacts class predictions (the “How?”), and imagine how the scenario should change to result in different class predictions (the “Why not?”). The inability to answer these questions represents a crucial gap in deploying reliable AI agents, calibrating human trust, and improving human-machine interaction. To bridge this gap, we introduce CounterFactual Concept Bottleneck Models (CF-CBMs), a class of models designed to efficiently address the above queries all at once without the need to run post-hoc searches. Our experimental results demonstrate that CF-CBMs: achieve classification accuracy comparable to black-box models and existing CBMs (“What?”), rely on fewer important concepts leading to simpler explanations (“How?”), and produce interpretable, concept-based counterfactuals (“Why not?”). Additionally, we show that training the counterfactual generator jointly with the CBM leads to two key improvements: (i) it alters the model’s decision-making process, making the model rely on fewer important concepts (leading to simpler explanations), and (ii) it significantly increases the causal effect of concept interventions on class predictions, making the model more responsive to these changes.

95Sampling from Energy-based Policies using Diffusion

[openreview] [pdf]

Abstract Energy-based policies offer a flexible framework for modeling complex, multimodal behaviors in reinforcement learning (RL). In maximum entropy RL, the optimal policy is a Boltzmann distribution derived from the soft Q-function, but direct sampling from this distribution in continuous action spaces is computationally intractable. As a result, existing methods typically use simpler parametric distributions, like Gaussians, for policy representation — limiting their ability to capture the full complexity of multimodal action distributions. In this paper, we introduce a diffusion-based approach for sampling from energy-based policies, where the negative Q-function defines the energy function. Based on this approach, we propose an actor-critic method called Diffusion Q-Sampling (DQS) that enables more expressive policy representations, allowing stable learning in diverse environments. We show that our approach enhances exploration and captures multimodal behavior in continuous control tasks, addressing key limitations of existing methods.

96Adaptive backtracking for fast optimization

[openreview] [pdf]

Abstract Backtracking line search is foundational in numerical optimization. The basic idea is to adjust the step size of an algorithm by a constant factor until some chosen criterion (e.g. Armijo, Goldstein, Descent Lemma) is satisfied. We propose a new way for adjusting step sizes, replacing the constant factor used in regular backtracking with one that takes into account the degree to which the chosen criterion is violated, without additional computational burden. We perform a variety of experiments on over fifteen real world datasets, which confirm that adaptive backtracking often leads to significantly faster optimization. For convex problems, we prove adaptive backtracking requires fewer adjustments to produce a feasible step size than regular backtracking does for two popular line search criteria: the Armijo condition and the descent lemma. For nonconvex smooth problems, we prove adaptive backtracking enjoys the same guarantees of regular backtracking.
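
To make the contrast concrete: regular backtracking always multiplies the step size by a constant rho when the criterion fails, whereas the adaptive variant scales it by an amount tied to the measured violation. The violation-aware factor below is an illustrative choice, not the paper's derived rule.

```python
import numpy as np

def adaptive_backtracking_step(f, grad, x, t0=1.0, c=1e-4, rho=0.5,
                               max_iter=50):
    """One gradient step with an Armijo line search. Instead of always
    shrinking by the constant `rho`, the step is shrunk by a factor tied
    to how badly the Armijo condition is violated."""
    g = grad(x)
    fx, gg = f(x), float(np.dot(g, g))
    t = t0
    for _ in range(max_iter):
        decrease = fx - f(x - t * g)       # actual decrease achieved
        required = c * t * gg              # Armijo requirement
        if decrease >= required:
            return x - t * g, t
        # violation-aware shrink factor, clipped into [0.1, rho]
        factor = min(rho, max(0.1, decrease / required)) if required > 0 else rho
        t *= factor
    return x - t * g, t
```

The larger the gap between the achieved and required decrease, the harder the step size is cut, which is what saves adjustment iterations relative to a fixed factor.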

97On the Byzantine-Resilience of Distillation-Based Federated Learning

[openreview] [pdf]

Abstract Federated Learning (FL) algorithms using Knowledge Distillation (KD) have received increasing attention due to their favorable properties with respect to privacy, non-i.i.d. data and communication cost. These methods depart from transmitting model parameters and instead communicate information about a learning task by sharing predictions on a public dataset. In this work, we study the performance of such approaches in the byzantine setting, where a subset of the clients act in an adversarial manner aiming to disrupt the learning process. We show that KD-based FL algorithms are remarkably resilient and analyze how byzantine clients can influence the learning process. Based on these insights, we introduce two new byzantine attacks and demonstrate their ability to break existing byzantine-resilient methods. Additionally, we propose a novel defence method which enhances the byzantine resilience of KD-based FL algorithms. Finally, we provide a general framework to obfuscate attacks, making them significantly harder to detect, thereby improving their effectiveness. Our findings serve as an important building block in the analysis of byzantine FL, contributing through the development of new attacks and new defence mechanisms, further advancing the robustness of KD-based FL algorithms.

98Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance

[openreview] [pdf]

Abstract State-of-the-art text-to-image (T2I) diffusion models often struggle to generate rare compositions of concepts, e.g., objects with unusual attributes. In this paper, we show that the compositional generation power of diffusion models on such rare concepts can be significantly enhanced by Large Language Model (LLM) guidance. We start with empirical and theoretical analysis, demonstrating that exposing frequent concepts relevant to the target rare concepts during the diffusion sampling process yields more accurate concept composition. Based on this, we propose a training-free approach, R2F, that plans and executes the overall rare-to-frequent concept guidance throughout the diffusion inference by leveraging the abundant semantic knowledge in LLMs. Our framework is flexible across any pre-trained diffusion models and LLMs, and can be seamlessly integrated with region-guided diffusion approaches. In extensive experiments on three datasets, including our newly proposed benchmark RareBench, which contains various prompts with rare compositions of concepts, R2F significantly surpasses existing models, including SD3.0 and FLUX, by up to 28.1%p in T2I alignment.

99Continual Learning After Model Deployment

[openreview] [pdf]

Abstract This paper studies continual learning after model deployment. A real-world application environment is often an open world filled with novel or out-of-distribution (OOD) objects that have not been seen before. We call continual learning in such an environment open-world continual learning (OWCL). OWCL incrementally performs two main tasks: (1) detecting OOD objects, and (2) continually learning the OOD or new objects on the fly. Although OOD detection and continual learning have been extensively studied separately, their combination for OWCL has barely been attempted. This is perhaps because, in addition to the existing challenges of OOD detection and continual learning such as catastrophic forgetting (CF), OWCL also faces the challenge of data scarcity. As novel objects appear sporadically, when an object from a new/novel class is detected, it is difficult to learn it from one or a few samples with good accuracy. This paper proposes a novel method called OpenLD to deal with these problems based on linear discriminant analysis (LDA) and a pre-trained model. This method enables OOD detection and incremental learning of the detected samples on the fly with no CF. Experimental evaluation demonstrates the effectiveness of OpenLD.
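
A minimal sketch of an LDA-style learner in the spirit of OpenLD, assuming frozen pre-trained features: class means are updated incrementally (no gradient training, hence no CF), and a Mahalanobis-distance threshold flags OOD inputs. The names, the fixed covariance, and the threshold rule here are illustrative, not the paper's exact design.

```python
import numpy as np

class StreamingLDA:
    """Class means plus a shared covariance over frozen features; new
    classes are added on the fly and OOD inputs are flagged by distance."""
    def __init__(self, dim, ood_threshold):
        self.means, self.counts = {}, {}
        self.dim, self.tau = dim, ood_threshold
        self.cov = np.eye(dim)   # kept fixed in this sketch

    def distances(self, feat):
        inv = np.linalg.pinv(self.cov)
        return {c: (feat - m) @ inv @ (feat - m)
                for c, m in self.means.items()}

    def predict(self, feat):
        d = self.distances(feat)
        if not d or min(d.values()) > self.tau:
            return "OOD"          # novel object: trigger on-the-fly learning
        return min(d, key=d.get)

    def update(self, feat, label):
        # one-sample incremental mean update; works from a single example
        n = self.counts.get(label, 0)
        mu = self.means.get(label, np.zeros(self.dim))
        self.means[label] = (n * mu + feat) / (n + 1)
        self.counts[label] = n + 1
```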

100Transition Path Sampling with Improved Off-Policy Training of Diffusion Path Samplers

[openreview] [pdf]

Abstract Understanding transition pathways between meta-stable states in molecular systems is crucial to advance material design and drug discovery. However, unbiased molecular dynamics simulations are computationally infeasible due to the high energy barriers separating these states. Although recent machine learning techniques offer potential solutions, they are often limited to simple systems or rely on collective variables (CVs) derived from costly domain expertise. In this paper, we introduce a novel approach that trains diffusion path samplers (DPS) for transition path sampling (TPS) without the need for CVs. We recast the problem as an amortized sampling of the target path measure, minimizing the log-variance divergence between the path measure induced by our DPS and the target path measure. To ensure scalability for high-dimensional tasks, we introduce (1) a new off-policy training objective based on learning control variates with replay buffers and (2) a scale-based equivariant parameterization of the bias forces. We evaluate our approach, coined TPS-DPS, on a synthetic double-well potential and three peptides: Alanine Dipeptide, Polyproline Helix, and Chignolin. Results show that our approach produces more realistic and diverse transition pathways compared to existing baselines. We also provide links toproject pageandcode.

101One Model to Train Them All: A Unified Diffusion Framework for Multi-Context Neural Population Forecasting

[openreview] [pdf]

Abstract Recent research has revealed shared neural patterns among animals performing similar tasks and within individual animals across different tasks. This has led to a growing interest in replacing single-session latent variable models with a unified model that allows us to align recordings across different animals, sessions, and tasks, despite the challenge of distinct neuron identities in each recording. In this work, we present a conditioned diffusion framework to model population dynamics of neural activity across multiple contexts. The quality of the learned dynamics is evaluated through the model’s forecasting ability, which predicts multiple timesteps of both neural activity and behavior. Additionally, we introduce a benchmark dataset spanning six electrophysiology datasets, seven tasks, 19 animals, and 261 sessions, providing a standardized framework for multi-task neural population models. Our results demonstrate that the pretrained model can be efficiently adapted to novel, unseen sessions without requiring explicit neuron correspondence. This enables few-shot learning with minimal labeled data, as well as competitive performance in zero-shot learning.

102Knowledge Lift Alignment Fine Tuning

[openreview] [pdf]

Abstract We present a visual tuning framework, Knowledge Lift Alignment Fine Tuning (KLAFT), which enhances the expressive image captioning capabilities of Pre-trained Language Models (PLMs), including LLMs and VLMs. As this task involves generating more detailed and comprehensive captions than basic image descriptions, the core idea behind KLAFT is that fine-grained alignment can exploit the capabilities of PLMs and a given target domain dataset. This idea motivates and challenges us to explore a framework that deeply understands both given images and text for this alignment, tuning PLMs towards expressive image captioning. To this end, KLAFT modifies the attention mechanism (Modified Attention Mechanism, MAM) and develops a Topic Control Mechanism (TCM), along with their training objectives. The innovation of KLAFT lies in its approach to addressing disparities in knowledge: visual versus textual, via MAM, and source versus target domain, via TCM. As these hidden spaces are conceptualized as distinct sub-networks within the PLM, each possessing specific knowledge, KLAFT’s unique contribution lies in aligning and adjusting the weights of these sub-networks in a fine-grained manner, and fine-tuning the PLM accordingly. Our empirical studies demonstrate that KLAFT significantly improves expressive captioning tasks by aligning and amplifying target knowledge, with the potential for Parameter-Efficient Fine-Tuning (PEFT) at low computational cost.

103When do GFlowNets learn the right distribution?

[openreview] [pdf]

Abstract Generative Flow Networks (GFlowNets) are an emerging class of sampling methods for distributions over discrete and compositional objects, e.g., graphs. In spite of their remarkable success in problems such as drug discovery and phylogenetic inference, the question of when and whether GFlowNets learn to sample from the target distribution remains underexplored. To tackle this issue, we first assess the extent to which a violation of the detailed balance of the underlying flow network might hamper the correctness of GFlowNet’s sampling distribution. In particular, we demonstrate that the impact of an imbalanced edge on the model’s accuracy is influenced by the total amount of flow passing through it and, as a consequence, is unevenly distributed across the network. We also argue that, depending on the parameterization, imbalance may be inevitable. In this regard, we consider the problem of sampling from distributions over graphs with GFlowNets parameterized by graph neural networks (GNNs) and show that the representation limits of GNNs delineate which distributions these GFlowNets can approximate. Lastly, we address these limitations by proposing a theoretically sound and computationally tractable metric for assessing GFlowNets, experimentally showing it is a better proxy for correctness than popular evaluation protocols.

104Efficient Fairness-Performance Pareto Front Computation

[openreview] [pdf]

Abstract There is a well known intrinsic trade-off between the fairness of a representation and the performance of classifiers derived from the representation. Due to the complexity of optimisation algorithms in most modern representation learning approaches, for a given method it may be non-trivial to decide whether the obtained fairness-performance curve of the method is optimal, i.e., whether it is close to the true Pareto front for these quantities for the underlying data distribution. In this paper we propose a new method to compute the optimal Pareto front, which does not require the training of complex representation models. We show that optimal fair representations possess several useful structural properties, and that these properties enable a reduction of the computation of the Pareto front to a compact discrete problem. We then also show that these compact approximating problems can be efficiently solved via off-the-shelf concave-convex programming methods. Finally, in addition to representations, we show that the new methods may also be used to directly compute the Pareto front of fair classification problems. Since our approach is independent of the specific model of representations, it may be used as the benchmark to which representation learning algorithms, or classifiers, may be compared. We experimentally evaluate the approach on a number of real world benchmark datasets.

105Combating Dual Noise Effect in Spatial-temporal Forecasting via Information Bottleneck Principle

[openreview] [pdf]

Abstract Spatial-temporal forecasting plays a pivotal role in urban planning and computing. Although Spatial-Temporal Graph Neural Networks (STGNNs) excel in modeling spatial-temporal dynamics, they often suffer from relatively poor computational efficiency. Recently, Multi-Layer Perceptrons (MLPs) have gained popularity in spatial-temporal forecasting for their simplified architecture and better efficiency. However, existing MLP-based models can be susceptible to noise interference, especially when the noise can affect both input and target sequences in spatial-temporal forecasting on noisy data. To alleviate this impact, we propose the Robust Spatial-Temporal Information Bottleneck (RSTIB) principle. The RSTIB extends previous Information Bottleneck (IB) approaches by lifting the specific Markov assumption without impairing the IB nature. Then, by explicitly minimizing the irrelevant noisy information, the representation learning guided by RSTIB can be more robust against noise interference. Furthermore, the instantiation, RSTIB-MLP, can be seamlessly implemented with MLPs, thereby achieving efficient and robust spatial-temporal modeling. Moreover, a training regime is designed to handle the dynamic nature of spatial-temporal relationships by incorporating a knowledge distillation module to alleviate feature collapse and enhance model robustness under noisy conditions. Our extensive experimental results on six intrinsically noisy benchmark datasets from various domains show that the RSTIB-MLP runs much faster than state-of-the-art STGNNs and delivers superior forecasting accuracy across noisy environments, substantiating its robustness and efficiency.

106Leveraging Knowledge Distillation to Mitigate Model Collapse

[openreview] [pdf]

Abstract Since the amount of data generated by neural networks on the Internet is growing rapidly due to widespread access to the corresponding models, it is natural to ask how this surge in synthetic data affects the training of subsequent models that will use it. Previous work has demonstrated a concerning trend: models trained predominantly on synthetic data often experience a decline in performance, which can escalate to a complete loss of the ability to reproduce the initial distribution of real-world data. This phenomenon, now referred to as model collapse, highlights the potential pitfalls of over-reliance on synthetic datasets, which may lack the diversity and complexity inherent in genuine data. To address this issue, we propose a novel method that leverages the well-established technique of knowledge distillation. Our approach aims to mitigate the adverse effects of synthetic data by facilitating a more effective transfer of knowledge from high-performing teacher models to a student model. By doing so, we seek to enhance not only the qualitative aspects—such as the richness and variability of the generated outputs—but also the quantitative metrics that gauge model performance. Through extensive experimentation, we demonstrate that our method improves the robustness and generalization capabilities of models trained on synthetic data; for instance, the enhancement for DDPM is 68.8% in terms of the FID metric, contributing to a more sustainable and effective use of synthetic datasets in machine learning applications.

107Robust Root Cause Diagnosis using In-Distribution Interventions

[openreview] [pdf]

Abstract Diagnosing the root cause of an anomaly in a complex interconnected system is a pressing problem in today’s cloud services and industrial operations. Effective root cause diagnosis calls for identifying nodes whose disrupted local mechanisms cause anomalous behavior at a target node. We propose IDI, a novel algorithm that predicts root cause as nodes that meet two criteria: 1) Anomaly: root cause nodes should take on anomalous values; 2) Fix: had the root cause nodes assumed usual values, the target node would not have been anomalous. Prior methods of assessing the fix condition rely on counterfactuals inferred from a Structural Causal Model (SCM) trained on historical data. But since anomalies are rare and fall outside the training distribution, the fitted SCMs yield unreliable counterfactual estimates. IDI overcomes this by relying on interventional estimates obtained by solely probing the fitted SCM at in-distribution inputs. Our theoretical analysis demonstrates that IDI’s in-distribution intervention approach outperforms other counterfactual estimation methods under mild assumptions about the data-generating process. Experiments on both synthetic and Petshop RCD benchmark datasets demonstrate that IDI consistently identifies true root causes more accurately and robustly than nine existing state-of-the-art RCD baselines. We release the anonymized code at https://anonymous.4open.science/r/petshop-BB8A/.
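
The two criteria can be sketched against a fitted SCM as follows; `scm_predict`, `anomaly_score`, `typical`, and `node_is_anomalous` are hypothetical helpers standing in for the fitted model's forward pass, the target's anomaly scorer, in-distribution reference values, and per-node anomaly flags.

```python
def is_root_cause(scm_predict, anomaly_score, observed, candidates,
                  typical, node_is_anomalous, tau):
    """Sketch of IDI's two checks on a fitted SCM (helper names are
    hypothetical). `scm_predict(values)` propagates node values through
    the SCM in topological order and returns the target node's value."""
    # 1) Anomaly: every candidate root-cause node must itself be anomalous.
    if not all(node_is_anomalous[n] for n in candidates):
        return False
    # 2) Fix: with candidates forced to typical (in-distribution) values,
    #    the SCM-predicted target should no longer be anomalous.
    intervened = dict(observed)
    for n in candidates:
        intervened[n] = typical[n]     # an in-distribution intervention
    return anomaly_score(scm_predict(intervened)) <= tau
```

Because the intervention replaces anomalous inputs with typical ones, the SCM is only ever queried inside its training distribution, which is the point of the method.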

108The Inductive Bias of Minimum-Norm Shallow Diffusion Models That Perfectly Fit the Data

[openreview] [pdf]

Abstract While diffusion models can generate high-quality images through the probability flow process, the theoretical understanding of this process is incomplete. A key open question is determining when the probability flow converges to the training samples used for denoiser training and when it converges to more general points on the data manifold. To address this, we analyze the probability flow of shallow ReLU neural network denoisers which interpolate the training data and have a minimal ℓ2 norm of the weights. For intuition, we also examine a simpler dynamics which we call the score flow, and demonstrate that, in the case of orthogonal datasets, the score flow and probability flow follow similar trajectories. Both flows converge to a training point or a sum of training points. However, due to early stopping induced by the scheduler, the probability flow can also converge to a general point on the data manifold. This result aligns with empirical observations that diffusion models tend to memorize individual training examples and reproduce them during testing. Moreover, diffusion models can combine memorized foreground and background objects, indicating they can learn a “semantic sum” of training points. We generalize these results from the orthogonal dataset case to scenarios where the clean data points lie on an obtuse simplex. Simulations further confirm that the probability flow converges to one of the following: a training point, a sum of training points, or a point on the data manifold.

109Mitigating Distribution Shifts: Uncertainty-Aware Offline-to-Online Reinforcement Learning

[openreview] [pdf]

Abstract Deploying reinforcement learning (RL) policies in real-world scenarios faces challenges due to distribution shifts from training environments. Past approaches have shown limitations such as poor generalization to out-of-distribution (OOD) variations or requiring extensive retraining on new data. We propose Uncertainty-aware Adaptive RL, UARL, a novel RL pipeline that enhances policy generalization across diverse variations of a given environment. UARL frames distribution shifts as OOD problems and incorporates a new OOD detection method to quantify uncertainty. This approach enables iterative policy fine-tuning, starting with offline training on a limited state space and progressively expanding to more diverse variations of the same environment through online interactions. We demonstrate the effectiveness and robustness of UARL through extensive experiments on continuous control tasks, showing improved performance and sample efficiency as well as reliability in OOD detection compared to existing methods.

110Diffusion Bridge Implicit Models

[openreview] [pdf]

Abstract Denoising diffusion bridge models (DDBMs) are a powerful variant of diffusion models for interpolating between two arbitrary paired distributions given as endpoints. Despite their promising performance in tasks like image translation, DDBMs require a computationally intensive sampling process that involves the simulation of a (stochastic) differential equation through hundreds of network evaluations. In this work, we take the first step in fast sampling of DDBMs without extra training, motivated by the well-established recipes in diffusion models. We generalize DDBMs via a class of non-Markovian diffusion bridges defined on the discretized timesteps concerning sampling, which share the same marginal distributions and training objectives, and give rise to generative processes ranging from stochastic to deterministic, resulting in diffusion bridge implicit models (DBIMs). DBIMs are not only up to 25× faster than the vanilla sampler of DDBMs but also induce a novel, simple, and insightful form of ordinary differential equation (ODE) which inspires high-order numerical solvers. Moreover, DBIMs maintain the generation diversity in a distinguished way, by using a booting noise in the initial sampling step, which enables faithful encoding, reconstruction, and semantic interpolation in image translation tasks.

111Scaling Diffusion Models for Downstream Prediction

[openreview] [pdf]

Abstract In this paper, we argue that iterative computation, as exemplified by diffusion models, offers a powerful paradigm not only for image generation but also for visual perception tasks. First, we unify several mid-level vision tasks, ranging from depth estimation to optical flow to segmentation, as image-to-image translation tasks. Then, through extensive experiments across these tasks, we demonstrate how diffusion models scale with increased compute during both training and inference. Notably, we train various dense and Mixture-of-Experts models with up to 2.8 billion parameters, and we increase compute at test time through more sampling steps and various ensembling methods. Our work provides compelling evidence for the benefits of scaling compute at train and test time for diffusion models for visual perception, and by studying the scaling properties carefully, we are able to achieve the same performance as state-of-the-art models with less compute.

112On the onset of memorization to generalization transition in diffusion models

[openreview] [pdf]

Abstract As the training set size increases, diffusion models have been observed to transition from memorizing the training dataset to generalizing to and sampling from the underlying data distribution. To study this phenomenon more closely, we first present a mathematically principled definition of this transition: the model is said to be in the generalization regime if the generated distribution is closer to the sampling distribution than to the probability distribution associated with a Gaussian kernel approximation to the training dataset. Then, we develop an analytically tractable diffusion model that features this transition when the training data is sampled from an isotropic Gaussian distribution. Our study reveals that this transition occurs when the distance between the generated and underlying sampling distribution begins to decrease rapidly with the addition of more training samples. This is to be contrasted with an alternative scenario, where the model’s memorization performance degrades but its generalization performance does not improve. We also provide empirical evidence indicating that realistic diffusion models exhibit the same alignment of scales.

113Pareto Prompt Optimization

[openreview] [pdf]

Abstract Natural language prompt optimization, or prompt engineering, has emerged as a powerful technique to unlock the potential of Large Language Models (LLMs) for various tasks. While existing methods primarily focus on maximizing a single task-specific performance metric for LLM outputs, real-world applications often require considering trade-offs between multiple objectives. In this work, we address this limitation by proposing an effective technique for multi-objective prompt optimization for LLMs. Specifically, we propose ParetoPrompt, a reinforcement learning (RL) method that leverages dominance relationships between prompts to derive a policy model for prompt optimization using preference-based loss functions. By leveraging multi-objective dominance relationships, ParetoPrompt enables efficient exploration of the entire Pareto front without the need for a predefined scalarization of multiple objectives. Our experimental results show that ParetoPrompt consistently outperforms existing algorithms that use specific objective values. ParetoPrompt also yields robust performance when the objective metrics differ between training and testing.

114Decouple-Then-Merge: Towards Better Training for Diffusion Models

[openreview] [pdf]

Abstract Diffusion models are trained by learning a sequence of models that reverse each step of noise corruption. Typically, the model parameters are fully shared across multiple timesteps to enhance training efficiency. However, since the denoising tasks differ at each timestep, the gradients computed at different timesteps may conflict, potentially degrading the overall performance of image generation. To solve this issue, this work proposes a Decouple-then-Merge (DeMe) framework, which begins with a pretrained model and finetunes separate models tailored to specific timesteps. We introduce several improved techniques during the finetuning stage to promote effective knowledge sharing while minimizing training interference across timesteps. Finally, after finetuning, these separate models can be merged into a single model in parameter space, ensuring efficient and practical inference. Experimental results show significant generation quality improvements on 6 benchmarks, including Stable Diffusion on COCO30K, ImageNet1K, and PartiPrompts, and DDPM on LSUN Church, LSUN Bedroom, and CIFAR10. Code is included in the supplementary material and will be released on GitHub.
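
The merging step itself amounts to parameter-space averaging of the timestep-specialized checkpoints; a sketch is below (the paper's contribution lies mainly in the finetuning techniques that make such an average work well).

```python
import torch

def merge_timestep_models(state_dicts, weights=None):
    """Average the timestep-specialized finetunes back into one model.
    (Integer buffers, e.g. BatchNorm counters, would need special-casing
    in a real implementation.)"""
    weights = weights or [1.0 / len(state_dicts)] * len(state_dicts)
    merged = {}
    for key in state_dicts[0]:
        merged[key] = sum(w * sd[key].float()
                          for w, sd in zip(weights, state_dicts))
    return merged
```

The merged dictionary can then be loaded with `model.load_state_dict(merged)`, so inference cost is identical to the original single model.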

115Tra-MoE: Scaling Trajectory Prediction Models for Adaptive Policy Conditioning

[openreview] [pdf]

Abstract Scale is a primary factor that influences the performance and generalization of a robot learning system. In this paper, we aim to scale up the trajectory prediction model by using broad out-of-domain data to improve its robustness and generalization ability. The trajectory model is designed to predict any-point trajectories in the current frame given an instruction, and can provide detailed control guidance for robotic policy learning. To handle the diverse out-of-domain data distribution, we propose a sparsely-gated MoE (Top-1 gating strategy) architecture for the trajectory model, coined Tra-MoE. The sparse activation design enables a good balance between parameter cooperation and specialization, effectively benefiting from large-scale out-of-domain data while maintaining constant FLOPs per token. In addition, we further introduce an adaptive policy conditioning technique by learning 2D mask representations for predicted trajectories, which are explicitly aligned with image observations to guide policy prediction more flexibly. We perform experiments in both simulation and real-world scenarios to verify the effectiveness of our Tra-MoE and the adaptive policy conditioning technique. We jointly train the Tra-MoE model on all 130 tasks in the LIBERO benchmark and conduct a comprehensive empirical analysis, demonstrating that Tra-MoE consistently exhibits superior performance compared to the dense baseline model, even when the latter is scaled to match Tra-MoE’s parameter count.

116AutoLoRA: AutoGuidance Meets Low-Rank Adaptation for Diffusion Models

[openreview] [pdf]

Abstract Low-rank adaptation (LoRA) is a fine-tuning technique that can be applied to conditional generative diffusion models. LoRA utilizes a small number of context examples to adapt the model to a specific domain, character, style, or concept. However, due to the limited data utilized during training, the fine-tuned model performance is often characterized by strong context bias and a low degree of variability in the generated images. To solve this issue, we introduce AutoLoRA, a novel guidance technique for diffusion models fine-tuned with the LoRA approach. Inspired by other guidance techniques, AutoLoRA searches for a trade-off between consistency in the domain represented by LoRA weights and sample diversity from the base conditional diffusion model. Moreover, we show that incorporating classifier-free guidance for both LoRA fine-tuned and base models leads to generating samples with higher diversity and better quality. The experimental results for several fine-tuned LoRA domains show superiority over existing guidance techniques on selected metrics.

117Amortized Posterior Sampling with Diffusion Prior Distillation

[openreview] [pdf]

Abstract We propose Amortized Posterior Sampling (APS), a novel variational inference approach for efficient posterior sampling in inverse problems. Our method trains a conditional flow model to minimize the divergence between the variational distribution and the posterior distribution implicitly defined by the diffusion model. This results in a powerful, amortized sampler capable of generating diverse posterior samples with a single neural function evaluation, generalizing across various measurements. Unlike existing methods, our approach is unsupervised, requires no paired training data, and is applicable to both Euclidean and non-Euclidean domains. We demonstrate its effectiveness on a range of tasks, including image restoration, manifold signal reconstruction, and climate data imputation. APS significantly outperforms existing approaches in computational efficiency while maintaining competitive reconstruction quality, enabling real-time, high-quality solutions to inverse problems across diverse domains.

118GUIDE: Guidance-based Incremental Learning with Diffusion Models

[openreview] [pdf]

Abstract We introduce GUIDE, a novel continual learning approach that directs diffusion models to rehearse samples at risk of being forgotten. Existing generative strategies combat catastrophic forgetting by randomly sampling rehearsal examples from a generative model. Such an approach contradicts buffer-based approaches where sampling strategy plays an important role. We propose to bridge this gap by incorporating classifier guidance into the diffusion process to produce rehearsal examples specifically targeting information forgotten by a continuously trained model. This approach enables the generation of samples from preceding task distributions, which are more likely to be misclassified in the context of recently encountered classes. Our experimental results show that GUIDE significantly reduces catastrophic forgetting, outperforming conventional random sampling approaches and surpassing recent state-of-the-art methods in continual learning with generative replay.

119Learning-Augmented Frequent Directions

[openreview] [pdf]

Abstract An influential paper of Hsu et al. (ICLR’19) introduced the study of learning-augmented streaming algorithms in the context of frequency estimation. A fundamental problem in the streaming literature, the goal of frequency estimation is to approximate the number of occurrences of items appearing in a long stream of data using only a small amount of memory. Hsu et al. develop a natural framework to combine the worst-case guarantees of popular solutions such as CountMin and CountSketch with learned predictions of high frequency elements. They demonstrate that learning the underlying structure of data can be used to yield better streaming algorithms, both in theory and practice. We simplify and generalize past work on learning-augmented frequency estimation. Our first contribution is a learning-augmented variant of the Misra-Gries algorithm which improves upon the error of learned CountMin and learned CountSketch and achieves the state-of-the-art performance of randomized algorithms (Aamand et al., NeurIPS’23) with a simpler, deterministic algorithm. Our second contribution is to adapt learning-augmentation to a high-dimensional generalization of frequency estimation corresponding to finding important directions (top singular vectors) of a matrix given its rows one-by-one in a stream. We analyze a learning-augmented variant of the Frequent Directions algorithm, extending the theoretical and empirical understanding of learned predictions to matrix streaming.
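
For intuition, one simple way to augment Misra-Gries with predictions is to give oracle-predicted heavy hitters dedicated exact counters and run the classic algorithm on the remaining items. The sketch below conveys this flavor only; the paper's actual algorithm and its error guarantees are more refined.

```python
from collections import Counter

def learned_misra_gries(stream, predicted_heavy, k):
    """Illustrative learning-augmented variant: predicted heavy hitters
    get exact counters, everything else goes through standard Misra-Gries
    with k counters."""
    exact = Counter()    # exact counts for oracle-predicted heavy hitters
    mg = {}              # Misra-Gries summary for the remaining items
    for item in stream:
        if item in predicted_heavy:
            exact[item] += 1
        elif item in mg:
            mg[item] += 1
        elif len(mg) < k:
            mg[item] = 1
        else:            # classic decrement-all step of Misra-Gries
            for key in list(mg):
                mg[key] -= 1
                if mg[key] == 0:
                    del mg[key]
    return exact, mg
```

A good oracle removes the dominant items from the Misra-Gries table, which is what shrinks the estimation error on everything else.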

120GUARANTEED USER FAIRNESS IN RECOMMENDATION

[openreview] [pdf]

Abstract Although recommender systems (RS) have been well-developed for various fields of applications, they suffer from the crisis of platform credibility with respect to RS confidence and fairness, which may drive users away from the platform and undermine the platform’s long-term success. In recent years, a few works have tried to solve either the model confidence or fairness issue, while there is no statistical guarantee for these methods. There is therefore an urgent need to solve both issues with a unifying framework with statistical guarantees. In this paper, we propose a novel and reliable framework called Guaranteed User Fairness in Recommendation (GUFR) to dynamically generate prediction sets for users across various groups, which are guaranteed 1) to include the ground-truth items with user-predefined high confidence/probability (e.g., 90%); 2) to ensure user fairness across different groups; 3) to have the minimum average set size. We further design an efficient algorithm named Guaranteed User Fairness Algorithm (GUFA) to optimize the proposed method, and upper bounds of the risk and fairness metric are derived to help speed up the optimization process. Moreover, we provide rigorous theoretical analysis with respect to risk and fairness control as well as the minimum set size. Extensive experiments also validate the effectiveness of the proposed framework, which aligns with our theoretical analysis. The code is publicly available at https://anonymous.4open.science/r/GUFR-76EC.

121SafeDiffuser: Safe Planning with Diffusion Probabilistic Models

[openreview] [pdf]

Abstract Diffusion models have shown promise in data-driven planning. While these planners are commonly employed in applications where decisions are critical, they still lack established safety guarantees. In this paper, we address this limitation by introducing SafeDiffuser, a method to equip diffusion models with safety guarantees via control barrier functions. The key idea of our approach is to embed finite-time diffusion invariance, i.e., a form of specification consisting of safety constraints, into the denoising diffusion procedure. This way we enable data generation under safety constraints. We show that SafeDiffusers maintain the generative performance of diffusion models while also providing robustness in safe data generation. We evaluate our method on a series of tasks, including maze path generation, legged robot locomotion, and 3D space manipulation, and demonstrate the advantages of robustness over vanilla diffusion models.

122Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding

[openreview] [pdf]

Abstract Diffusion models excel at capturing the natural design spaces of images, molecules, DNA, RNA, and protein sequences. However, rather than merely generating designs that are natural, we often aim to optimize downstream reward functions while preserving the naturalness of these design spaces. Existing methods for achieving this goal often require differentiable proxy models (e.g., classifier guidance or DPS) or involve computationally expensive fine-tuning of diffusion models (e.g., classifier-free guidance, RL-based fine-tuning). In our work, we propose a new method to address these challenges. Our algorithm is an iterative sampling method that integrates soft value functions, which look ahead to how intermediate noisy states lead to high rewards in the future, into the standard inference procedure of pre-trained diffusion models. Notably, our approach avoids fine-tuning generative models and eliminates the need to construct differentiable models. This enables us to (1) directly utilize non-differentiable features/reward feedback, commonly used in many scientific domains, and (2) apply our method to recent discrete diffusion models in a principled way. Finally, we demonstrate the effectiveness of our algorithm across several domains, including image generation, molecule generation, and DNA/RNA sequence generation.
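
A minimal sketch of one such value-guided denoising step, under stated assumptions: `proposal` and `value` are hypothetical stand-ins for the pre-trained sampler's transition and the soft value estimate, and candidates are resampled with softmax weights, so no reward gradients are required.

```python
import numpy as np

def value_guided_step(x_t, proposal, value, n_candidates=8, temperature=0.1,
                      rng=None):
    """One derivative-free, value-guided denoising step (illustrative).

    proposal(x_t) -> a candidate next state from the pre-trained model;
    value(x)      -> scalar look-ahead estimate of the downstream reward.
    """
    rng = rng or np.random.default_rng()
    candidates = [proposal(x_t) for _ in range(n_candidates)]
    v = np.array([value(c) for c in candidates])
    w = np.exp((v - v.max()) / temperature)  # softmax weights, shifted for stability
    w /= w.sum()
    return candidates[rng.choice(n_candidates, p=w)]
```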

123Prompt-Agnostic Erasure for Diffusion Models Using Task Vectors

[openreview] [pdf]

Abstract With the rapid growth of text-to-image models, a variety of techniques have been suggested to prevent undesirable image generations. Yet, these methods often only protect against specific user prompts and have been shown to allow undesirable generations with other inputs. Here we focus on unconditionally erasing a concept from a text-to-image model rather than conditioning the erasure on the user’s prompt. We first show that compared to input-dependent erasure methods, concept erasure that uses Task Vectors (TV) is more robust to unexpected user inputs, not seen during training. However, TV-based erasure can also affect the core performance of the edited model, particularly when the required edit strength is unknown. To this end, we propose a method called Diverse Inversion, which we use to estimate the required strength of the TV edit. Diverse Inversion finds within the model input space a large set of word embeddings, each of which induces the generation of the target concept. We find that encouraging diversity in the set makes our estimation more robust to unexpected prompts. Finally, we show that Diverse Inversion enables us to apply a TV edit only to a subset of the model weights, enhancing the erasure capabilities while better maintaining model utility.

124SePPO: Semi-Policy Preference Optimization for Diffusion Alignment

[openreview] [pdf]

Abstract Reinforcement learning from human feedback (RLHF) methods are emerging as a way to fine-tune diffusion models (DMs) for visual generation. However, commonly used on-policy strategies are limited by the generalization capability of the reward model, while off-policy approaches require large amounts of difficult-to-obtain paired human-annotated data, particularly in visual generation tasks. To address the limitations of both on- and off-policy RLHF, we propose a preference optimization method that aligns DMs with preferences without relying on reward models or paired human-annotated data. Specifically, we introduce a Semi-Policy Preference Optimization (SePPO) method. SePPO leverages previous checkpoints as reference models while using them to generate on-policy reference samples, which replace “losing images” in preference pairs. This approach allows us to optimize using only off-policy “winning images”. Furthermore, we design a strategy for reference model selection that expands the exploration in the policy space. Notably, we do not simply treat reference samples as negative examples for learning. Instead, we design an anchor-based criterion to assess whether the reference samples are likely to be winning or losing images, allowing the model to selectively learn from the generated reference samples. This approach mitigates performance degradation caused by the uncertainty in reference sample quality. We validate SePPO across both text-to-image and text-to-video benchmarks. SePPO surpasses all previous approaches on the text-to-image benchmarks and also demonstrates outstanding performance on the text-to-video benchmarks.

125Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization

[openreview] [pdf]

Abstract Generative Flow Networks (GFlowNets) are a family of generative models that learn to sample objects with probabilities proportional to a given reward function. The key concept behind GFlowNets is the use of two stochastic policies: a forward policy, which incrementally constructs compositional objects, and a backward policy, which sequentially deconstructs them. Recent results show a close relationship between GFlowNet training and entropy-regularized reinforcement learning (RL) problems with a particular reward design. However, this connection applies only in the setting of a fixed backward policy, which might be a significant limitation. As a remedy to this problem, we introduce a simple backward policy optimization algorithm that involves direct maximization of the value function in an entropy-regularized Markov Decision Process (MDP) over intermediate rewards. We provide an extensive experimental evaluation of the proposed approach across various benchmarks in combination with both RL and GFlowNet algorithms and demonstrate its faster convergence and mode discovery in complex environments.

126DEALING WITH OUT OF DISTRIBUTION IN PREDICTION PROBLEM

[openreview] [pdf]

Abstract The open-world assumption in model development means that a model may not have enough information to effectively handle data that is completely different or out of distribution (OOD). When a model encounters OOD data, it may suffer a significant decrease in performance. Addressing OOD data requires extensive fine-tuning and experimental trials, which in turn require substantial computational resources. Deep learning has been suggested as a solution and has shown significant improvements, but it often requires high-specification hardware, particularly GPUs, which may not always be readily available to general users. Additionally, there is a lack of clear guidance for common users on how to select and evaluate OOD data. This study delves into detection, evaluation, and prediction tasks within the context of OOD on tabular datasets. It demonstrates how common users can identify OOD data from real datasets and provides guidance on evaluating the OOD selection through experiments and visualizations. Furthermore, the study introduces tabular contrast learning (TCL), an enhanced technique specifically designed for tabular prediction tasks. TCL is more efficient compared to other baseline models, making it useful for general machine learning users with computational limitations when dealing with OOD problems. The study also includes a comprehensive comparison with existing approaches, focusing on both accuracy and computational efficiency.

127Statistical Test on Diffusion Model-based Anomaly Detection by Selective Inference

[openreview] [pdf]

Abstract Advancements in AI image generation, particularly diffusion models, have progressed rapidly. However, the absence of an established framework for quantifying the reliability of AI-generated images hinders their use in critical decision-making tasks, such as medical image diagnosis. In this study, we address the task of detecting anomalous regions in medical images using diffusion models and propose a statistical method to quantify the reliability of the detected anomalies. The core concept of our method involves a selective inference framework, wherein statistical tests are conducted under the condition that the images are produced by a diffusion model. With our approach, the statistical significance of anomaly detection results can be quantified in the form of a p-value, enabling decision-making with controlled error rates, as is standard in medical practice. We demonstrate the theoretical soundness and practical effectiveness of our statistical test through numerical experiments on both synthetic and brain image datasets.

128Episodic Novelty Through Temporal Distance

[openreview] [pdf]

Abstract Exploration in sparse reward environments remains a significant challenge in reinforcement learning, particularly in Contextual Markov Decision Processes (CMDPs), where environments differ across episodes. Existing episodic intrinsic motivation methods for CMDPs primarily rely on count-based approaches, which are ineffective in large state spaces, or on similarity-based methods that lack appropriate metrics for state comparison. To address these shortcomings, we propose Episodic Novelty Through Temporal Distance (ETD), a novel approach that introduces temporal distance as a robust metric for state similarity and intrinsic reward computation. By employing contrastive learning, ETD accurately estimates temporal distances and derives intrinsic rewards based on the novelty of states within the current episode. Extensive experiments on various benchmark tasks demonstrate that ETD significantly outperforms state-of-the-art methods, highlighting its effectiveness in enhancing exploration in sparse reward CMDPs.

129On Inductive Biases That Enable Generalization in Diffusion Transformers

[openreview] [pdf]

Abstract Recent work studying the generalization of diffusion models with UNet-based denoisers reveals inductive biases that can be expressed via geometry-adaptive harmonic bases. However, in practice, more recent denoising networks are often based on transformers, e.g., the diffusion transformer (DiT). This raises the question: do transformer-based denoising networks exhibit inductive biases that can also be expressed via geometry-adaptive harmonic bases? To our surprise, we find that this is not the case. This discrepancy motivates our search for the inductive bias that can lead to good generalization in DiT models. Investigating a DiT’s pivotal attention modules, we find that the locality of attention maps is closely associated with generalization. To verify this finding, we modify the generalization of a DiT by restricting its attention windows. We inject local attention windows into a DiT and observe an improvement in generalization. Furthermore, we empirically find that both the placement and the effective attention size of these local attention windows are crucial factors. Experimental results on the CelebA, ImageNet, and LSUN datasets show that strengthening the inductive bias of a DiT can improve both generalization and generation quality when less training data is available. Source code will be released publicly upon paper publication.

130CURIOSITY IS THE PATH TO OPTIMIZATION

[openreview] [pdf]

Abstract In PAC theory, it is posited that larger hypothesis spaces necessitate more independently and identically distributed (i.i.d) data to maintain the accuracy of model performance. PAC-MDP theory defines curiosity by assigning higher rewards for visiting states that are far from the previously visited trajectory, which supports more independent and i.i.d data collection. Recently, this field has witnessed attempts to narrow the hypothesis space by developing additional mechanisms that train multiple skills and facilitate the sharing of information among them, thereby discovering commonalities. However, one might wonder: What if curiosity could not only enhance the efficiency of data collection but also significantly reduce the hypothesis space, thereby driving optimal outcomes independently without the additional mechanisms used in PAC-MDP? Significant discussion has been devoted to the reduction of hypothesis spaces and the utilization of curiosity. Within this context, contrastive multi-skill reinforcement learning (RL) exhibits both traits. Previous research in contrastive multi-skill RL has utilized this technique primarily as a form of pretraining. However, there has been scant investigation into whether the technique itself can reduce the hypothesis space to optimize the outcomes. We have mathematically proven that curiosity provides bounds to guarantee optimality in contrastive multi-skill RL. Additionally, we have leveraged these findings to develop an algorithm that is applicable in real-world scenarios, which has been demonstrated to surpass other prominent algorithms. Furthermore, our experiments have shown that different skills are actually reducing the hypothesis space of the policy by being hierarchically grouped.

131Latent Abstractions in Generative Diffusion Models

[openreview] [pdf]

Abstract In this work, we study how diffusion-based generative models produce high-dimensional data, such as an image, by implicitly relying on a manifestation of a low-dimensional set of latent abstractions, that guide the generative process. We present a novel theoretical framework that extends Nonlinear Filtering (NLF), and that offers a unique perspective on SDE-based generative models. The development of our theory relies on NLF, including a novel formulation of the joint (state and measurement) dynamics, and an information-theoretic measure of the influence of the system state on the measurement process. According to our theory, diffusion models can be cast as a system of SDEs, describing a non-linear filter in which the evolution of unobservable latent abstractions steers the dynamics of an observable measurement process (corresponding to the generative pathways). In addition, we present an empirical study to validate our theory and previous empirical results on the emergence of latent abstractions at different stages of the generative process.

132Fast Diversity-Preserving Reward Finetuning of Diffusion Models via Nabla-GFlowNets

[openreview] [pdf]

Abstract While one commonly trains large diffusion models by collecting datasets on target downstream tasks, it is often desired to finetune pretrained diffusion models on some reward functions that are either designed by experts or learned from small-scale datasets. Existing methods for finetuning diffusion models typically suffer either 1) lack of diversity in generated samples, or 2) costly finetuning and slow convergence. Inspired by recent successes in generative flow networks (GFlowNets), a class of probabilistic models that sample objects with probability proportional to an unnormalized reward density, we propose a novel GFlowNet method dubbed Nabla-GFlowNet (abbreviated as ∇-GFlowNet), together with an objective called ∇-DB, plus its variant residual ∇-DB for finetuning pretrained diffusion models. These objectives leverage the rich signal in reward gradients for diversity-aware finetuning. We empirically show that our proposed residual ∇-DB achieves fast yet diversity- & prior-preserving finetuning of Stable Diffusion, a large-scale text-conditioned image diffusion model, on different realistic reward functions.

133Constrained Diffusion Implicit Models

[openreview] [pdf]

Abstract This paper describes an efficient algorithm for solving noisy linear inverse problems using pretrained diffusion models. Extending the paradigm of denoising diffusion implicit models (DDIM), we propose conditional diffusion implicit models (CDIM) that modify the diffusion updates to enforce a constraint upon the final output. For noiseless inverse problems, CDIM exactly satisfies the constraints; in the noisy case, we generalize CDIM to satisfy an exact constraint on the residual distribution of the noise. Experiments across a variety of tasks and metrics show strong performance of CDIM, with analogous inference acceleration to unconditional DDIM: 10 to 50 times faster than previous conditional diffusion methods. We demonstrate the versatility of our approach on many problems including super-resolution, denoising, inpainting, deblurring, and 3D point cloud reconstruction.
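
To make the constrained-update idea concrete, here is a hedged sketch of a single deterministic DDIM-style step with a data-consistency correction for a linear observation y = Ax; the simple gradient step on the x0-estimate is illustrative, not the paper's exact constraint enforcement.

```python
import numpy as np

def constrained_ddim_step(x_t, eps_pred, alpha_t, alpha_prev, A, y, step=1.0):
    """One DDIM update with a hypothetical data-consistency correction.

    alpha_t / alpha_prev are cumulative noise-schedule products; eps_pred is
    the denoiser's noise prediction at the current timestep.
    """
    # Standard DDIM x0-prediction from the noisy sample.
    x0 = (x_t - np.sqrt(1.0 - alpha_t) * eps_pred) / np.sqrt(alpha_t)
    # Pull the clean estimate toward the measurement constraint A x0 = y.
    x0 = x0 - step * A.T @ (A @ x0 - y)
    # Deterministic DDIM transition to the previous timestep.
    return np.sqrt(alpha_prev) * x0 + np.sqrt(1.0 - alpha_prev) * eps_pred
```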

134Uncertainty Prioritized Experience Replay

[openreview] [pdf]

Abstract Prioritized experience replay, which improves sample efficiency by selecting relevant transitions to update parameter estimates, is a crucial component of contemporary deep reinforcement learning models. Typically, transitions are prioritized based on their temporal difference error. However, this approach is prone to favoring noisy transitions, even when the value estimation closely approximates the target mean. This phenomenon resembles the noisy TV problem postulated in the exploration literature, in which exploration-guided agents get stuck by mistaking noise for novelty. To mitigate the disruptive effects of noise in value estimation, we propose using epistemic uncertainty to guide the prioritization of transitions from the replay buffer. Epistemic uncertainty quantifies the uncertainty that can be reduced by learning, hence reducing the sampling of buffer transitions generated by unpredictable random processes. We first illustrate the benefits of epistemic uncertainty prioritized replay in two tabular toy models: a simple multi-arm bandit task, and a noisy gridworld. Subsequently, we evaluate our prioritization scheme on the Atari suite, outperforming quantile regression deep Q-learning benchmarks; thus forging a path for the use of epistemic uncertainty prioritized replay in reinforcement learning agents.
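
A minimal sketch of how such priorities might be computed, assuming disagreement across a Q-ensemble as the epistemic-uncertainty proxy (the paper's estimator and sampling scheme may differ):

```python
import numpy as np

def epistemic_priorities(q_ensemble, states, actions):
    """Replay priorities from ensemble disagreement (illustrative).

    q_ensemble: callables, each mapping (states, actions) -> Q-value array.
    High disagreement indicates reducible (epistemic) uncertainty, so noisy
    but already well-estimated transitions are not over-sampled.
    """
    qs = np.stack([q(states, actions) for q in q_ensemble])  # (members, batch)
    return qs.std(axis=0)

def sample_replay_indices(priorities, batch_size, alpha=0.6, rng=None):
    rng = rng or np.random.default_rng()
    p = priorities ** alpha
    return rng.choice(len(priorities), size=batch_size, p=p / p.sum())
```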

135Replay can provably increase forgetting

[openreview] [pdf]

Abstract Continual learning seeks to enable machine learning systems to solve an increasing corpus of tasks sequentially. A critical challenge for continual learning is forgetting, where the performance on previously learned tasks decreases as new tasks are introduced. One of the commonly used techniques to mitigate forgetting, sample replay, has been shown empirically to reduce forgetting by retaining some examples from old tasks and including them in new training episodes. In this work, we provide a theoretical analysis of sample replay in an over-parameterized continual linear regression setting, where given enough replay samples, one would be able to eliminate forgetting. Our analysis focuses on replaying a few examples and highlights the role of the replay samples and task subspaces. Surprisingly, we find that forgetting can be non-monotonic with respect to the number of replay samples. We construct tasks where replay of a single example can increase forgetting and even distributions where replay of a randomly selected sample increases forgetting on average. We provide empirical evidence that this is a property of the tasks rather than the model used to train on them, by showing a similar behavior for a neural net equipped with SGD. Through experiments on a commonly used benchmark, we provide additional evidence that performance of the replay heavily depends on the choice of replay samples and the relationship between tasks.

136PFDiff: Training-free Acceleration of Diffusion Models through the Gradient Guidance of Past and Future

[openreview] [pdf]

Abstract Diffusion Probabilistic Models (DPMs) have shown remarkable potential in image generation, but their sampling efficiency is hindered by the need for numerous denoising steps. Most existing solutions accelerate the sampling process by proposing fast ODE solvers. However, the inevitable discretization errors of the ODE solvers are significantly magnified when the number of function evaluations (NFE) is small. In this work, we propose PFDiff, a novel training-free and orthogonal timestep-skipping strategy, which enables existing fast ODE solvers to operate with fewer NFE. Specifically, PFDiff initially utilizes gradient replacement from past time steps to predict a “springboard”. Subsequently, it employs this “springboard” along with foresight updates inspired by Nesterov momentum to rapidly update current intermediate states. This approach effectively reduces unnecessary NFE while correcting for discretization errors inherent in first-order ODE solvers. Experimental results demonstrate that PFDiff exhibits flexible applicability across various pre-trained DPMs, particularly excelling in conditional DPMs and surpassing previous state-of-the-art training-free methods. For instance, using DDIM as a baseline, we achieved 16.46 FID (4 NFE) compared to 138.81 FID with DDIM on ImageNet 64x64 with classifier guidance, and 13.06 FID (10 NFE) on Stable Diffusion with 7.5 guidance scale.

137Efficient Discovery of Pareto Front for Multi-Objective Reinforcement Learning

[openreview] [pdf]

Abstract Multi-objective reinforcement learning (MORL) excels at handling rapidly changing preferences in tasks that involve multiple criteria, even for unseen preferences. However, previously dominant MORL methods typically generate a fixed policy set or a preference-conditioned policy through multiple training iterations exclusively for sampled preference vectors, and cannot ensure the efficient discovery of the Pareto front. Furthermore, integrating preferences into the input of policy or value functions presents scalability challenges, in particular as the dimensions of the state and preference spaces grow, which can complicate the learning process and hinder the algorithm’s performance on more complex tasks. To address these issues, we propose a two-stage Pareto front discovery algorithm called Constrained MORL (C-MORL), which serves as a seamless bridge between constrained policy optimization and MORL. Concretely, a set of policies is trained in parallel in the initialization stage, with each optimized towards its individual preference over the multiple objectives. Then, to fill the remaining vacancies in the Pareto front, the constrained optimization steps are employed to maximize one objective while constraining the other objectives to exceed a predefined threshold. Empirically, compared to recent advancements in MORL methods, our algorithm achieves more consistent and superior performances in terms of hypervolume, expected utility, and sparsity on both discrete and continuous control tasks, especially with numerous objectives (up to nine objectives in our experiments).

138A Study of Posterior Stability for Time-Series Latent Diffusion

[openreview] [pdf]

Abstract Latent diffusion has demonstrated promising results in image generation and permits efficient sampling. However, this framework might suffer from the problem of posterior collapse when applied to time series. In this paper, we first show that posterior collapse will reduce latent diffusion to a variational autoencoder (VAE), making it less expressive. This highlights the importance of addressing this issue. We then introduce a principled method: dependency measure, that quantifies the sensitivity of a recurrent decoder to input variables. Using this tool, we confirm that posterior collapse significantly affects time-series latent diffusion on real datasets, and a phenomenon termed dependency illusion is also discovered in the case of shuffled time series. Finally, building on our theoretical and empirical studies, we introduce a new framework that extends latent diffusion and has a stable posterior. Extensive experiments on multiple real time-series datasets show that our new framework is free from posterior collapse and significantly outperforms previous baselines in time series synthesis.

139Diffusion-PINN Sampler

[openreview] [pdf]

Abstract Recent success of diffusion models has inspired a surge of interest in developing sampling techniques using reverse diffusion processes. However, accurately estimating the drift term in the reverse stochastic differential equation (SDE) solely from the unnormalized target density poses significant challenges, hindering existing methods from achieving state-of-the-art performance. In this paper, we introduce the Diffusion-PINN Sampler (DPS), a novel diffusion-based sampling algorithm that estimates the drift term by solving the governing partial differential equation of the log-density of the underlying SDE marginals via physics-informed neural networks (PINN). We prove that the error of log-density approximation can be controlled by the PINN residual loss, enabling us to establish convergence guarantees of DPS. Experiments on a variety of sampling tasks demonstrate the effectiveness of our approach, particularly in accurately identifying mixing proportions when the target contains isolated components.

140LLM Pruning and Distillation in Practice

[openreview] [pdf]

Abstract Structured pruning with knowledge distillation is a potent combination for obtaining small language models (SLMs) with significantly fewer training tokens and compute resources compared to training from scratch. In this work, we investigate how this strategy can be effectively applied in instances where access to the original pretraining dataset is restricted. We introduce a new teacher correction phase before distillation which lets the teacher model adjust to our specific data distribution using a lightweight fine-tuning phase. We apply this strategy to compress the Mistral NeMo 12B and Llama 3.1 8B models to 8B and 4B parameters, respectively, using pruning and distillation. We explore two distinct pruning strategies: (1) depth pruning and (2) joint hidden/attention/MLP (width) pruning, and evaluate the results on common benchmarks from the LM Evaluation Harness. The models are then aligned with NeMo Aligner and further tested for instruction following, role-play, math, coding and function calling capabilities. This approach produces the state-of-the-art Mistral-NeMo-Compressed-8B model from Mistral NeMo 12B, and a compelling 4B model from Llama 3.1 8B.

141Constant Rate Schedule: Constant-Rate Distributional Change for Efficient Training and Sampling in Diffusion Models

[openreview] [pdf]

Abstract We propose a noise schedule that ensures a constant rate of change in the probability distribution of diffused data throughout the diffusion process. To obtain this noise schedule, we measure the rate of change in the probability distribution of the forward process and use it to determine the noise schedule before training diffusion models. The functional form of the noise schedule is automatically determined and tailored to each dataset and type of diffusion model. We evaluate the effectiveness of our noise schedule on unconditional and class-conditional image generation tasks using the LSUN (bedroom/church/cat/horse), ImageNet, and FFHQ datasets. Through extensive experiments, we confirmed that our noise schedule broadly improves the performance of the diffusion models regardless of the dataset, sampler, number of function evaluations, or type of diffusion model.

142Agential AI for integrated continual learning, deliberative behavior, and comprehensible models

[openreview] [pdf]

Abstract The contemporary machine learning paradigm excels in statistical data analysis, solving problems that classical AI could not. However, it faces key limitations, such as a lack of integration with planning, incomprehensible internal structures, and an inability to learn continually without erasing prior knowledge. We present an initial design for an AI system, Agential AI (AAI), which in principle operates independently of or on top of statistical methods and overcomes all these issues. AAI’s core is a learning method that models temporal dynamics with guarantees of completeness, minimality, and continual learning. It integrates this with a behavior algorithm that plans on a learned model and encapsulates high-level behavior patterns. Preliminary experiments on a simple abstract environment show AAI’s effectiveness and future potential.

143Moonwalk: Inverse-Forward Differentiation

[openreview] [pdf]

Abstract Backpropagation, while effective for gradient computation, falls short in addressing memory consumption, limiting scalability. This work explores forward-mode gradient computation as an alternative in invertible and right-invertible networks, showing its potential to reduce the memory footprint without substantial drawbacks. We introduce a novel technique based on a vector-inverse-Jacobian product that accelerates the computation of forward gradients while retaining the advantages of memory reduction and preserving the fidelity of true gradients. Our method, Moonwalk, has a time complexity linear in the depth of the network, unlike the quadratic time complexity of naïve forward-mode differentiation, and empirically reduces computation time by several orders of magnitude without allocating more memory. We further accelerate Moonwalk by combining it with reverse-mode differentiation to achieve time complexity comparable with backpropagation while maintaining a much smaller memory footprint. Finally, we showcase the robustness of our method across several architecture choices. Moonwalk is the first forward-based method to compute true gradients in invertible and right-invertible networks in computation time comparable to backpropagation and using significantly less memory.

144Commute Your Domains: Trajectory Optimality Criterion for Multi-Domain Learning

[openreview] [pdf]

Abstract In multi-domain learning, a single model is trained on diverse data domains to leverage shared knowledge and improve generalization. The order in which the data from these domains is used for training can significantly affect the model’s performance on each domain. However, this dependence is under-studied. In this paper, we investigate the influence of training order (or data mixing) in multi-domain learning using the concept of Lie bracket of gradient vector fields. By analyzing the infinitesimal effects of changing the training order, we identify regions in the parameter space where altering the order between two training domains can benefit the target loss. We validate the predictions of our theoretical framework on the influence of training order (or data mixing) both on a toy example and bilingual LLM pre-training.
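
For reference, the Lie bracket in question has the standard form below; for per-domain gradient fields it reduces to a difference of Hessian-vector products (a textbook identity, restated here rather than taken from the paper).

```latex
% Lie bracket of two vector fields f, g on parameter space:
\[
  [f, g](\theta) = J_g(\theta)\, f(\theta) - J_f(\theta)\, g(\theta).
\]
% With gradient-flow fields f = -\nabla L_A and g = -\nabla L_B for the two
% domains, this becomes a difference of Hessian-vector products:
\[
  [f, g](\theta) = \nabla^2 L_B(\theta)\, \nabla L_A(\theta)
                 - \nabla^2 L_A(\theta)\, \nabla L_B(\theta),
\]
% so the order of two infinitesimal domain updates matters exactly where
% this bracket is nonzero.
```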

145Last-Iterate Convergence of Smooth Regret Matching+ Variants in Learning Nash Equilibria

[openreview] [pdf]

Abstract Regret Matching+ (RM+) variants have been widely used to develop superhuman Poker AIs, yet few studies investigate their last-iterate convergence. Their last-iterate convergence has been demonstrated only for games with strong monotonicity or two-player zero-sum matrix games. A primary obstacle in proving the last-iterate convergence for these algorithms is that their feedback is not the loss gradient of the vanilla games. This deviation results in the absence of crucial properties, e.g., monotonicity or the weak Minty variation inequality (MVI), which are pivotal for establishing the last-iterate convergence. To address the absence of these properties, we propose a remarkably succinct yet novel proof paradigm that consists of: (i) recovering these key properties through the equivalence between RM+ and Online Mirror Descent (OMD), and (ii) measuring the distance to Nash equilibrium (NE) via the tangent residual to show that this distance is related to the distance between accumulated regrets. To show the practical applicability of our proof paradigm, we use it to prove the last-iterate convergence of two existing smooth RM+ variants, Smooth Extra-gradient RM+ (SExRM+) and Smooth Predictive RM+ (SPRM+). We show that they achieve last-iterate convergence in learning an NE of games satisfying monotonicity, a weaker condition than the one used in existing proofs for both variants. Then, inspired by our proof paradigm, we propose Smooth Optimistic Gradient RM+ (SOGRM+). We show that SOGRM+ achieves last-iterate convergence in learning an NE of games satisfying the weak MVI, the weakest condition in all known proofs for RM+ variants. The experimental results show that SOGRM+ significantly outperforms other algorithms.

146Guided Reinforcement Learning with Roll-Back

[openreview] [pdf]

Abstract Reinforcement learning-based solutions are increasingly being considered as strong alternatives to classical system controllers, despite their significant sample inefficiency when learning controller tasks from scratch. Many methods that address this issue use prior task knowledge to guide the agent’s learning, with several recent algorithms providing a guide policy that is sometimes chosen to execute actions instead of the learner policy. While this approach lends excellent flexibility as it allows the guide knowledge to be provided in any format, it can be challenging to decide when and for how long to use the guide agent. Current guide policy-based approaches typically choose a static guide sampling rate empirically, and do not vary it. Approaches that transfer control use simple methods like linear decay, or require hyperparameter choices that strongly impact the performance. We show that under certain assumptions, the sampling rate of the guide policy can be calculated to guarantee that the mean return of the learning policy will surpass a user-defined performance degradation threshold. To the best of our knowledge, this is the first time a performance guarantee has been established for a guided RL method. We then implement a guided RL (GRL) algorithm that can make use of this sample rate, and additionally introduce a roll-back feature in guided RL with roll-back (GRL-RB) to adaptively balance the trade-off between performance degradation and rapid transfer of control to the learner. Our approach is simple to implement on top of existing algorithms, robust to hyperparameter choices, and effective in warm-starting online learning.
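
The control-mixing mechanism being tuned can be sketched as follows; a constant guide_rate stands in for the derived, guarantee-preserving sampling rate that is the paper's actual contribution.

```python
import numpy as np

def mixed_action(state, guide_policy, learner_policy, guide_rate, rng=None):
    """Execute the guide's action with probability guide_rate (illustrative).

    In the paper the rate is computed from a performance-degradation bound
    and adapted with roll-back; a fixed value is used here for clarity.
    """
    rng = rng or np.random.default_rng()
    chosen = guide_policy if rng.random() < guide_rate else learner_policy
    return chosen(state)
```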

147On Rollouts in Model-Based Reinforcement Learning

[openreview] [pdf]

Abstract Model-based reinforcement learning (MBRL) seeks to enhance data efficiency by learning a model of the environment and generating synthetic rollouts from it. However, accumulated model errors during these rollouts can distort the data distribution, negatively impacting policy learning and hindering long-term planning. Thus, the accumulation of model errors is a key bottleneck in current MBRL methods. We propose Infoprop, a model-based rollout mechanism that separates aleatoric from epistemic model uncertainty and reduces the influence of the latter on the data distribution. Further, Infoprop keeps track of accumulated model errors along a model rollout and provides termination criteria to limit data corruption. We demonstrate the capabilities of Infoprop in the Infoprop-Dyna algorithm, reporting state-of-the-art performance in Dyna-style MBRL on common MuJoCo benchmark tasks while substantially increasing rollout length and data quality.

148Multi-Student Diffusion Distillation for Better One-Step Generators

[openreview] [pdf]

Abstract Diffusion models achieve high-quality sample generation at the cost of a lengthy multistep inference procedure. To overcome this, diffusion distillation techniques produce student generators capable of matching or surpassing the teacher in a single step. However, the student model’s inference speed is limited by the size of the teacher architecture, preventing real-time generation for computationally heavy applications. In this work, we introduce Multi-Student Distillation (MSD), a framework to distill a conditional teacher diffusion model into multiple single-step generators. Each student generator is responsible for a subset of possible conditioning data, thereby obtaining higher generation quality for the same capacity. MSD trains multiple distilled students, allowing smaller sizes and, therefore, faster inference. Also, MSD offers a lightweight quality boost over single-student distillation with the same architecture. We demonstrate MSD is effective by training multiple same-sized or smaller students on single-step distillation using distribution matching and adversarial distillation techniques. With smaller students, MSD obtains competitive results with a faster inference time for single-step generation. Using same-sized students, MSD with 4 students sets new state-of-the-art results for one-step image generation: FID 1.20 on ImageNet-64×64 and 8.20 on zero-shot COCO2014.

149The Superposition of Diffusion Models

[openreview] [pdf]

Abstract The undeniable success of deep generative models for learning complex and high-dimensional data distributions has led to the proliferation of large-scale diffusion models across the entire machine-learning application spectrum. This Cambrian explosion of easily accessible pre-trained models, including fine-tuned open-source models on user-specific data, suggests a demand for methods that combine multiple different pre-trained models without incurring the significant computational burden of re-training a larger combined model. In this paper, we cast the problem of combining multiple pre-trained diffusion models at the generation stage under a novel proposed framework termed superposition. Theoretically, we derive superposition from rigorous first principles stemming from the celebrated continuity equation and design two novel algorithms tailor-made for combining diffusion models in SuperDiff. We demonstrate that SuperDiff is scalable to large pre-trained diffusion models as superposition is performed solely through composition during inference, and also enjoys painless implementation as it combines different pre-trained vector fields through an automated re-weighting scheme. Notably, we show that SuperDiff is efficient during inference time, and mimics traditional composition operators such as the logical OR and the logical AND. We empirically demonstrate the utility of using SuperDiff for generating more diverse images on CIFAR-10, more faithful prompt-conditioned image editing using Stable Diffusion, and improved unconditional de novo structure design of proteins.
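
A minimal sketch of the inference-time combination under simplifying assumptions: fixed convex weights stand in for the paper's automated re-weighting scheme, and each entry of drifts is a pre-trained model's vector field evaluated at the current state and time.

```python
import numpy as np

def superposed_drift(x, t, drifts, weights):
    """Convex combination of pre-trained diffusion vector fields (illustrative).

    drifts: callables (x, t) -> drift array, one per pre-trained model.
    Equal weights roughly mimic sampling from the mixture of the models
    (an OR-like composition); the paper's re-weighting is automated.
    """
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    return sum(wi * d(x, t) for wi, d in zip(w, drifts))
```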

150Expected Return Symmetries

[openreview] [pdf]

Abstract Symmetry is an important inductive bias that can improve model robustness and generalization across many deep learning domains. In multi-agent settings, a priori known symmetries have been shown to address a fundamental coordination failure mode known as mutually incompatible symmetry breaking; e.g. in a game where two independent agents can choose to move “left” or “right”, and where a reward of +1 or -1 is received when the agents choose the same action or different actions, respectively. However, the efficient and automatic discovery of environment symmetries, in particular for decentralized partially observable Markov decision processes, remains an open problem. Furthermore, environmental symmetry breaking constitutes only one type of coordination failure, which motivates the search for a more accessible and broader symmetry class. In this paper, we introduce such a broader group of previously unexplored symmetries, which we call expected return symmetries, which contains environment symmetries as a subgroup. We show that agents trained to be compatible under the group of expected return symmetries achieve better zero-shot coordination results than those using environment symmetries. As an additional benefit, our method makes minimal a priori assumptions about the structure of their environment and does not require access to ground truth symmetries.

151Avoiding mode collapse in diffusion models fine-tuned with reinforcement learning

[openreview] [pdf]

Abstract Fine-tuning foundation models via reinforcement learning (RL) has proven promising for aligning to downstream objectives. In the case of diffusion models (DMs), though RL training improves alignment from early timesteps, critical issues such as training instability and mode collapse arise. We address these drawbacks by exploiting the hierarchical nature of DMs: we train them dynamically at each epoch with a tailored RL method, allowing for continual evaluation and step-by-step refinement of the model performance (or alignment). Furthermore, we find that not every denoising step needs to be fine-tuned to align DMs to downstream tasks. Consequently, in addition to clipping, we regularise model parameters at distinct learning phases via a sliding-window approach. Our approach, termed Hierarchical Reward Fine-tuning (HRF), is validated on the Denoising Diffusion Policy Optimisation method, where we show that models trained with HRF achieve better preservation of diversity in downstream tasks, thus enhancing fine-tuning robustness without compromising mean rewards.

152Latent Diffusion Planning for Imitation Learning

[openreview] [pdf]

Abstract Recent progress in robotic imitation learning has been enabled by policy architectures that scale to complex visuomotor tasks, multimodal distributions, and large datasets. However, these methods rely on supervised learning of actions from expert demonstrations, which can be challenging to scale. We propose Latent Diffusion Planning (LDP), which forecasts future states as well as actions via diffusion. This objective can scalably leverage heterogeneous data sources and provides a denser supervision signal for learning. To plan over images, we learn a compact latent space through a variational autoencoder. We then train a planner to forecast future latent states, and an inverse dynamics model to extract actions from the plans. As planning is separated from action prediction, LDP can leverage suboptimal or action-free data to improve performance in low demonstration regimes. On simulated visual robotic manipulation tasks, LDP outperforms state-of-the-art imitation learning approaches, as they cannot leverage such additional data.

153DuRND: Rewarding from Novelty to Contribution for Reinforcement Learning via Dual Random Networks Distillation

[openreview] [pdf]

Abstract Existing reward shaping techniques for sparse-reward tasks in reinforcement learning generally fall into two categories: novelty-based exploration bonuses and value-based rewards. The former encourages agents to explore less visited areas but can divert them from their main objectives, while the latter promotes stable late-stage convergence but often lacks sufficient early exploration. To combine the benefits of both, we propose Dual Random Networks Distillation (DuRND), a novel framework integrating two lightweight random network modules. These modules jointly generate two rewards: a novelty reward to drive exploration and a contribution reward to evaluate progress toward desired behaviors, achieving an efficient balance between exploration and exploitation. With low computational overhead, DuRND excels in high-dimensional environments like Atari, VizDoom, and MiniWorld, outperforming several benchmarks.
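
The random-network-distillation building block that the two modules presumably share can be sketched as below; how DuRND turns a pair of such modules into separate novelty and contribution rewards is the paper's contribution and is not reproduced here.

```python
import torch
import torch.nn as nn

def make_rnd(obs_dim, feat_dim=64):
    """A standard RND pair: frozen random target + trainable predictor."""
    target = nn.Sequential(nn.Linear(obs_dim, 128), nn.ReLU(),
                           nn.Linear(128, feat_dim))
    for p in target.parameters():
        p.requires_grad_(False)  # the target stays random and fixed
    predictor = nn.Sequential(nn.Linear(obs_dim, 128), nn.ReLU(),
                              nn.Linear(128, feat_dim))
    return target, predictor

def rnd_bonus(target, predictor, obs):
    """Per-state bonus: the predictor's error against the frozen target.

    The same squared error also serves as the predictor's training loss.
    """
    with torch.no_grad():
        t = target(obs)
    return ((predictor(obs) - t) ** 2).mean(dim=-1)
```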

154Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

[openreview] [pdf]

Abstract Predictable behavior from scaling advanced AI systems is an extremely desirable property for engineers, companies, economists and governments alike, and while a well-established literature exists on how pretraining performance scales, predictable scaling behavior on downstream capabilities remains elusive. While many factors are certainly responsible, this paper shines a light on a significant factor that makes predicting scaling behavior on widely used multiple-choice question answering benchmarks challenging and illuminates a path towards making such downstream evaluations predictable with scale. Using five model families and twelve well-established multiple-choice benchmarks, we show that downstream performance is computed from negative log likelihoods via a sequence of transformations that progressively degrades the statistical relationship between performance and scale. We then reveal the mechanism causing this degradation: downstream metrics require comparing the correct choice against a small number of specific incorrect choices, meaning accurately predicting downstream capabilities requires predicting not just how probability mass concentrates on the correct choice with scale, but also how probability mass fluctuates on specific incorrect choices with scale. We empirically study how probability mass on the correct choice co-varies with probability mass on incorrect choices with increasing compute, suggesting that scaling laws for \textit{incorrect} choices might be achievable. Our work also explains why pretraining scaling laws are commonly regarded as more predictable than downstream capabilities and contributes towards establishing scaling-predictable evaluations of frontier AI models.

155State Combinatorial Generalization In Decision Making With Conditional Diffusion Models

[openreview] [pdf]

Abstract Many real-world decision-making problems are combinatorial in nature, where states (e.g., surrounding traffic of a self-driving car) can be seen as a combination of basic elements (e.g., pedestrians, trees, and other cars). Due to combinatorial complexity, observing all combinations of basic elements in the training set is infeasible, which leads to an essential yet understudied problem of zero-shot generalization to states that are unseen combinations of previously seen elements. In this work, we first formalize this problem and then demonstrate how existing value-based reinforcement learning (RL) algorithms struggle due to unreliable value predictions in unseen states. We argue that this problem cannot be addressed with exploration alone, but requires more expressive and generalizable models. We demonstrate that behavior cloning with a conditioned diffusion model trained on expert trajectories generalizes better to states formed by new combinations of seen elements than traditional RL methods. Through experiments in maze, driving, and multiagent environments, we show that conditioned diffusion models outperform traditional RL techniques and highlight the broad applicability of our problem formulation.

156Scaling Concept With Text-Guided Diffusion Models

[openreview] [pdf]

Abstract Text-guided diffusion models have revolutionized generative tasks by producing high-fidelity content based on text descriptions. Additionally, they have enabled an editing paradigm where concepts can be replaced through text conditioning. In this work, we explore a novel paradigm: instead of replacing a concept, can we scale it? We conduct an empirical study to investigate concept decomposition trends in text-guided diffusion models. Leveraging these insights, we propose a simple yet effective method, ScalingConcept, designed to enhance or suppress existing concepts in real input without introducing new ones. To systematically evaluate our method, we introduce the WeakConcept-10 dataset. More importantly, ScalingConcept enables a range of novel zero-shot applications across both image and audio domains, including but not limited to canonical pose generation and generative sound highlighting/removal.

157Distributional Sobolev reinforcement learning

[openreview] [pdf]

Abstract Distributional reinforcement learning (DRL) is a framework for learning a complete distribution over returns, rather than merely estimating expectations. In this paper, we further expand DRL by estimating a distribution over the gradient of the state-action value function, in addition to its scalar value. We refer to this method as Distributional Sobolev training. Inspired by Stochastic Value Gradients (SVG), we achieve this by leveraging a one-step world model of the reward and transition distributions implemented using a conditional Variational Autoencoder (cVAE). Our approach is sample-based and relies on Maximum Mean Discrepancy (MMD) to instantiate the distributional Bellman operator. We first showcase the method on a toy supervised learning problem. We then validate our algorithm in several Mujoco/Brax environments.
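
For concreteness, here is a standard (biased) sample estimator of squared MMD with an RBF kernel, i.e. the kind of discrepancy that could instantiate the distributional Bellman loss; the kernel and bandwidth choice are assumptions.

```python
import numpy as np

def rbf_mmd2(x, y, sigma=1.0):
    """Biased estimator of squared MMD between sample sets x, y of shape (n, d)."""
    def gram(a, b):
        d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(-1)
        return np.exp(-d2 / (2.0 * sigma ** 2))
    return gram(x, x).mean() + gram(y, y).mean() - 2.0 * gram(x, y).mean()

rng = np.random.default_rng(0)
a, b = rng.normal(0.0, 1.0, (256, 2)), rng.normal(0.5, 1.0, (256, 2))
print(rbf_mmd2(a, b))  # larger when the two sample distributions differ
```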

158Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift

[openreview] [pdf]

Abstract Reinforcement learning from human feedback (RLHF) aligns Large Language Models (LLMs) with human preferences. However, these preferences can often change over time due to external factors (e.g. environment change and societal influence). Consequently, what was wrong then might be right now. Current preference optimization algorithms do not account for temporal preference drift in their modeling, which can lead to severe misalignment. To address this limitation, we use a Dynamic Bradley-Terry model that models preferences via time-dependent reward functions, and propose Non-Stationary Direct Preference Optimisation (NS-DPO). By introducing a discount parameter in the loss function, NS-DPO applies exponential weighting, which proportionally focuses learning on more time-relevant datapoints. We theoretically analyse the convergence of NS-DPO in the offline setting, providing upper bounds on the estimation error caused by non-stationary preferences. Finally, we demonstrate the effectiveness of NS-DPO for fine-tuning LLMs in scenarios with drifting preferences. By simulating preference drift using renowned reward models and modifying popular LLM datasets accordingly, we show that NS-DPO fine-tuned LLMs remain robust under non-stationarity, significantly outperforming baseline algorithms that ignore temporal preference changes, without sacrificing performance in stationary cases.
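
One plausible form of the discounted objective, with an exponential weight γ^(T−t_i) applied on top of the standard DPO loss (notation is assumed here, not quoted from the paper):

```latex
% NS-DPO-style loss: each preference pair i, collected at time t_i, is
% down-weighted by \gamma^{T - t_i} relative to the current time T.
\[
  \mathcal{L}(\theta) = -\sum_i \gamma^{\,T - t_i}
    \log \sigma\!\Bigl(
      \beta \log \tfrac{\pi_\theta(y_w^i \mid x^i)}{\pi_{\mathrm{ref}}(y_w^i \mid x^i)}
      - \beta \log \tfrac{\pi_\theta(y_l^i \mid x^i)}{\pi_{\mathrm{ref}}(y_l^i \mid x^i)}
    \Bigr), \qquad 0 < \gamma < 1 .
\]
```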

159On the Generalization of Preference Learning with DPO

[openreview] [pdf]

Abstract Large language models (LLMs) have demonstrated remarkable capabilities but often struggle to align with human preferences, leading to harmful or undesirable outputs. Preference learning, which trains models to distinguish between preferred and non-preferred responses based on human feedback, has become a crucial component for ensuring that LLMs align with human values. Despite the widespread adoption in real-world systems, a thorough theoretical understanding of the generalization guarantees for these models remains lacking. This paper bridges that gap by introducing a new theoretical framework to analyze the generalization guarantees of models trained with direct preference optimization. While existing generalization theory often focuses on overparameterized models achieving near-optimal loss or models independent of the training process, our framework rigorously assesses how well models generalize after a finite number of gradient steps, reflecting real-world LLM training practices. By analyzing the reward margin associated with each sample and its trajectory throughout training, we can effectively bound the generalization error. We derive learning guarantees showing that, under specific conditions, models trained with DPO can correctly discern preferred responses on unseen data with high probability. These insights are empirically validated on contemporary LLMs, underscoring the practical relevance of our theory.

160Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models

[openreview] [pdf]

Abstract Diffusion models have emerged as powerful generative frameworks by progressively adding noise to data through a forward process and then reversing this process to generate realistic samples. While these models have achieved strong performance across various tasks and modalities, their application to temporal predictive learning remains underexplored. Existing approaches treat predictive learning as a conditional generation problem, but often fail to fully exploit the temporal dynamics inherent in the data, leading to challenges in generating temporally coherent sequences. To address this, we introduce Dynamical Diffusion (DyDiff), a theoretically sound framework that incorporates temporally aware forward and reverse processes. Dynamical Diffusion explicitly models temporal transitions at each diffusion step, establishing dependencies on preceding states to better capture temporal dynamics. Through the reparameterization trick, Dynamical Diffusion achieves efficient training and inference similar to any standard diffusion model. Extensive experiments across scientific spatiotemporal forecasting, video prediction, and time series forecasting demonstrate that Dynamical Diffusion consistently improves performance in temporal predictive tasks, filling a crucial gap in existing methodologies.

161Rethinking Knowledge Distillation: A Mixture-of-Experts Perspective

[openreview] [pdf]

Abstract Knowledge distillation (KD) aims to transfer useful information from a large-scale model (teacher) to a lightweight model (student). Classical KD focuses on leveraging the teacher’s predictions as soft labels to regularize student training. However, the exact match of predictions in Kullback-Leibler (KL) divergence could be somewhat in conflict with the classification objective, given that the distribution discrepancies between teacher-generated predictions and ground-truth annotations tend to be fairly severe. In this paper, we rethink the role of teacher predictions from a Mixture-of-Experts (MoE) perspective and transfer knowledge by introducing teacher predictions as latent variables to reformulate the classification objective. This MoE strategy results in breaking down the vanilla classification task into a mixture of easier subtasks with the teacher classifier as a gating function to weigh the importance of subtasks. Each subtask is efficiently conquered by distinct experts that are effectively implemented by resorting to multi-level teacher outputs. We further develop a theoretical framework to formulate our method, termed MoE-KD, as an Expectation-Maximization (EM) algorithm and provide proof of the convergence. Extensive experiments manifest that MoE-KD outperforms advanced knowledge distillers on mainstream benchmarks.

162A Distributional Approach to Uncertainty-Aware Preference Alignment Using Offline Demonstrations

[openreview] [pdf]

Abstract Designing reward functions in Reinforcement Learning (RL) often demands significant task-specific expertise. Offline preference-based Reinforcement Learning (PbRL) provides an effective alternative to address the complexity of reward design by learning policies from offline datasets that contain human preferences between trajectory pairs. Existing offline PbRL studies typically model a reward function by maximizing its likelihood of generating the observed human preferences. However, due to the varying number of samples within the limited dataset, less frequently compared trajectories exhibit greater uncertainty, which potentially leads to unreliable behaviors during reward and policy updates. To solve this issue, in this work, we introduce Uncertainty-Aware PbRL (UA-PbRL) to learn a distributional reward model and a risk-sensitive policy from an offline preference dataset. Our approach employs a Maximum A Posteriori (MAP) objective to update trajectory rewards and incorporates an informative prior to account for the uncertainties. Building upon this reward update, we propose a generative reward model to capture the reward distribution, utilizing the offline distributional Bellman operator and the Conditional Value-at-Risk (CVaR) metric to train a risk-sensitive policy. Experimental results demonstrate that UA-PbRL effectively identifies and avoids states with high uncertainty, facilitating risk-averse behaviors across various tasks, including robot control and language model alignment.

163Counterfactual History Distillation on Continuous-time Event Sequences

[openreview] [pdf]

Abstract This study aims to distill history events that have essential information for predicting subsequent events with counterfactual analysis. The problem is named Counterfactual History Distillation (CHD). CHD distills a minimum set of events from history, based on which the distribution provided by a trained marked temporal point process (MTPP) model fits the events observed later, while the distribution based on the remaining events in history does not. It can help understand which event marks may have more influence on the occurrence of future events and which events in history may have a causal relationship with the events observed later. This study proposes a robust solution for CHD, called MTPP-based Counterfactual History Distiller (MTPP-CHD). MTPP-CHD learns to select the optimal event combination from history for the events observed later. Experiment results demonstrate the superiority of MTPP-CHD by outperforming baselines in terms of distillation quality and processing speed.

164Orient Anything

[openreview] [pdf]

Abstract Orientation estimation is a fundamental task in 3D shape analysis which consists of estimating a shape’s orientation axes: its side-, up-, and front-axes. Using this data, one can rotate a shape into canonical orientation, where its orientation axes are aligned with the coordinate axes. Developing an orientation algorithm that reliably estimates complete orientations of general shapes remains an open problem. We introduce a two-stage orientation pipeline that achieves state-of-the-art performance on up-axis estimation and further demonstrate its efficacy on full-orientation estimation, where one seeks all three orientation axes. Unlike previous work, we train and evaluate our method on all of ShapeNet rather than a subset of classes. We motivate our engineering contributions by theory describing fundamental obstacles to orientation estimation for rotationally-symmetric shapes, and show how our method avoids these obstacles.

165Diversifying Spurious Subgraphs for Graph Out-of-Distribution Generalization

[openreview] [pdf]

Abstract Environment augmentation methods have gained some success in overcoming the out-of-distribution (OOD) generalization challenge in Graph Neural Networks (GNNs). Yet, there exists a challenging trade-off in the augmentation: On one hand, it requires the generated graphs to be as diverse as possible to extrapolate to unseen environments. On the other hand, it requires the generated graphs to preserve the invariant substructures causally related to the targets. Existing approaches have proposed various environment augmentation strategies to enrich spurious patterns for OOD generalization. However, we argue that these methods remain limited in diversity and precision of the generated environments for two reasons: i) the deterministic nature of the graph composition strategy used for environment augmentation may limit the diversity of the generated environments, and ii) the presence of spurious correlations may lead to the exclusion of invariant subgraphs and reduce the precision of the generated environments. To address this trade-off, we propose a novel paradigm that accurately identifies spurious subgraphs, and an environment augmentation strategy called spurious subgraph diversification, which extrapolates to maximally diversified spurious subgraphs by randomizing the spurious subgraph generation, while preserving the invariant substructures. Our method is theoretically sound and demonstrates strong empirical performance on both synthetic and real-world datasets, outperforming the second-best method by up to 24.19% across 17 baseline methods, underscoring its superiority in graph OOD generalization.

166Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment

[openreview] [pdf]

Abstract Aligning to human preferences and/or intentions is an important requirement for contemporary foundation models. To ensure alignment, popular approaches such as reinforcement learning with human feedback (RLHF) break down the task into three stages: (i) a model is computed with supervised fine-tuning (SFT) based upon large demonstration data, (ii) a reward model (RM) is estimated based upon human feedback data, and (iii) reinforcement learning (RL) is used to further refine the SFT model by optimizing the estimated reward model. Typically, the number of parameters in the reward model greatly exceeds the number of preference observations in the human feedback data. As a result, the reward model is likely inaccurate and the resulting policy model (fine-tuned with RL) may exhibit poor alignment performance. In this paper, we introduce a new approach, AIHF, in which reward and policy models are jointly trained by simultaneously leveraging demonstration and human feedback data. We introduce a tractable algorithm for finding the AIHF reward and policy models and provide a finite time performance guarantee. Additionally, we demonstrate the efficiency of the proposed solution with extensive experiments involving alignment problems in LLMs and robotic control problems in MuJoCo. We observe that the proposed solutions outperform the existing alignment algorithms such as RLHF and DPO by large margins, especially when the data is unbalanced.

167Training-Free Diffusion Model Alignment with Sampling Demons

[openreview] [pdf]

Abstract Aligning diffusion models with user preferences has been a key challenge. Existing methods for aligning diffusion models either require retraining or are limited to differentiable reward functions. To address these limitations, we propose a stochastic optimization approach, dubbed Demon, to guide the denoising process at inference time without backpropagation through reward functions or model retraining. Our approach works by controlling the noise distribution in denoising steps to concentrate density on regions corresponding to high rewards through stochastic optimization. We provide comprehensive theoretical and empirical evidence to support and validate our approach, including experiments that use non-differentiable sources of rewards such as Visual-Language Model (VLM) APIs and human judgments. To the best of our knowledge, the proposed approach is the first inference-time, backpropagation-free preference alignment method for diffusion models. Our method can be easily integrated with existing diffusion models without further training. Our experiments show that the proposed approach significantly improves the average aesthetics scores for text-to-image generation.
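A hedged sketch of the core loop such a backpropagation-free method implies: sample several noise candidates per denoising step and keep the transition whose predicted clean image scores highest under a reward that may be a black box. Here `denoise_step`, `predict_x0`, and `reward_fn` are placeholders, not the paper's API.

```python
import torch

@torch.no_grad()
def demon_like_step(x_t, t, denoise_step, predict_x0, reward_fn, n_candidates=8):
    """Hypothetical inference-time step in the spirit of Demon: try several
    noise candidates, score the implied clean images with a (possibly
    non-differentiable) reward, and keep the highest-reward transition."""
    best_x, best_r = None, -float("inf")
    for _ in range(n_candidates):
        z = torch.randn_like(x_t)
        x_next = denoise_step(x_t, t, z)        # one reverse-diffusion step
        r = reward_fn(predict_x0(x_next, t))    # score via the predicted x0
        if r > best_r:
            best_x, best_r = x_next, r
    return best_x
```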

168DDIL: Improved Diffusion Distillation with Imitation Learning

[openreview] [pdf]

Abstract Diffusion models excel at generative modeling (e.g., text-to-image), but sampling requires multiple passes through the denoising network, limiting practicality. Diffusion distillation methods have shown promise by reducing the number of passes, albeit at the expense of the diversity and quality of the generated samples. In this work we identify covariate shift, arising from compounding error at inference time, as one reason for the poor performance of multi-step distilled models. To address covariate shift, we formulate diffusion distillation within an imitation learning (DDIL) framework and enhance the training distribution for distilling diffusion models on both the data distribution (forward diffusion) and student-induced distributions (backward diffusion). Training on the data distribution helps to diversify the generations by preserving the marginal data distribution, while training on the student distribution addresses compounding error by correcting covariate shift. In addition, we adopt a reflected diffusion formulation for distillation and demonstrate improved performance and stable training across different distillation methods. We show that DDIL and the reflected diffusion formulation consistently improve on the baseline algorithms of progressive distillation (PD), latent consistency models (LCM), and Distribution Matching Distillation (DMD2).

169Medium-Difficulty Samples Constitute Smoothed Decision Boundary for Knowledge Distillation on Pruned Datasets

[openreview] [pdf]

Abstract This paper tackles a new problem of dataset pruning for Knowledge Distillation (KD), from a fresh perspective of Decision Boundary (DB) preservation and drifts. Existing dataset pruning methods generally assume that the post-pruning DB formed by the selected samples can be well-captured by future networks that use those samples for training. Therefore, they tend to preserve hard samples since hard samples are closer to the DB and better characterize the nuances in the distribution of the entire dataset. However, in KD, the limited learning capacity of the student network leads to imperfect preservation of the teacher’s feature distribution, resulting in the drift of DB in the student space. Specifically, hard samples worsen such drifts as they are difficult for the student to learn, creating a situation where the student’s DB can drift deeper into other classes and make incorrect classifications. Motivated by these findings, our method selects medium-difficulty samples for KD-based dataset pruning. We show that these samples constitute a smoothed version of the teacher’s DB and are easier for the student to learn, yielding a general feature distribution preservation for a class of samples and a reasonable DB between different classes for the student. In addition, to reduce the distributional shift due to dataset pruning, we leverage the class-wise distributional information of the teacher’s outputs to reshape the logits of the preserved samples. Experiments show that the proposed static pruning method can even perform better than the state-of-the-art dynamic pruning method, which needs access to the entire dataset. In addition, our method halves the training time of KD and improves the student’s accuracy by 0.4% on ImageNet with a 50% keep ratio. When the ratio further increases to 70%, our method achieves higher accuracy than vanilla KD while reducing the training time by 30%.
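As a toy illustration of the selection principle (not the authors' exact criterion), one could rank samples by the teacher's confidence on the true class and keep the middle band:

```python
import numpy as np

def select_medium_difficulty(teacher_probs, labels, keep_ratio=0.5):
    """Hypothetical pruning rule: rank samples by the teacher's confidence on
    the true class and keep the middle band, discarding both the easiest and
    hardest extremes as a proxy for 'medium difficulty'."""
    conf = teacher_probs[np.arange(len(labels)), labels]
    order = np.argsort(conf)                 # hardest -> easiest
    n_keep = int(keep_ratio * len(labels))
    start = (len(labels) - n_keep) // 2      # center the kept band
    return order[start:start + n_keep]       # indices of retained samples
```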

170RAGDP: Retrieve-Augmented Generative Diffusion Policy

[openreview] [pdf]

Abstract Diffusion Policy has attracted attention for its ability to achieve significant accuracy gains in a variety of imitation learning tasks. However, since Diffusion Policy relies on the Diffusion Model, it requires multiple denoising steps to generate a single action, leading to long generation times. To address this issue, methods like DDIM and Consistency Models have been introduced to speed up the process. While these methods reduce computation time, this often comes at the cost of accuracy. In this paper, we propose RAGDP, a technique designed to improve the efficiency of learned Diffusion Policies without sacrificing accuracy. RAGDP builds upon the Retrieval-Augmented Generation (RAG) technique, which is commonly used in large language models to store and retrieve data from a vector database based on encoded embeddings. In RAGDP, pairs of expert observations and actions are stored in a vector database. The system then searches the database using encoded observation data to retrieve expert action data with high similarity. This retrieved expert data is subsequently used by the RAGDP algorithm to generate actions tailored to the current environment. We introduce two action generation algorithms, RAGDP-VP and RAGDP-VE, which correspond to different types of Diffusion Models. Our results demonstrate that RAGDP can significantly improve the speed of Diffusion Policy without compromising accuracy. Furthermore, RAGDP can be integrated with existing speed-up methods to enhance their performance.
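A minimal sketch of the retrieval component, assuming cosine similarity over normalized observation embeddings; `ExpertMemory` is an illustrative stand-in, and a production system would use an approximate-nearest-neighbor index such as FAISS.

```python
import numpy as np

class ExpertMemory:
    """Minimal vector store for (observation embedding, expert action) pairs,
    an assumed stand-in for RAGDP's vector database."""
    def __init__(self, obs_embeddings, actions):
        norms = np.linalg.norm(obs_embeddings, axis=1, keepdims=True)
        self.keys = obs_embeddings / np.clip(norms, 1e-8, None)
        self.actions = actions

    def retrieve(self, query_embedding, k=5):
        """Return the k expert actions whose observations best match the query."""
        q = query_embedding / max(np.linalg.norm(query_embedding), 1e-8)
        sims = self.keys @ q                  # cosine similarity to all keys
        top = np.argsort(-sims)[:k]
        return self.actions[top], sims[top]
```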

171Provable Causal State Representation under Asynchronous Diffusion Model for POMDPs

[openreview] [pdf]

Abstract A major challenge in applying reinforcement learning (RL) to real-world scenarios is managing high-dimensional, noisy perception input signals. Identifying and utilizing representations that contain sufficient and essential information for decision-making tasks is key to computational efficiency and generalization of RL by reducing bias in decision-making processes. In this paper, we present a new RL framework, named Causal State Representation under Asynchronous Diffusion Model (CSR-ADM), which accommodates and enhances any RL algorithm for partially observable Markov decision processes (POMDPs) with perturbed inputs. A new asynchronous diffusion model is proposed to denoise both reward and observation spaces, and is integrated with bisimulation techniques to capture causal state representations in POMDPs. Notably, the causal state is the coarsest partition of the denoised observations. We link the causal state to a causal feature set and provide theoretical guarantees by deriving the upper bound on value function approximation between the noisy observation space and the causal state space, demonstrating equivalence to bisimulation under the Lipschitz assumption. To the best of our knowledge, CSR-ADM is the first framework to approximate causal states with diffusion models, substantiated by a comprehensive theoretical foundation. Extensive experiments on Roboschool tasks show that CSR-ADM outperforms state-of-the-art methods, significantly improving the robustness of existing RL algorithms under varying scales of random noise.

172Model predictive control is almost optimal for restless bandits

[openreview] [pdf]

Abstract We consider the discrete-time infinite-horizon average-reward restless Markovian bandit (RMAB) problem. We propose a model predictive control based non-stationary policy with a rolling computational horizon τ. At each time-slot, this policy solves a τ-horizon linear program whose first control value is kept as a control for the RMAB. Our solution requires minimal assumptions and quantifies the loss in optimality in terms of τ and the number of arms, $N$. We show that its sub-optimality gap is $O(1/\sqrt{N})$ in general, and $\exp(-\Omega(N))$ under a local-stability condition. Our proof is based on a framework from dynamic control known as dissipativity. Not only is our solution easy to implement, it also performs very well in practice when compared to the state of the art. Further, both our solution and our proof methodology can easily be generalized to more general constrained MDP settings and should thus be of great interest to the burgeoning RMAB community.

173Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models

[openreview] [pdf]

Abstract The machine learning community is increasingly recognizing the importance of fostering trust and safety in modern generative AI (GenAI) models. We posit machine unlearning (MU) as a crucial foundation for developing safe, secure, and trustworthy GenAI models. Traditional MU methods often rely on stringent assumptions and require access to real data. This paper introduces Score Forgetting Distillation (SFD), an innovative MU approach that promotes the forgetting of undesirable information in diffusion models by aligning the conditional scores of “unsafe” classes or concepts with those of “safe” ones. To eliminate the need for real data, our SFD framework incorporates a score-based MU loss into the score distillation objective of a pretrained diffusion model. This serves as a regularization term that preserves desired generation capabilities while enabling the production of synthetic data through a one-step generator. Our experiments on pretrained label-conditional and text-to-image diffusion models demonstrate that our method effectively accelerates the forgetting of target classes or concepts during generation, while preserving the quality of other classes or concepts. This unlearned and distilled diffusion model not only pioneers a novel concept in MU but also accelerates the generation speed of diffusion models. Our experiments and studies on a range of diffusion models and datasets confirm that our approach is generalizable, effective, and advantageous for MU in diffusion models.

174Generate explorative goals with large language model guidance

[openreview] [pdf]

Abstract Reinforcement learning (RL) struggles with sparse reward environments. Recent developments in intrinsic motivation have revealed the potential of language models to guide agents in exploring the environment. However, the mismatch between the granularity of environment transitions and natural language descriptions hinders effective exploration for current methods. To address this problem, we introduce a model-based RL method named Language-Guided Explorative Goal Generation (LanGoal), which combines large language model (LLM) guidance with intrinsic exploration reward by learning to propose meaningful goals. LanGoal learns a hierarchical policy together with a world model. The high-level policy learns to propose goals based on LLM guidance to explore the environment, and the low-level policy learns to achieve the goals. Extensive results on Crafter demonstrate the effectiveness of LanGoal compared to recent methods.

175Stability and Sharper Risk Bounds with Convergence Rate $O(1/n^2)$

[openreview] [pdf]

Abstract The sharpest known high-probability excess risk bounds are up to $O(1/n)$ for empirical risk minimization and projected gradient descent via algorithmic stability (Klochkov & Zhivotovskiy, 2021). In this paper, we show that high-probability excess risk bounds of order up to $O(1/n^2)$ are possible. We discuss how high-probability excess risk bounds reach $O(1/n^2)$ under strong convexity, smoothness, and Lipschitz continuity assumptions for empirical risk minimization, projected gradient descent, and stochastic gradient descent. Besides, to the best of our knowledge, our high-probability results on the generalization gap measured by gradients for nonconvex problems are also the sharpest.

176Learning Conditionally Independent Marginals Enables Logical Compositions in Conditional Diffusion Models

[openreview] [pdf]

Abstract How can we learn generative models to sample data with arbitrary logical compositions of statistically independent attributes? The prevailing solution is to sample from distributions expressed as a composition of attributes’ conditional marginal distributions under the assumption that they are statistically independent. This paper shows that standard conditional diffusion models violate this assumption, even when all attribute compositions are observed during training, and the violation is significantly more severe when only a subset of the compositions is observed. We propose CoInD to address this problem. It explicitly enforces statistical independence between the conditional marginal distributions by minimizing Fisher’s divergence between the joint and marginal distributions. The theoretical advantages of CoInD are reflected in both qualitative and quantitative experiments, demonstrating a significantly more faithful and controlled generation of samples for arbitrary logical compositions of attributes. The benefit is more pronounced for scenarios that current solutions relying on the assumption of conditionally independent marginals struggle with, namely, logical compositions involving the NOT operation and when only a subset of compositions are observed during training.

177Diffusion-Guided Safe Policy Optimization From Cost-Label-Free Offline Dataset

[openreview] [pdf]

Abstract Offline safe reinforcement learning (RL) aims to guarantee the safety of decision-making in both training and deployment phases by learning the safe policy entirely from offline data without further interaction with the environment, which pushes RL towards real-world applications. Previous efforts in offline safe RL typically presume the presence of Markovian costs within the dataset. However, the design of a Markovian cost function involves rehearsal of all potentially unsafe cases, which is inefficient and even unfeasible in many practical tasks. In this work, we take a further step forward by learning a safe policy from an offline dataset without any cost labels, but with a small number of safe demonstrations included. To solve this problem, we propose a two-stage optimization method called Diffusion-guided Safe Policy Optimization (DSPO). Initially, we derive trajectory-wise safety signals by training a return-agnostic discriminator. Subsequently, we train a conditional diffusion model that generates trajectories conditioned both on the trajectory return and the safety signal. Remarkably, the trajectories generated by our diffusion model not only yield high returns but also comply with the safety signals, from which we can derive a desirable policy through behavior cloning (BC). The evaluation experiments conducted across tasks from the SafetyGym, BulletGym, and MetaDrive environments demonstrate that our approach can achieve a safe policy with high returns, significantly outperforming various established baselines.

178KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies in Curiosity-driven Exploration

[openreview] [pdf]

Abstract In continuous control tasks, Soft Actor-Critic (SAC) has achieved notable success by balancing exploration and exploitation. However, SAC struggles in sparse reward environments, where infrequent rewards hinder efficient exploration. While curiosity-driven exploration methods help address this issue by encouraging the agent to explore novel states, they introduce challenges, such as the difficulty of setting an optimal reward scale and managing the interaction between curiosity-based exploration and SAC’s stochastic policy. These complexities often lead to inefficient exploration or premature convergence and make balancing exploration-exploitation challenging. In this paper, we propose KEA (Keeping Exploration Alive) to tackle the inefficiencies in balancing the exploration-exploitation trade-off when combining SAC with curiosity-based methods. KEA introduces an additional co-behavior agent that works alongside SAC and a switching mechanism to facilitate proactive coordination between exploration strategies from the co-behavior agent and the SAC agent with curiosity-based exploration. This coordination allows the agent to maintain stochasticity in high-novelty regions, preventing premature convergence and enhancing exploration efficiency. We first analyze the difficulty of balancing exploration-exploitation when combining SAC with curiosity-based methods in a 2D grid environment. We then evaluate KEA on sparse reward control tasks from the DeepMind Control Suite and compare against two state-of-the-art curiosity-based exploration baselines — Random Network Distillation (RND) and NovelD. KEA improves episodic rewards by up to 119% over RND and 28% over NovelD, significantly improving learning efficiency and robustness in sparse reward environments.

179Practical alignment requires more than learning from human feedback

[openreview] [pdf]

Abstract Ensuring the alignment of artificial intelligence (AI) systems with human objectives is a critical challenge in the development of safe and effective AI technologies. Reinforcement learning from human feedback (RLHF) has been a predominant method to tackle this challenge. However, this framework operates under the unrealistic assumptions that human preferences are accurate reflections of their desires and that they remain constant over time. This paper identifies and challenges these assumptions by illustrating how they can lead to undesirable consequences, particularly when human beliefs about the environment are incorrect or mutate over time. To address these challenges, we introduce a novel framework termed practical alignment. This framework redefines the alignment objective to accommodate the variability and irrationality of human beliefs, emphasizing the need for AI systems not only to learn from but also to teach humans about the world. We discuss the theoretical underpinnings of practical alignment and introduce MindGrid, a toolkit designed to simulate and evaluate alignment scenarios. Our experimental results using large language models in teaching scenarios underscore the importance of teaching skills as a requisite capability to achieve alignment.

180TerDiT: Ternary Diffusion Models with Transformers

[openreview] [pdf]

Abstract Recent developments in large-scale pre-trained text-to-image diffusion models have significantly improved the generation of high-fidelity images, particularly with the emergence of diffusion transformer models (DiTs). Among diffusion models, diffusion transformers have demonstrated superior image generation capabilities, boasting lower FID scores and higher scalability. However, deploying large-scale DiT models can be expensive due to their excessive parameter counts. Although existing research has explored efficient deployment techniques for diffusion models such as model quantization, there is still little work concerning DiT-based models. To tackle this research gap, in this paper, we propose TerDiT, a quantization-aware training (QAT) and efficient deployment scheme for ternary diffusion transformer models. We focus on the ternarization of DiT networks, with model sizes ranging from 600M to 4.2B, and image resolutions from 256×256 to 512×512. Our work contributes to the exploration of efficient deployment of large-scale DiT models, demonstrating the feasibility of training extremely low-bit DiT models from scratch while maintaining competitive image generation capacities compared to full-precision models. Code has been uploaded in the supplemental materials.

181UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models

[openreview] [pdf]

Abstract We introduce UniCon, a novel architecture designed to enhance control and efficiency in training adapters for large-scale diffusion models like the Diffusion Transformer. Unlike existing methods that rely on bidirectional interaction between the diffusion model and control adapter, UniCon implements a unidirectional flow from the diffusion network to the adapter, allowing the adapter alone to generate the final output. UniCon reduces computational demands by eliminating the need for the diffusion model to compute and store gradients during adapter training. UniCon is free from the constraints of encoder-focused designs and is able to utilize all parameters of the diffusion model, making it highly effective for transformer-based architectures. Our results indicate that UniCon reduces GPU memory usage by one-third and increases training speed by 2.3 times, while maintaining the same adapter parameter size. Additionally, without requiring extra computational resources, UniCon enables the training of adapters with double the parameter volume of existing ControlNets. In a series of conditional image generation tasks, UniCon has demonstrated precise response to control information and excellent generation capabilities. UniCon makes the control of large-scale diffusion models feasible and provides a basis for further scaling up of diffusion models.

182Understanding Scale Shift in Domain Generalization for Crowd Localization

[openreview] [pdf]

Abstract Crowd localization plays a crucial role in visual scene understanding towards predicting each pedestrian location in a crowd, thus being applicable to various downstream tasks. However, existing approaches suffer from significant performance degradation due to differences in head scale distributions (scale shift) between training and testing data, a challenge known as domain generalization (DG). This paper aims to comprehend the nature of scale shift within the context of domain generalization for crowd localization models. To this end, we address three key questions: (i) how to quantify the influence of scale shift on the DG task, (ii) why this influence occurs, and (iii) how to mitigate the influence. Specifically, we first establish a benchmark, ScaleBench, and reproduce 20 advanced DG algorithms to quantify the influence. Through extensive experiments, we demonstrate the limitations of existing algorithms and highlight the under-explored nature of this issue. To further understand the reason behind it, we provide a rigorous theoretical analysis of scale shift. Building on this analysis, we further propose a simple yet effective algorithm called Semantic Hook to mitigate the influence of scale shift on DG, which also serves as a case study revealing three significant insights for future research. Our results emphasize the importance of this novel and applicable research direction, which we term Scale Shift Domain Generalization.

183Bandits with Anytime Knapsacks

[openreview] [pdf]

Abstract We consider bandits with anytime knapsacks (BwAK), a novel version of the BwK problem where there is an anytime cost constraint instead of a total cost budget. This problem setting introduces additional complexities as it mandates adherence to the constraint throughout the decision-making process. We propose SUAK, a novel algorithm that utilizes upper confidence bounds to identify the optimal mixture of arms while maintaining a balance between exploration and exploitation. SUAK is an adaptive algorithm that strategically utilizes the available budget in each round in the decision-making process and skips a round when it is possible to violate the anytime cost constraint. In particular, SUAK slightly under-utilizes the available cost budget to reduce the need for skipping rounds. We show that SUAK attains the same problem-dependent regret upper bound of $O(K \log T)$ established in prior work under the simpler BwK framework. Finally, we provide simulations to verify the utility of SUAK in practical settings.
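A toy round in the spirit of the described mechanism, assuming known per-arm costs and a standard UCB index; the skip rule and the small safety margin mimic the abstract's description, but the exact bonus and margin are assumptions, not the paper's algorithm.

```python
import numpy as np

def suak_like_step(means, counts, t, costs, budget_left, cost_margin=0.05):
    """Illustrative round: pick the UCB-best arm, but skip the round if
    pulling it could violate the anytime budget; the `cost_margin` mimics
    the deliberate under-utilization described in the abstract."""
    ucb = means + np.sqrt(2 * np.log(max(t, 2)) / np.maximum(counts, 1))
    arm = int(np.argmax(ucb))
    if costs[arm] > budget_left - cost_margin:
        return None  # skip this round to respect the anytime constraint
    return arm
```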

184CoLa-DCE – Concept-guided Latent Diffusion Counterfactual Explanations

[openreview] [pdf]

Abstract Recent advancements in generative AI have introduced novel prospects and practical implementations. Especially diffusion models show their strength in generating diverse and, at the same time, realistic features, positioning them well for generating counterfactual explanations for computer vision models. Answering “what if” questions of what needs to change to make an image classifier change its prediction, counterfactual explanations align well with human understanding and consequently help in making model behavior more comprehensible. Current methods succeed in generating authentic counterfactuals, but lack transparency as feature changes are not directly perceivable. To address this limitation, we introduce Concept-guided Latent Diffusion Counterfactual Explanations (CoLa-DCE). CoLa-DCE generates concept-guided counterfactuals for any classifier with a high degree of control regarding concept selection and spatial conditioning. The counterfactuals comprise an increased granularity through minimal feature changes. The reference feature visualization ensures better comprehensibility, while the feature localization provides increased transparency about “what” changed “where”. We demonstrate the advantages of our approach in minimality and comprehensibility across multiple image classification models and datasets and provide insights into how our CoLa-DCE explanations help comprehend model errors like misclassification cases.

185Controlling Information Leakage in Concept Bottleneck Models with Trees

[openreview] [pdf]

Abstract As AI models grow larger, the demand for accountability and interpretability has become increasingly critical for understanding their decision-making processes. Concept Bottleneck Models (CBMs) have gained attention for enhancing interpretability by mapping inputs to intermediate concepts before making final predictions. However, CBMs often suffer from information leakage, where additional input data, not captured by the concepts, is used to improve task performance, complicating the interpretation of downstream predictions. In this paper, we introduce a novel approach for training both joint and sequential CBMs that allows us to identify and control leakage using decision trees. Our method quantifies leakage by comparing the decision paths of hard CBMs with their soft, leaky counterparts. Specifically, we show that soft leaky CBMs extend the decision paths of hard CBMs, particularly in cases where concept information is incomplete. Using this insight, we develop a technique to better inspect and manage leakage, isolating the subsets of data most affected by this. Through synthetic and real-world experiments, we demonstrate that controlling leakage in this way not only improves task accuracy but also yields more informative and transparent explanations.

186Diffusion-based Decoupled Deterministic and Uncertain Framework for Probabilistic Multivariate Time Series Forecasting

[openreview] [pdf]

Abstract Diffusion-based denoising models have demonstrated impressive performance in probabilistic forecasting for multivariate time series (MTS). Nonetheless, existing approaches often model the entire data distribution, neglecting the variability in uncertainty across different components of the time series. This paper introduces a Diffusion-based Decoupled Deterministic and Uncertain ($\mathrm{D^3U}$) framework for probabilistic MTS forecasting. The framework integrates non-probabilistic forecasting with conditional diffusion generation, enabling both accurate point predictions and probabilistic forecasting. $\mathrm{D^3U}$ utilizes a point forecasting model to non-probabilistically model high-certainty components in the time series, generating embedded representations that are conditionally injected into a diffusion model. To better model high-uncertainty components, a patch-based denoising network (PatchDN) is designed in the conditional diffusion model. Designed as a plug-and-play framework, $\mathrm{D^3U}$ can be seamlessly integrated into existing point forecasting models to provide probabilistic forecasting capabilities. It can also be applied to other conditional diffusion methods that incorporate point forecasting models. Experiments on six real-world datasets demonstrate that our method achieves over a 20% improvement in both point and probabilistic forecasting performance in MTS long-term forecasting compared to state-of-the-art (SOTA) methods. Additionally, extensive ablation studies further validate the effectiveness of the $\mathrm{D^3U}$ framework.

187On the feature learning in diffusion models

[openreview] [pdf]

Abstract The predominant success of diffusion models in generative modeling has spurred significant interest in understanding their theoretical foundations. In this work, we propose a feature learning framework aimed at analyzing and comparing the training dynamics of diffusion models with those of traditional classification models. Our theoretical analysis demonstrates that, under identical settings, neural networks trained for classification tend to prioritize learning specific patterns in the data, often focusing on easy-to-learn features. In contrast, diffusion models, due to the denoising objective, are encouraged to learn more balanced and comprehensive representations of the data. To support these theoretical insights, we conduct several experiments on both synthetic and real-world datasets, which empirically validate our findings and underscore the distinct feature learning dynamics of diffusion models compared to classification models.

188HP3O: Hybrid-Policy Proximal Policy Optimization with Best Trajectory

[openreview] [pdf]

Abstract Proximal policy optimization (PPO) is one of the most popular state-of-the-art on-policy algorithms that has become a standard baseline in modern reinforcement learning with applications in numerous fields. Though it delivers stable performance with theoretical policy improvement guarantees, high variance and high sample complexity still remain critical challenges in on-policy algorithms. To alleviate these issues, we propose Hybrid-Policy Proximal Policy Optimization (HP3O), which utilizes a trajectory replay buffer to make efficient use of trajectories generated by recent policies. Particularly, the buffer applies the “first in, first out” (FIFO) strategy so as to keep only the recent trajectories to attenuate the data distribution drift. A batch consisting of the trajectory with the best return and other randomly sampled ones from the buffer is used for updating the policy networks. This strategy helps the agent improve on its most recent best performance and in turn empirically reduces variance. We theoretically construct the policy improvement guarantees for the proposed algorithm. HP3O is validated and compared against several baseline algorithms using multiple continuous control environments. Our code is available here.
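The buffer logic is straightforward to sketch from the description: FIFO eviction keeps only recent trajectories, and each update batch pairs the best-return trajectory with random draws. Details such as the capacity and batch composition below are assumptions for illustration.

```python
import random
from collections import deque

class TrajectoryBuffer:
    """FIFO trajectory buffer sketched from HP3O's description: keep only
    trajectories from recent policies, and build update batches from the
    best-return trajectory plus random samples."""
    def __init__(self, capacity=32):
        self.buf = deque(maxlen=capacity)  # FIFO eviction attenuates distribution drift

    def add(self, trajectory, ret):
        self.buf.append((ret, trajectory))

    def sample_batch(self, n_random=4):
        """One batch: the best-return trajectory plus random draws from the buffer."""
        best = max(self.buf, key=lambda x: x[0])[1]
        others = [t for _, t in random.sample(list(self.buf), min(n_random, len(self.buf)))]
        return [best] + others
```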

189TopoDiffusionNet: A Topology-aware Diffusion Model

[openreview] [pdf]

Abstract Diffusion models excel at creating visually impressive images but often struggle to generate images with a specified topology. The Betti number, which represents the number of structures in an image, is a fundamental measure in topology. Yet, diffusion models fail to satisfy even this basic constraint. This limitation restricts their utility in applications requiring exact control, like robotics and environmental modeling. To address this, we propose TopoDiffusionNet (TDN), a novel approach that enforces diffusion models to maintain the desired topology. We leverage tools from topological data analysis, particularly persistent homology, to extract the topological structures within an image. We then design a topology-based objective function to guide the denoising process, preserving intended structures while suppressing noisy ones. Our experiments across four datasets demonstrate significant improvements in topological accuracy. TDN is the first to integrate topology with diffusion models, opening new avenues of research in this area.

190Do we need rebalancing strategies? A theoretical and empirical study around SMOTE and its variants

[openreview] [pdf]

Abstract Synthetic Minority Oversampling Technique (SMOTE) is a common rebalancing strategy for handling imbalanced tabular data sets. However, few works analyze SMOTE theoretically. In this paper, we prove that SMOTE (with default parameter) tends to copy the original minority samples asymptotically. We also prove that SMOTE exhibits boundary artifacts, thus justifying existing SMOTE variants. Then we introduce two new SMOTE-related strategies, and compare them with state-of-the-art rebalancing procedures. Surprisingly, for most data sets, we observe that applying no rebalancing strategy is competitive in terms of predictive performances, with tuned random forests, logistic regression or LightGBM. For highly imbalanced data sets, our new methods, named CV-SMOTE and Multivariate Gaussian SMOTE, are competitive. Besides, our analysis sheds some light on the behavior of common rebalancing strategies, when used in conjunction with random forests.
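For reference, vanilla SMOTE (the procedure analyzed here) interpolates each synthetic point between a minority sample and one of its k nearest minority neighbors:

```python
import numpy as np

def smote(minority, n_new, k=5, rng=np.random.default_rng(0)):
    """Vanilla SMOTE: each synthetic point lies on the segment between a
    random minority sample and one of its k nearest minority neighbors."""
    out = []
    for _ in range(n_new):
        i = rng.integers(len(minority))
        d = np.linalg.norm(minority - minority[i], axis=1)
        nbrs = np.argsort(d)[1:k + 1]   # nearest neighbors, excluding the point itself
        j = rng.choice(nbrs)
        lam = rng.random()              # uniform interpolation weight
        out.append(minority[i] + lam * (minority[j] - minority[i]))
    return np.array(out)
```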

191q-exponential family for policy optimization

[openreview] [pdf]

Abstract Policy optimization methods benefit from a simple and tractable policy parametrization, usually the Gaussian for continuous action spaces. In this paper, we consider a broader policy family that remains tractable: the $q$-exponential family. This family of policies is flexible, allowing the specification of both heavy-tailed policies ($q>1$) and light-tailed policies ($q<1$). This paper examines the interplay between $q$-exponential policies and several actor-critic algorithms on both online and offline problems. We find that heavy-tailed policies are more effective in general and can consistently improve on the Gaussian. In particular, we find the Student’s t-distribution to be more stable than the Gaussian across settings, and that a heavy-tailed $q$-Gaussian for Tsallis Advantage Weighted Actor-Critic consistently performs well in offline benchmark problems.
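In practice, swapping the policy head from a Gaussian to a heavy-tailed member of this family can be as small a change as the sampling distribution; a minimal sketch with the Student's t, where the degrees of freedom `df` is a design choice and not a value prescribed by the paper:

```python
import torch
from torch.distributions import StudentT, Normal

def sample_action(mu, sigma, df=3.0, heavy_tailed=True):
    """Draw a continuous action from a heavy-tailed Student's t policy
    (a member of the q-exponential family for q > 1) instead of a Gaussian."""
    dist = StudentT(df, loc=mu, scale=sigma) if heavy_tailed else Normal(mu, sigma)
    a = dist.sample()
    return a, dist.log_prob(a)  # log-prob is what a policy-gradient update needs

# Example: one action from a heavy-tailed policy head.
action, logp = sample_action(torch.zeros(2), torch.ones(2))
```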

192Refining Counterfactual Explanations With Joint-Distribution-Informed Shapley Towards Actionable Minimality

[openreview] [pdf]

Abstract Counterfactual explanations (CE) identify data points that closely resemble the observed data but produce different machine learning (ML) model outputs, offering critical insights into model decisions. Despite the diverse scenarios, goals and tasks to which they are tailored, existing CE methods often lack actionable efficiency because of unnecessary feature changes included within the explanations that are presented to users and stakeholders. We address this problem by proposing a method that minimizes the required feature changes while maintaining the validity of CE, without imposing restrictions on models or CE algorithms, whether instance- or group-based. The key innovation lies in computing a joint distribution between observed and counterfactual data and leveraging it to inform Shapley values for feature attributions (FA). We demonstrate that optimal transport (OT) effectively derives this distribution, especially when the alignment between observed and counterfactual data is unclear in used CE methods. Additionally, a counterintuitive finding is uncovered: it may be misleading to rely on an exact alignment defined by the CE generation mechanism in conducting FA. Our proposed method is validated through extensive experiments across multiple datasets, showcasing its effectiveness in refining CE towards greater actionable efficiency.

193Searching For Robust Point Cloud Distillation

[openreview] [pdf]

Abstract Deep Neural Networks (DNNs) have shown remarkable performance in machine learning; however, their vulnerabilities to adversarial attacks have been exposed, particularly in point cloud data. Neural Architecture Search (NAS) is a technique for discovering new neural architectures with high predictive accuracy, yet its potential for enhancing model robustness against adversarial attacks remains largely unexplored. In this study, we investigate the application of NAS within the framework of knowledge distillation, aiming to generate robust student architectures that inherit resilience from robust teacher models. We introduce RDANAS, an effective NAS method that utilizes cross-layer knowledge distillation from robust teacher models to enhance the robustness of the student model. Unlike previous studies, RDANAS considers the teacher model’s outputs and automatically identifies the optimal teacher layer for each student layer during supervision. Experimental results on ModelNet40, ScanObjectNN and ScanNet datasets demonstrate the efficacy of RDANAS, revealing that the neural architectures it generates are compact and possess adversarial robustness, which shows potential in multiple applications.

194Diffusion Auto-regressive Transformer for Effective Self-supervised Time Series Forecasting

[openreview] [pdf]

Abstract Self-supervised learning has become an essential and popular approach for enhancing time series forecasting, enabling models to learn universal representations from unlabeled data. However, effectively capturing both the global sequence dependence and local detail features within time series data remains challenging. To address this, we propose a novel generative self-supervised method called TimeDART, denoting Diffusion Auto-regressive Transformer for Time series forecasting. In TimeDART, we treat time series patches as basic modeling units. On the one hand, we employ a self-attention-based Transformer encoder to model inter-patch dependencies. On the other hand, we introduce diffusion and denoising mechanisms to capture intra-patch locality features. Notably, we design a cross-attention-based denoising decoder that allows for adjustable optimization difficulty in the self-supervised task, facilitating more effective self-supervised pre-training. Extensive experiments demonstrate that TimeDART achieves state-of-the-art fine-tuning performance compared to the most advanced competitive methods in forecasting tasks. Our code is publicly available at https://anonymous.4open.science/r/TimeDART-2024.

195Concepts’ Information Bottleneck Models

[openreview] [pdf]

Abstract Concept Bottleneck Models (CBMs) offer a self-explainable AI framework by predicting targets based on human-understandable concepts, but they often fail to achieve optimal performance and interpretability due to leakage of irrelevant information into the concept activations. This paper presents an information-theoretic enhancement of CBMs through the integration of the Information Bottleneck (IB) framework, aimed at addressing their issues of concept leakage and reduced performance. Our approach reshapes the way CBMs process and utilize concepts by constraining mutual information between input data and concepts, ensuring that only the most relevant information is preserved for decision-making. This introduces a new paradigm for CBMs that not only enhances performance but also enforces a tighter connection between latent representations and human-understandable concepts, ensuring a more robust and interpretable model. Our experiments on datasets such as CUB, AwA2, and aPY demonstrate that IB-augmented CBMs improve both concept and target prediction accuracy, while also increasing intervenability. Additionally, we propose a novel metric to assess the quality of concept sets based on intervention performance. Unlike traditional task performance metrics, which may obscure the effects of concept leakage, the new metric offers a direct, interpretable evaluation of concept set goodness.

196Online Policy Selection for Inventory Problems

[openreview] [pdf]

Abstract We tackle online inventory problems where at each time period the manager makes a replenishment decision based on partial historical information in order to meet demands and minimize costs. To solve such problems, we build upon recent works in online learning and control, use insights from inventory theory and propose a new algorithm called GAPSI. This algorithm follows a new feature-enhanced base-stock policy and deals with the troublesome question of non-differentiability which occurs in inventory problems. Our method is illustrated in the context of a complex and novel inventory system involving multiple products, lost sales, perishability, warehouse-capacity constraints and lead times. Extensive numerical simulations are conducted to demonstrate the strong performance of our algorithm on real-world data.
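A base-stock policy orders up to a target level S; the sketch below assumes a linear, feature-dependent S purely for illustration, whereas GAPSI's actual feature enhancement and its handling of non-differentiability are more involved.

```python
import numpy as np

def base_stock_order(inventory_position, features, theta):
    """Feature-enhanced base-stock rule in the spirit of GAPSI: the
    order-up-to level S is a learned function of context features (here
    linear in `theta`, an assumed form), and we order the shortfall,
    never a negative amount."""
    S = float(features @ theta)            # context-dependent base-stock level
    return max(0.0, S - inventory_position)

# Example: order quantity given current stock, today's features, and weights.
print(base_stock_order(12.0, np.array([1.0, 0.3]), np.array([10.0, 20.0])))
```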

197Outward Odyssey: Improving Reward Models with Proximal Policy Exploration for Preference-Based Reinforcement Learning

[openreview] [pdf]

Abstract Reinforcement learning (RL) heavily depends on well-designed reward functions, which can be challenging to create and may introduce biases, especially for complex behaviors. Preference-based RL (PbRL) addresses this by using human feedback to construct a reward model that reflects human preferences, yet requiring considerable human involvement. To alleviate this, several PbRL methods aim to select queries that need minimal feedback. However, these methods do not directly enhance the data coverage within the preference buffer. In this paper, to emphasize the critical role of preference buffer coverage in determining the quality of the reward model, we first investigate and find that a reward model’s evaluative accuracy is highest for trajectories within the preference buffer’s distribution and decreases significantly for out-of-distribution trajectories. Motivated by this phenomenon, we introduce the Proximal Policy Exploration (PPE) algorithm, which consists of a proximal-policy extension method and a mixture distribution query method. To achieve higher preference buffer coverage, the proximal-policy extension method encourages active exploration of data within near-policy regions that fall outside the preference buffer’s distribution. To balance the inclusion of in-distribution and out-of-distribution data, the mixture distribution query method proactively selects a mix of data from both outside and within the preference buffer’s distribution for querying. PPE not only expands the preference buffer’s coverage but also ensures the reward model’s evaluative capability for in-distribution data. Our comprehensive experiments demonstrate that PPE achieves significant improvement in both human feedback efficiency and RL sample efficiency, underscoring the importance of preference buffer coverage in PbRL tasks.

198Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation

[openreview] [pdf]

Abstract We consider a class of conditional forward-backward diffusion models for conditional generative modeling, that is, generating new data given a covariate (or control variable). To formally study the theoretical properties of these conditional generative models, we adopt a statistical framework of distribution regression to characterize the large sample properties of the conditional distribution estimators induced by these conditional forward-backward diffusion models. Here, the conditional distribution of data is assumed to smoothly change over the covariate. In particular, our derived convergence rate is minimax-optimal under the total variation metric within the regimes covered by the existing literature. Additionally, we extend our theory by allowing both the data and the covariate variable to potentially admit a low-dimensional manifold structure. In this scenario, we demonstrate that the conditional forward-backward diffusion model can adapt to both manifold structures, meaning that the derived estimation error bound (under the Wasserstein metric) depends only on the intrinsic dimensionalities of the data and the covariate.

199D3PM: Diffusion Model Responds to the Duty Call from Causal Discovery

[openreview] [pdf]

Abstract Causal discovery (CD) involves inferring cause-and-effect relationships as directed acyclic graphs (DAGs). In this work, we assume that the data is generated by an additive noise model (ANM). Recent work has formulated the problem as a continuous optimization problem, which consists of solving an inverse problem and satisfying an acyclicity constraint. However, solving the inverse problem in CD is often unstable, i.e. high sensitivity of the effects to perturbations in the causes. To address this instability, we formulate the inverse problem as a regularized optimization scheme and propose a novel variation-negotiation regularizer. Compared to traditional regularization techniques for the continuous optimization problem, e.g. $\ell_1$ penalty on graphs, the proposed regularizer exploits the variation variable in ANMs to stabilize the solutions (i.e. DAGs). This regularizer is advantageous as it does not rely on any hypotheses, such as graph sparsity, about true DAGs. The variation-negotiation regularizer regulates the DAG purely based on observed data. Building on the proposed regularizer, a series of improvements to the regularized optimization scheme reveal the connections between solving the regularized optimization problem and learning a diffusion model, as they share comparable objective functions. This insight leads us to develop an equivalent diffusion model called DAG-invariant Denoising Diffusion Probabilistic Model. Extensive empirical experiments on synthetic and real datasets demonstrate that the proposed diffusion model achieves outstanding performance on all datasets.

200Domain Shift Tuning over Knowledge Gap

[openreview] [pdf]

Abstract This paper introduces Domain Shift Tuning (DST), a novel framework designed to guide pre-trained language models (PLMs), including Large Language Models (LLMs), in overcoming domain discrepancies (i.e., source-target). PLMs, pre-trained on extensive and diverse corpora (the source domain), often encounter domain gaps after fine-tuning on the target domain. Unlike conventional adapters or Parameter-Efficient Fine-Tuning (PEFT) methods, DST conceptualizes domain gaps as differences in knowledge encapsulated within multiple subnetworks of PLMs. To bridge this gap, our challenge is to find a subnetwork set that corresponds to these pieces of knowledge and their weights. This direction leads DST to employ a lightweight subnetwork, the Knowledge Steering Layer (KSL), and a training objective, Knowledge Distribution Modeling (KDM). These components enable DST to fine-tune PLMs by aligning the knowledge weights of the source domain with those of the target domain. Experimental results on diverse datasets demonstrate that DST effectively mitigates the domain gap, allowing PLMs to generate text that closely aligns with even a small target corpus, thereby significantly enhancing domain adaptation for PLMs at lower computational cost.

201Deployment Efficient Reward-Free Exploration with Linear Function Approximation

[openreview] [pdf]

Abstract We study deployment-efficient reward-free exploration with linear function approximation, where the goal is to explore a linear Markov Decision Process (MDP) without revealing the reward function, while minimizing the number of exploration policies used during the algorithm. We design a new reinforcement learning (RL) algorithm whose sample complexity is polynomial in the feature dimension and horizon length, while achieving nearly optimal deployment efficiency for linear MDPs under the reward-free exploration setting. More specifically, our algorithm explores a linear MDP in a reward-free manner, while using at most $H$ exploration policies during its execution, where $H$ is the horizon length. Compared to previous algorithms with similar deployment efficiency guarantees, the sample complexity of our algorithm does not depend on the reachability coefficient or the explorability coefficient of the underlying MDP, which can be arbitrarily small for certain MDPs. Our result addresses an open problem proposed in prior work. To achieve such a result, we show how to truncate state-action pairs of the underlying linear MDP in a data-dependent manner, and devise efficient offline policy evaluation and offline policy optimization algorithms in the truncated linear MDP. We further show how to implement reward-free exploration mechanisms in the linear function approximation setting by carefully combining these offline RL algorithms without sacrificing the deployment efficiency.

202Channel-aware Contrastive Conditional Diffusion for Multivariate Probabilistic Time Series Forecasting

[openreview] [pdf]

Abstract Forecasting faithful trajectories of multivariate time series from practical scopes is essential for reasonable decision-making. Recent methods majorly tailor generative conditional diffusion models to estimate the target temporal predictive distribution. However, efficiently exploiting the implicit temporal predictive information to bolster conditional diffusion learning remains an obstacle. To this end, we propose a generic channel-aware contrastive conditional diffusion model termed CCDM to achieve desirable multivariate probabilistic forecasting, obviating the need for curated temporal conditioning inductive biases. In detail, we first design a channel-centric conditional denoising network to manage intra-variate variations and cross-variate correlations, which can lead to scalability on diverse prediction horizons and channel numbers. Then, we devise an ad-hoc denoising-based temporal contrastive learning to explicitly amplify the predictive mutual information between past observations and future forecasts. It can coherently complement naive step-wise denoising diffusion training and improve the forecasting accuracy and generality on unknown test time series. Besides, we offer theoretic insights on the benefits of such auxiliary contrastive training refinement from both neural mutual information and temporal distribution generalization aspects. The proposed CCDM exhibits superior forecasting capability compared to current state-of-the-art diffusion forecasters over a comprehensive benchmark, achieving the best MSE and CRPS outcomes in 66.67% and 83.33% of cases, respectively.

203Dataset Distillation for Domain Generalization

[openreview] [pdf]

Abstract Dataset Distillation (DD) has been applied to various downstream tasks and recently scaled to ImageNet-1k, highlighting its potential for practical applications. However, in real-world scenarios, robustness to unseen domains is essential, and the robustness of models trained on synthetic datasets remains uncertain. To address this, we propose a novel task, Dataset Distillation for Domain Generalization (DD for DG), and evaluate the unseen domain generalization of models trained on synthetic datasets distilled by state-of-the-art DD methods using the DomainBed benchmark. Additionally, we introduce a new method for this task, which interprets DD through the lens of image style transfer, achieving superior performance in unseen domain generalization compared to baseline approaches.

204Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback

[openreview] [pdf]

Abstract Learning from human feedback has enabled the alignment of language models (LMs) with human preferences. However, directly collecting human preferences can be expensive, time-consuming, and can have high variance. An appealing alternative is to distill preferences from LMs as a source of synthetic annotations as they are more consistent, cheaper, and scale better than human annotation; however, they are also prone to biases and errors. In this work, we introduce a routing framework that combines inputs from humans and LMs to achieve better annotation quality, while reducing the total cost of human annotation. The crux of our approach is to identify preference instances that will benefit from human annotations. We formulate this as an optimization problem: given a preference dataset and an evaluation metric, we train a performance prediction model to predict a reward model’s performance on an arbitrary combination of human and LM annotations and employ a routing strategy that selects a combination that maximizes predicted performance. We train the performance prediction model on MultiPref, a new preference dataset with 10K instances paired with human and LM labels. We show that the selected hybrid mixture of LM and direct human preferences using our routing framework achieves better reward model performance compared to using either one exclusively. We simulate selective human preference collection on three other datasets and show that our method generalizes well to all three. We analyze features from the routing model to identify characteristics of instances that can benefit from human feedback, e.g., prompts with a moderate safety concern or moderate intent complexity. We release the dataset, annotation platform, and source code used in this study to foster more efficient and accurate preference collection in the future.
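
A hedged sketch of the routing step as described: a performance-prediction model scores candidate human/LM annotation mixes, and the mix with the best predicted reward-model performance is selected. The features, the linear predictor, and the greedy search are toy stand-ins, not the paper's models.

```python
import numpy as np

rng = np.random.default_rng(0)
n_instances, budget = 200, 50
features = rng.standard_normal((n_instances, 3))    # per-instance features

w = np.array([0.5, -0.2, 0.8])                      # hypothetical predictor weights
def predicted_performance(human_mask):
    # Predicted reward-model performance if `human_mask` instances get human
    # labels and the rest keep LM labels (toy linear model over mean features).
    return float(features[human_mask].mean(axis=0) @ w) if human_mask.any() else 0.0

# Greedy routing: send to humans the instances the predictor values most.
gain = features @ w
human = np.zeros(n_instances, dtype=bool)
human[np.argsort(-gain)[:budget]] = True
print(f"routed {human.sum()} to humans, predicted perf {predicted_performance(human):.3f}")
```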

205Counterfactual Delayed Feedback Learning

[openreview] [pdf]

Abstract Estimation of heterogeneous treatment effects has gathered much attention in recent years and has been widely adopted in medicine, economics, and marketing. Previous studies assumed that one of the potential outcomes of interest could be observed timely and accurately. However, a more practical scenario is that treatment takes time to produce causal effects on the outcomes. For example, drugs take time to produce medical utility for patients, and users take time to purchase items after being recommended; ignoring such delays in feedback can lead to biased estimates of heterogeneous treatment effects. To address this problem, we study the impact of observation time on estimating heterogeneous treatment effects by explicitly modeling the potential response time of potential outcomes. We theoretically prove identifiability results and propose a principled learning approach, known as CFR-DF (Counterfactual Regression with Delayed Feedback), to simultaneously learn potential response times and potential outcomes of interest. Results on both simulated and real-world datasets demonstrate the effectiveness of our method.

206Influence Functions for Scalable Data Attribution in Diffusion Models

[openreview] [pdf]

Abstract Diffusion models have led to significant advancements in generative modelling. Yet their widespread adoption poses challenges regarding data attribution and interpretability. In this paper, we aim to help address such challenges in diffusion models by extending influence functions. Influence function-based data attribution methods approximate how a model’s output would have changed if some training data were removed. In supervised learning, this is usually used for predicting how the loss on a particular example would change. For diffusion models, we focus on predicting the change in the probability of generating a particular example via several proxy measurements. We show how to formulate influence functions for such quantities and how previously proposed methods can be interpreted as particular design choices in our framework. To ensure scalability of the Hessian computations in influence functions, we use a K-FAC approximation based on generalised Gauss-Newton matrices specifically tailored to diffusion models. We show that our recommended method outperforms previously proposed data attribution methods on common data attribution evaluations, such as the Linear Data-modelling Score (LDS) or retraining without top influences, without the need for method-specific hyperparameter tuning.
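
A toy sketch of the general influence-function recipe this line of work builds on (not the paper's diffusion-specific proxy measurements or its K-FAC approximation): the effect of removing a training example on a measurement $m(\theta)$ is approximated via the inverse Hessian of the training loss. Here the Hessian is exact because the model is a tiny linear regressor; K-FAC is the scalable stand-in the paper uses.

```python
import torch

torch.manual_seed(0)
X = torch.randn(32, 4)
w_true = torch.randn(4)
y = X @ w_true + 0.1 * torch.randn(32)

w = torch.zeros(4, requires_grad=True)
opt = torch.optim.LBFGS([w], max_iter=100)

def loss_fn():
    opt.zero_grad()
    loss = ((X @ w - y) ** 2).mean()
    loss.backward()
    return loss

opt.step(loss_fn)

# Measurement m(w): loss on a held-out "query" example. For diffusion models
# the paper instead uses proxy measurements of generation probability.
x_q, y_q = torch.randn(4), torch.tensor(0.5)
m = (x_q @ w - y_q) ** 2
grad_m = torch.autograd.grad(m, w)[0]

# Damped Hessian of the training loss (K-FAC would approximate this blockwise).
H = 2.0 / len(X) * X.T @ X + 1e-3 * torch.eye(4)
H_inv_grad_m = torch.linalg.solve(H, grad_m)

# Influence of a training example: its loss gradient dotted with H^{-1} grad m.
for i in range(3):
    g_i = 2 * (X[i] @ w - y[i]) * X[i]
    print(f"influence of example {i}: {torch.dot(g_i.detach(), H_inv_grad_m):+.4f}")
```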

207AVID: Adapting Video Diffusion Models to World Models

[openreview] [pdf]

Abstract Large-scale generative models have achieved remarkable success in a number of domains. However, for sequential decision-making problems, such as robotics, action-labelled data is often scarce and therefore scaling-up foundation models for decision-making remains a challenge. A potential solution lies in leveraging widely-available unlabelled videos to train world models that simulate the consequences of actions. If the world model is accurate, it can be used to optimize decision-making in downstream tasks. Image-to-video diffusion models are already capable of generating highly realistic synthetic videos. However, these models are not action-conditioned, and the most powerful models are closed source which means they cannot be finetuned. In this work, we propose to adapt pretrained video diffusion models to action-conditioned world models, without access to the parameters of the pretrained model. Our approach, AVID, trains an adapter on a small domain-specific dataset of action-labelled videos. AVID uses a learnt mask to modify the intermediate outputs of the pretrained model and generate accurate action-conditioned videos. We evaluate AVID on video game and real-world robotics data, and show that it outperforms existing baselines for diffusion model adaptation. Our results demonstrate that if utilized correctly, pretrained video models have the potential to be powerful tools for embodied AI.
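
A hedged sketch of the AVID-style combination: a small action-conditioned adapter and a learned mask blend with the frozen pretrained model's intermediate output. All module names, shapes, and the conditioning scheme are illustrative assumptions.

```python
import torch
import torch.nn as nn

class MaskedAdapter(nn.Module):
    def __init__(self, channels: int, action_dim: int):
        super().__init__()
        self.action_proj = nn.Linear(action_dim, channels)
        self.adapter = nn.Conv2d(channels, channels, 3, padding=1)
        self.mask_head = nn.Conv2d(channels, 1, 1)  # per-pixel blend weight

    def forward(self, pretrained_out, noisy_latent, action):
        # Condition the adapter on the action by adding a projected embedding.
        a = self.action_proj(action)[:, :, None, None]
        adapter_out = self.adapter(noisy_latent + a)
        mask = torch.sigmoid(self.mask_head(noisy_latent))
        # The learned mask decides, per location, whether to trust the frozen
        # pretrained prediction or the action-conditioned correction.
        return mask * adapter_out + (1 - mask) * pretrained_out

adapter = MaskedAdapter(channels=8, action_dim=4)
pretrained_out = torch.randn(2, 8, 16, 16)   # frozen model output (no grads)
noisy_latent = torch.randn(2, 8, 16, 16)
action = torch.randn(2, 4)
print(adapter(pretrained_out, noisy_latent, action).shape)  # (2, 8, 16, 16)
```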

208Shift-Resilient Diffusive Imputation for Variable Subset Forecasting

[openreview] [pdf]

Abstract It is common for sensor failures to result in missing data, leaving training sets complete while test sets contain only a small subset of variables. The challenge lies in forecasting from such incomplete data, a setting known as Variable Subset Forecasting (VSF). VSF tasks exhibit significant distribution shift of two types: inter-series shift, which indicates changes in correlations between different series, and intra-series shift, which refers to substantial distribution differences within the same series across different time windows. Existing approaches to VSF typically impute the missing data first and then make predictions using the completed series. However, these methods do not account for the shift inherent in VSF tasks, resulting in poor model performance. To address these challenges, we propose a Shift-Resilient Diffusive Imputation (SRDI) framework. Specifically, SRDI integrates a divide-and-conquer strategy with the denoising process, decomposing the input into invariant patterns and variant patterns that represent the temporally stable and the highly fluctuating parts of inter-series correlation, respectively. By extracting spatiotemporal features from each separately and then combining them appropriately, inter-series shift can be effectively mitigated. Then, we organize SRDI and the forecasting model into a meta-learning paradigm tailored for VSF scenarios, addressing intra-series shift by treating time windows as tasks during training and employing an adaptation process before testing. Extensive experiments on four datasets demonstrate our superior performance compared with state-of-the-art methods.

209Learning-Guided Rolling Horizon Optimization for Long-Horizon Flexible Job-Shop Scheduling

[openreview] [pdf]

Abstract Long-horizon combinatorial optimization problems, such as the Flexible Job-Shop Scheduling Problem (FJSP), often involve complex, interdependent decisions over extended time frames, posing significant challenges for existing solvers. While Rolling Horizon Optimization (RHO) addresses this by decomposing problems into overlapping shorter-horizon subproblems, such overlap often leads to redundant computations. In this paper, we present L-RHO, the first learning-guided RHO framework for long-horizon FJSP. L-RHO employs a customized attention-based model to intelligently fix variables that in hindsight did not need to be re-optimized, resulting in smaller and thus easier-to-solve subproblems. For FJSP, this means identifying operations with unchanged machine assignments between two consecutive subproblems. Empirically, L-RHO accelerates RHO by up to 54% while showing significantly improved solution quality, enabling it to outperform other heuristic and learning-based baselines. We also provide in-depth discussions and verify the desirable adaptability and generalization of L-RHO across various FJSP settings, distributions, and online scenarios. Moreover, we provide a theoretical analysis to elucidate the conditions under which learning is beneficial.

210Make Interval Bound Propagation great again

[openreview] [pdf]

Abstract In various scenarios motivated by real life, such as medical data analysis, autonomous driving, and adversarial training, we are interested in robust deep networks. A network is robust when a relatively small perturbation of the input cannot lead to drastic changes in the output (such as a change of class). This falls under the broader field of Neural Network Certification (NNC). Two crucial problems in NNC are of profound interest to the scientific community: how to calculate the robustness of a given pre-trained network, and how to construct robust networks. The common approach to constructing robust networks is Interval Bound Propagation (IBP). This paper demonstrates that IBP is sub-optimal in the first case due to its susceptibility to the wrapping effect. Even for linear activations, IBP gives strongly sub-optimal bounds. Consequently, one should use strategies immune to the wrapping effect to obtain bounds close to optimal. We adapt two classical approaches dedicated to strict computations -- Doubleton Arithmetic and Affine Arithmetic -- to mitigate the wrapping effect in neural networks. These techniques yield precise results for networks with linear activation functions and thus resist the wrapping effect. As a result, we achieve bounds significantly closer to optimal than those of IBP.
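
A minimal numerical sketch of the wrapping effect the abstract analyzes: propagating an input box through two linear maps layer by layer (as IBP does) overestimates the exact reachable set obtained by propagating through the composed map, which is what affine/doubleton-style arithmetic recovers for linear activations. The matrices and radii are arbitrary toy values.

```python
import numpy as np

rng = np.random.default_rng(0)
W1, W2 = rng.standard_normal((3, 3)), rng.standard_normal((3, 3))
center, radius = np.zeros(3), np.ones(3) * 0.1  # input box [c - r, c + r]

# IBP: propagate center and radius through each layer separately.
c1, r1 = W1 @ center, np.abs(W1) @ radius
c2_ibp, r2_ibp = W2 @ c1, np.abs(W2) @ r1

# Exact bounds for the linear composition: propagate through W2 @ W1 at once.
W = W2 @ W1
c2_exact, r2_exact = W @ center, np.abs(W) @ radius

print("IBP radius:  ", r2_ibp)
print("exact radius:", r2_exact)  # elementwise no larger than IBP's
```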

211Minimax Optimal Regret Bound for Reinforcement Learning with Trajectory Feedback

[openreview] [pdf]

Abstract We study the reinforcement learning (RL) problem with trajectory feedback, where the learner can only observe the cumulative noisy reward along the trajectory. This setting is particularly suitable for practical scenarios where the agent suffers extensively from querying the reward at each single step. For a finite-horizon Markov Decision Process (MDP) with $S$ states, $A$ actions and a horizon length of $H$, we develop an algorithm that enjoys an optimal regret of $\tilde{O}(\sqrt{SAH^3K})$ over $K$ episodes for sufficiently large $K$. To achieve this, our technical contributions are two-fold: (1) we incorporate reinforcement learning with the linear bandits problem to construct a tighter confidence region for the reward function; (2) we construct a reference transition model to better guide the exploration process.

212AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

[openreview] [pdf]

Abstract Graphical User Interface (GUI) agents hold great potential for automating complex tasks across diverse digital environments, from web applications to desktop software. However, the development of such agents is hindered by the lack of high-quality, multi-step trajectory data required for effective training. Existing approaches rely on expensive and labor-intensive human annotation, making them unsustainable at scale. To address this challenge, we propose AgentTrek, a scalable data synthesis pipeline that generates high-quality GUI agent trajectories by leveraging web tutorials. Our method automatically gathers tutorial-like texts from the internet, transforms them into task goals with step-by-step instructions, and employs a visual-language model (VLM) agent to simulate their execution in a real digital environment. A VLM-based evaluator ensures the correctness of the generated trajectories. We demonstrate that training GUI agents with these synthesized trajectories significantly improves their grounding and planning performance over the current models. Moreover, our approach is more cost-efficient compared to traditional human annotation methods. This work underscores the potential of guided replay with web tutorials as a viable strategy for large-scale GUI agent training, paving the way for more capable and autonomous digital agents.

213Incorporating continuous dependence implies better generalization ability

[openreview] [pdf]

Abstract When applying deep-learning-based solvers to differential equations, a key challenge is how to improve their generalization ability so that pre-trained models can easily be adapted to new scenarios of interest. In this paper, inspired by the well-known mathematical statements on the continuous dependence of solutions to ordinary differential equations on initial values and parameters, we make a non-trivial extension of physics-informed neural networks by incorporating additional information on the continuous dependence of solutions (abbreviated as cd-PINN). Our cd-PINN integrates the advantages of neural operators and Meta-PINN, requiring only a few labeled data points while enabling fast and accurate solution of ordinary differential equations under new initial values and parameters without fine-tuning. As demonstrated on examples such as the Logistic model, the Lotka-Volterra model, and damped harmonic oscillators, the accuracy of cd-PINN under those untrained conditions is usually 1-3 orders of magnitude higher than that of PINN, while the GPU training time of the two approaches is comparable. We therefore expect cd-PINN to be particularly useful in improving the efficiency and accuracy of deep-learning-based solvers for differential equations.
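
A hedged sketch of the core idea as we read it: the network takes the initial value and the ODE parameter as extra inputs, so one model covers a whole family of logistic equations du/dt = r u (1 - u). The sampling ranges, losses, and the omission of any extra continuous-dependence regularizer are illustrative assumptions.

```python
import torch
import torch.nn as nn

net = nn.Sequential(nn.Linear(3, 64), nn.Tanh(), nn.Linear(64, 64),
                    nn.Tanh(), nn.Linear(64, 1))

def residual_loss(n=256):
    t = torch.rand(n, 1, requires_grad=True)          # times in [0, 1]
    u0 = torch.rand(n, 1) * 0.8 + 0.1                 # sampled initial values
    r = torch.rand(n, 1) * 2.0 + 0.5                  # sampled growth rates
    u = net(torch.cat([t, u0, r], dim=1))
    du_dt = torch.autograd.grad(u.sum(), t, create_graph=True)[0]
    ode = (du_dt - r * u * (1 - u)).pow(2).mean()     # physics residual
    ic = (net(torch.cat([torch.zeros_like(t), u0, r], dim=1)) - u0).pow(2).mean()
    return ode + ic

opt = torch.optim.Adam(net.parameters(), lr=1e-3)
for step in range(200):
    opt.zero_grad()
    loss = residual_loss()
    loss.backward()
    opt.step()
print(f"final residual loss: {loss.item():.4f}")
```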

214Uncertainty-aware Guided Diffusion for Missing Data in Sequential Recommendation

[openreview] [pdf]

Abstract Denoising diffusion models (DDMs) have shown significant potential in generating oracle items that best match user preference with guidance from user historical interaction sequences. However, the quality of guidance is often compromised by unpredictable missing data in the observed sequence, leading to suboptimal item generation. To tackle this challenge, we propose a novel uncertainty-aware guided diffusion model (DreamMiss) to alleviate the influence of missing data. The core of DreamMiss is the utilization of a dual-side Thompson sampling (DTS) strategy, which simulates the stochastic mechanism of missing data without disrupting preference evolution. Specifically, we first define dual-side probability models to capture user preference evolution, taking into account both local item continuity and global sequence stability. We then strategically remove items based on these two models with DTS, creating uncertainty-aware guidance for DDMs to generate oracle items. This achieves consistency regularization for DDMs, enabling them to remain resilient against uncertain missing data. Additionally, to accelerate sampling in the reverse process, DreamMiss is implemented under the framework of denoising diffusion implicit models (DDIM). Extensive experimental results show that DreamMiss significantly outperforms baselines in sequential recommendation.

215DyDiff: Long-Horizon Rollout via Dynamics Diffusion for Offline Reinforcement Learning

[openreview] [pdf]

Abstract With the great success of diffusion models (DMs) in generating realistic synthetic vision data, many researchers have investigated their potential in decision-making and control. Most of these works utilized DMs to sample directly from the trajectory space, where DMs can be viewed as a combination of dynamics models and policies. In this work, we explore how to decouple DMs’ ability as dynamics models in fully offline settings, allowing the learning policy to roll out trajectories. As DMs learn the data distribution from the dataset, their intrinsic policy is actually the behavior policy induced from the dataset, which results in a mismatch between the behavior policy and the learning policy. We propose Dynamics Diffusion, short as DyDiff, which can inject information from the learning policy to DMs iteratively. DyDiff ensures long-horizon rollout accuracy while maintaining policy consistency and can be easily deployed on model-free algorithms. We provide theoretical analysis to show the advantage of DMs on long-horizon rollout over models and demonstrate the effectiveness of DyDiff in the context of offline reinforcement learning, where the rollout dataset is provided but no online environment for interaction. Our code is at https://anonymous.4open.science/r/DyDiff.

216The Convergence of Second-Order Sampling Methods for Diffusion Models

[openreview] [pdf]

Abstract Diffusion models have achieved great success in generating samples from complex distributions, notably in the domains of images and videos. Beyond the experimental success, theoretical insights into their performance have been illuminated, particularly concerning the convergence of diffusion models when applied with discretization methods such as Euler-Maruyama (EM) and Exponential Integrator (EI). This paper embarks on analyzing the convergence of the higher-order discretization method (SDE-DPM-2) under an $L^2$-accurate score estimate. Our findings reveal that to attain $\tilde{O}(\epsilon_0^2)$ Kullback-Leibler (KL) divergence between the target and the sampled distributions, the sampling complexity -- or the required number of discretization steps -- for SDE-DPM-2 is $\tilde{O}(1/\epsilon_0)$, which is better than the currently known sample complexity of EI given by $\tilde{O}(1/\epsilon_0^2)$. We further extend our analysis to the Runge-Kutta-2 (RK-2) method, which demands a sampling complexity of $\tilde{O}(1/\epsilon_0^2)$, indicating that SDE-DPM-2 is more efficient than RK-2. Our study also demonstrates that the convergence of SDE-DPM-2 under Variance Exploding (VE) SDEs aligns with that of Variance Preserving (VP) SDEs, highlighting the adaptability of SDE-DPM-2 across various diffusion model frameworks.

217An Efficient Framework for Crediting Data Contributors of Diffusion Models

[openreview] [pdf]

Abstract As diffusion models are deployed in real-world settings and their performance is driven by training data, appraising the contribution of data contributors is crucial to creating incentives for sharing quality data and to implementing policies for data compensation. Depending on the use case, model performance corresponds to various global properties of the distribution learned by a diffusion model (e.g., overall aesthetic quality). Hence, here we address the problem of attributing global properties of diffusion models to data contributors. The Shapley value provides a principled approach to valuation by uniquely satisfying game-theoretic axioms of fairness. However, estimating Shapley values for diffusion models is computationally impractical because it requires retraining and rerunning inference on many subsets of data contributors. We introduce a method to efficiently retrain and rerun inference for Shapley value estimation, by leveraging model pruning and fine-tuning. We evaluate the utility of our method with three use cases: (i) image quality for a DDPM trained on a CIFAR dataset, (ii) demographic diversity for an LDM trained on CelebA-HQ, and (iii) aesthetic quality for a Stable Diffusion model LoRA-finetuned on Post-Impressionist artworks. Our results empirically demonstrate that our framework can identify important data contributors across global properties, outperforming existing attribution methods for diffusion models.
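
A minimal Monte Carlo sketch of Shapley value estimation over data contributors. The paper's contribution is making the utility evaluation (retraining plus inference) cheap via pruning and fine-tuning; here a made-up scalar utility over contributor subsets stands in for that evaluation.

```python
import random

contributors = ["A", "B", "C", "D"]

def utility(subset):
    # Hypothetical stand-in for "global property of the retrained model",
    # e.g. aesthetic quality of generations from a model trained on `subset`.
    base = {"A": 3.0, "B": 1.0, "C": 2.0, "D": 0.5}
    return sum(base[c] for c in subset) ** 0.5 if subset else 0.0

def shapley_mc(n_perms=2000, seed=0):
    rng = random.Random(seed)
    values = {c: 0.0 for c in contributors}
    for _ in range(n_perms):
        perm = contributors[:]
        rng.shuffle(perm)
        prefix, prev = [], 0.0
        for c in perm:
            prefix.append(c)
            cur = utility(prefix)
            values[c] += (cur - prev) / n_perms  # average marginal contribution
            prev = cur
    return values

print(shapley_mc())  # contributor "A" should receive the largest share
```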

218Policy Gradient with Tree Expansion

[openreview] [pdf]

Abstract Policy gradient methods are notorious for having a large variance and high sample complexity. To mitigate this, we introduce SoftTreeMax---a generalization of softmax that employs planning. In SoftTreeMax, we extend the traditional logits with the multi-step discounted cumulative reward, topped with the logits of future states. We analyze SoftTreeMax and explain how tree expansion helps to reduce its gradient variance. We prove that the variance depends on the chosen tree-expansion policy. Specifically, we show that the closer the induced transitions are to being state-independent, the stronger the variance decay. With approximate forward models, we prove that the resulting gradient bias diminishes with the approximation error while retaining the same variance reduction. Ours is the first result to bound the gradient bias for an approximate model. In a practical implementation of SoftTreeMax we utilize a parallel GPU-based simulator for fast and efficient tree expansion. Using this implementation in Atari, we show that SoftTreeMax reduces the gradient variance by three orders of magnitude. This leads to better sample complexity and improved performance compared to distributed PPO.
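
A toy numpy sketch of the SoftTreeMax idea as described: each root action's logit is the multi-step discounted reward accumulated along a depth-d expansion, topped with the logits of the reached states. The known transition model, the greedy in-tree expansion policy, and the state-logit parameterization are illustrative assumptions (the paper shows the expansion policy choice governs the variance decay).

```python
import numpy as np

rng = np.random.default_rng(0)
n_states, n_actions, gamma, depth = 5, 3, 0.99, 2
P = rng.dirichlet(np.ones(n_states), size=(n_states, n_actions))  # P[s, a] -> next-state dist
R = rng.standard_normal((n_states, n_actions))                    # r(s, a)
theta = rng.standard_normal(n_states)                             # state logits

def soft_tree_max(s):
    scores = np.zeros(n_actions)
    a_greedy = R.argmax(axis=1)                  # expansion policy inside the tree
    for a in range(n_actions):
        dist = P[s, a]                           # state distribution after (s, a)
        total = R[s, a]
        for k in range(1, depth):
            total += gamma ** k * dist @ R.max(axis=1)       # expected in-tree reward
            dist = dist @ P[np.arange(n_states), a_greedy]   # push distribution forward
        scores[a] = total + gamma ** depth * dist @ theta    # top with leaf logits
    e = np.exp(scores - scores.max())
    return e / e.sum()

print(soft_tree_max(s=0))  # tree-expanded policy over root actions
```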

219DLPO: Diffusion Model Loss-Guided Reinforcement Learning for Fine-Tuning Text-to-Speech Diffusion Models

[openreview] [pdf]

Abstract Recent advancements in generative models have sparked significant interest within the machine learning community. In particular, diffusion models have demonstrated remarkable capabilities in synthesizing images and speech. Studies such as those by Lee et al. (2023), Black et al. (2023), Wang et al. (2023), and Fan et al. (2024) illustrate that Reinforcement Learning with Human Feedback (RLHF) can enhance diffusion models for image synthesis. However, due to architectural differences between these models and those employed in speech synthesis, it remains uncertain whether RLHF could similarly benefit speech synthesis models. In this paper, we explore the practical application of RLHF to diffusion-based text-to-speech synthesis, leveraging the mean opinion score (MOS) as predicted by the UTokyo-SaruLab MOS prediction system (Saeki et al., 2022) as a proxy loss. We introduce diffusion model loss-guided RL policy optimization (DLPO) and compare it against other RLHF approaches, employing the NISQA speech quality and naturalness assessment model (Mittag et al., 2021) and human preference experiments for further evaluation. Our results show that RLHF can enhance diffusion-based text-to-speech synthesis models, and, moreover, DLPO can better improve diffusion models in generating natural and high-quality speech audio.

220Learning Loss Landscapes in Preference Optimization

[openreview] [pdf]

Abstract We present an empirical study investigating how specific properties of preference datasets, such as mixed-quality or noisy data, affect the performance of Preference Optimization (PO) algorithms. Our experiments, conducted in MuJoCo environments, reveal several scenarios where state-of-the-art PO methods experience significant drops in performance. To address this issue, we introduce a novel PO framework based on mirror descent, which can recover existing methods like Direct Preference Optimization (DPO) and Odds-Ratio Preference Optimization (ORPO) for specific choices of the mirror map. Within this framework, we employ evolutionary strategies to discover new loss functions capable of handling the identified problematic scenarios. These new loss functions lead to significant performance improvements over DPO and ORPO across several tasks. Additionally, we demonstrate the generalization capability of our approach by applying the discovered loss functions to fine-tuning large language models using mixed-quality data, where they outperform ORPO.

221Time Can Invalidate Algorithmic Recourse

[openreview] [pdf]

Abstract Algorithmic Recourse (AR) aims to provide users with actionable steps to overturn unfavourable decisions made by machine learning predictors. However, these actions often take time to implement (e.g., getting a degree can take years), and their effects may vary as the world evolves. Thus, it is natural to ask for recourse that remains valid in a dynamic environment. In this paper, we study the robustness of algorithmic recourse over time by casting the problem through the lens of causality. We demonstrate theoretically and empirically that (even robust) causal AR methods can fail over time except in the -- unlikely -- case that the world is stationary. Even more critically, unless the world is fully deterministic, counterfactual AR cannot be solved optimally. To account for this, we propose a simple yet effective algorithm for temporal AR that explicitly accounts for time. Our simulations on synthetic and realistic datasets show how considering time produces more resilient solutions to potential trends in the data distribution.

222Rethinking and Defending Protective Perturbation in Personalized Diffusion Models

[openreview] [pdf]

Abstract Personalized diffusion models (PDMs) have become prominent for adapting pretrained text-to-image models to generate images of specific subjects using minimal training data. However, PDMs are susceptible to minor adversarial perturbations, leading to significant degradation when fine-tuned on corrupted datasets. These vulnerabilities are exploited to create protective perturbations that prevent unauthorized image generation. Existing purification methods attempt to mitigate this issue but often over-purify images, resulting in information loss. In this work, we conduct an in-depth analysis of the fine-tuning process of PDMs through the lens of shortcut learning. We hypothesize and empirically demonstrate that adversarial perturbations induce a latent-space misalignment between images and their text prompts in the CLIP embedding space. This misalignment causes the model to erroneously associate noisy patterns with unique identifiers during fine-tuning, resulting in poor generalization. Based on these insights, we propose a systematic defense framework that includes data purification and contrastive decoupling learning. We first employ off-the-shelf image restoration techniques to realign images with their original semantic meanings in latent space. Then, we introduce contrastive decoupling learning with noise tokens to decouple the learning of personalized concepts from spurious noise patterns. Our study not only uncovers fundamental shortcut learning vulnerabilities in PDMs but also provides a comprehensive evaluation framework for developing stronger protection. Our extensive evaluation demonstrates its superiority over existing purification methods and stronger robustness against adaptive perturbation.

223Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning

[openreview] [pdf]

Abstract Recovering a spectrum of diverse policies from a set of expert trajectories is an important research topic in imitation learning. After determining a latent style for a trajectory, previous methods for recovering diverse policies usually employ a vanilla behavioral cloning objective conditioned on the latent style, treating each state-action pair in the trajectory with equal importance. Based on the observation that in many scenarios behavioral styles are often highly relevant to only a subset of state-action pairs, this paper presents a new principled method for recovering diverse policies. In particular, after inferring or assigning a latent style for a trajectory, we enhance vanilla behavioral cloning with a weighting mechanism based on pointwise mutual information. This weighting reflects the significance of each state-action pair’s contribution to learning the style, allowing our method to focus on the state-action pairs most representative of that style. We provide theoretical justifications for our new objective, and extensive empirical evaluations confirm the effectiveness of our method in recovering diverse policies from expert data.
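
A minimal sketch of pointwise-mutual-information weighting for style-conditioned behavioral cloning: each (s, a) pair is weighted by PMI(a; z | s) = log p(a | s, z) - log p(a | s), so pairs indicative of the latent style z dominate the loss. The clamping of negative PMI and the availability of both density models are our assumptions.

```python
import torch
import torch.nn.functional as F

def pmi_weighted_bc_loss(policy_logits, actions, logp_a_given_sz, logp_a_given_s):
    """policy_logits: (B, A) logits of the style-conditioned policy.
    actions: (B,) expert actions; the two logp tensors are (B,) scores of the
    taken action under p(a|s,z) and the style-marginal p(a|s)."""
    pmi = logp_a_given_sz - logp_a_given_s          # pointwise mutual information
    weights = torch.clamp(pmi, min=0.0)             # keep only style-relevant pairs
    nll = F.cross_entropy(policy_logits, actions, reduction="none")
    return (weights * nll).mean()

# Toy usage with random tensors standing in for model outputs.
B, A = 8, 4
loss = pmi_weighted_bc_loss(torch.randn(B, A), torch.randint(0, A, (B,)),
                            torch.randn(B), torch.randn(B))
print(loss.item())
```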

224Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering

[openreview] [pdf]

Abstract Recent empirical studies have demonstrated that diffusion models can effectively learn the image distribution and generate new samples. Remarkably, these models can achieve this even with a small number of training samples despite a large image dimension, circumventing the curse of dimensionality. In this work, we provide theoretical insights into this phenomenon by leveraging key empirical observations: (i) the low intrinsic dimensionality of image data, (ii) a union of manifold structure of image data, and (iii) the low-rank property of the denoising autoencoder in trained diffusion models. These observations motivate us to assume the underlying data distribution of image data as a mixture of low-rank Gaussians and to parameterize the denoising autoencoder as a low-rank model according to the score function of the assumed distribution. With these setups, we rigorously show that optimizing the training loss of diffusion models is equivalent to solving the canonical subspace clustering problem over the training samples. Based on this equivalence, we further show that the minimal number of samples required to learn the underlying distribution scales linearly with the intrinsic dimensions under the above data and model assumptions. This insight sheds light on why diffusion models can break the curse of dimensionality and exhibit the phase transition in learning distributions. Moreover, we empirically establish a correspondence between the subspaces and the semantic representations of image data, facilitating image editing. We validate these results with corroborated experimental results on both simulated distributions and image datasets.

225FairGen: Controlling Fair Generations in Diffusion Models via Adaptive Latent Guidance

[openreview] [pdf]

Abstract Diffusion models have shown remarkable proficiency in generating photorealistic images, but their outputs often exhibit biases toward specific social groups, raising ethical concerns and limiting their wider adoption. This paper tackles the challenge of mitigating generative bias in diffusion models while maintaining image quality. We propose FairGen, an adaptive latent guidance mechanism enhanced by an auxiliary memory module, which operates during inference to control the generation distribution at a desired level. The latent guidance module dynamically adjusts the direction in the latent space to influence specific attributes, while the memory module tracks prior generation statistics and steers the scalar direction to align with the target distribution. To evaluate FairGen comprehensively, we introduce a bias evaluation benchmark tailored for diffusion models, spanning diverse domains such as employment, education, finance, and healthcare, along with complex user-generated prompts. Extensive empirical evaluations demonstrate that FairGen outperforms existing bias mitigation approaches, achieving substantial bias reduction while preserving generation quality. Furthermore, FairGen offers precise and flexible control over various target distributions, enabling nuanced adjustments to the generative process.

226Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities

[openreview] [pdf]

Abstract Selecting appropriate training data is crucial for successful supervised instruction fine-tuning (SFT), which aims to (1) elicit strong capabilities from pretrained large language models (LLMs), and (2) achieve balanced performance across a diverse range of tasks. Algorithms based on influence estimation have shown promise in achieving (1) through estimating the contribution of each training example to the model’s prediction on a downstream task, but often struggle with (2). Through systematic experiments, we attribute their underperformance to an inherent bias---certain tasks intrinsically have greater influence than others. Directly comparing influence scores across different tasks would thus bias the selected data towards these tasks, hurting the LM’s performance not only on other capabilities, but also, surprisingly, on the tasks for which the selected data has high influence. We propose BIDS, a novel Data Selection algorithm that targets Influential data in a Balanced way, to address this issue. Aiming to address the biased influence, BIDS first normalizes influence scores of the training data with respect to each downstream task at an instance level. BIDS then applies an iterative optimization process to further balance the selection of influential training data. At each step, BIDS selects the training example that bears the highest influence on the capability most underrepresented by the currently selected data. Experimental results demonstrate that BIDS consistently outperforms state-of-the-art influence-based data selection algorithms under various budgets. Remarkably, training on a 15% subset selected by BIDS can even outperform full-dataset training with a much more balanced distribution of downstream performance. Our analysis further highlights the importance of both instance-level normalization and iterative optimization of selected data for balanced learning of diverse capabilities.
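
A hedged sketch of the two-step BIDS recipe as described: z-score normalization of influence per task, then greedy selection for whichever capability is currently most underrepresented. The score matrix, budget, and coverage update rule are toy stand-ins.

```python
import numpy as np

rng = np.random.default_rng(0)
n_examples, n_tasks, budget = 100, 5, 20
influence = rng.standard_normal((n_examples, n_tasks))  # example x task scores

# Instance-level normalization per task (z-scores within each task column).
z = (influence - influence.mean(axis=0)) / influence.std(axis=0)

selected, covered = [], np.zeros(n_tasks)
available = set(range(n_examples))
for _ in range(budget):
    t = covered.argmin()                        # most underrepresented capability
    best = max(available, key=lambda i: z[i, t])
    selected.append(best)
    available.remove(best)
    covered[t] += max(z[best, t], 0.0)          # update coverage for that task
print(sorted(selected))
```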

227Towards Adaptive Time Series Foundation Models Against Distribution Shift

[openreview] [pdf]

Abstract Foundation models have demonstrated remarkable success across diverse machine-learning domains through large-scale pretraining. However, their application to time series data poses challenges due to substantial mismatches in the distributions of pretraining datasets. In this paper, we tackle this issue by proposing a domain-aware adaptive normalization strategy within the Transformer architecture. Specifically, we replace the traditional LayerNorm with a prototype-guided dynamic normalization mechanism, where learned prototypes represent distinct data distributions, and sample-to-prototype similarity determines the appropriate normalization layer. This approach effectively captures the diverse characteristics of time series data, ensuring better alignment between pretrained representations and downstream tasks. Our method significantly improves fine-tuning performance, outperforming vanilla pretraining techniques and reducing the negative impact of distribution shifts. Extensive experiments on various real-world time series datasets demonstrate the efficacy of our approach, paving the way for more robust and generalizable time series foundation models.
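
A hedged sketch of prototype-guided dynamic normalization: a bank of LayerNorms, one per learned prototype, mixed per sample according to similarity between the sample representation and the prototypes. The soft routing rule and the mean-pooled sample summary are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PrototypeNorm(nn.Module):
    def __init__(self, dim: int, n_prototypes: int = 4):
        super().__init__()
        self.prototypes = nn.Parameter(torch.randn(n_prototypes, dim))
        self.norms = nn.ModuleList([nn.LayerNorm(dim) for _ in range(n_prototypes)])

    def forward(self, x):                        # x: (batch, seq, dim)
        summary = x.mean(dim=1)                  # per-sample representation
        sim = F.cosine_similarity(summary[:, None, :], self.prototypes[None], dim=-1)
        w = sim.softmax(dim=-1)                  # (batch, n_prototypes) routing weights
        outs = torch.stack([norm(x) for norm in self.norms], dim=1)
        return (w[:, :, None, None] * outs).sum(dim=1)

pn = PrototypeNorm(dim=16)
print(pn(torch.randn(2, 10, 16)).shape)          # (2, 10, 16)
```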

228Leveraging Diffusion Transformers for Stock Factor Augmentation in Financial Markets

[openreview] [pdf]

Abstract Data scarcity poses a significant challenge in training machine learning models for stock forecasting, often leading to low signal-to-noise ratio (SNR) and data homogeneity that degrade model performance. To address these issues, we introduce DiffsFormer, a novel approach utilizing artificial intelligence-generated samples (AIGS) with a Transformer-based Diffusion Model. Initially trained on a large-scale source domain with conditional guidance to capture global joint distribution, DiffsFormer augments training by editing existing samples for specific downstream tasks, allowing control over the deviation of generated data from the target domain. We evaluate DiffsFormer on the CSI300 and CSI800 datasets using eight commonly used machine learning models, achieving relative improvements of 7.3% and 22.1% in annualized return ratio, respectively. Extensive experiments provide insights into DiffsFormer’s functionality and its components, illustrating their role in mitigating data scarcity and enhancing model performance. Our findings demonstrate the potential of AIGS and DiffsFormer in addressing data limitations in stock forecasting, with the ability to generate realistic stock factors and control the editing process. These results validate our approach and contribute to a deeper understanding of its underlying mechanisms.

229Transformers Struggle to Learn to Search Without In-context Exploration

[openreview] [pdf]

Abstract Search is an ability fundamental to many important tasks, and recent studies have shown that large language models (LLMs) struggle to perform search robustly. It is unknown whether this inability is due to a lack of data, insufficient model parameters, or fundamental limitations of the transformer architecture. In this work, we use graph connectivity as a testbed to generate effectively limitless high-coverage data to train small transformers and test whether they can learn to perform search. We find that, under specific conditions on the training distribution, the transformer is able to learn to search. We analyze the algorithm that the transformer has learned through a novel mechanistic interpretability technique that enables us to extract the computation graph from the trained model. We find that for each vertex in the input graph, transformers compute the set of vertices reachable from that vertex. Each layer then progressively expands these sets, allowing the model to search over a number of vertices exponential in the number of layers. However, we find that as the input graph size increases, the transformer has greater difficulty in learning the task. This difficulty is not resolved even as the number of parameters is increased, suggesting that simply increasing the scale of LLMs will not lead to robust search abilities. Finally, we show that by loosening the task to allow the model to explore the graph in-context, allowing the model to visit vertices that do not necessarily lead to the goal and backtrack, the transformer is able to more easily learn to search robustly.

230Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel

[openreview] [pdf]

Abstract Creating high-quality data for training robust language-instructed agents is a long-lasting challenge in embodied AI. In this paper, we introduce a Self-Refining Data Flywheel (SRDF) that generates high-quality and large-scale navigational instruction-trajectory pairs by iteratively refining the data pool through the collaboration between two models, the instruction generator and the navigator, without any human-in-the-loop annotation. Specifically, SRDF starts with using a base generator to create an initial data pool for training a base navigator, followed by applying the trained strong navigator to filter the data pool. This leads to higher-fidelity data to train a better generator, which can, in turn, produce higher-quality data for training the next-round navigator. Such a flywheel establishes a data self-refining process, yielding a continuously improved and highly effective dataset for large-scale language-guided navigation learning. Our experiments demonstrate that after several flywheel rounds, the navigator elevates the performance boundary from 70% to 78% SPL on the classic R2R test set, surpassing human performance (76%) for the first time. Meanwhile, this process results in a superior instruction generator, as reflected by the improved SPICE from 23.5 to 25.7, better than all published approaches tailored for VLN instruction generation. Finally, we demonstrate the scalability of our method through increasing environment and instruction diversity, and the generalization ability of our pre-trained navigator across various downstream navigation tasks, surpassing state-of-the-art performance by a large margin in all cases. Code is uploaded as supplementary materials and all our data/code/models will also be publicly released.

231DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model

[openreview] [pdf]

Abstract We propose DOME, a diffusion-based world model that predicts future occupancy frames based on past occupancy observations. The ability of this world model to capture the evolution of the environment is crucial for planning in autonomous driving. Compared to 2D video-based world models, the occupancy world model utilizes a native 3D representation, which features easily obtainable annotations and is modality-agnostic. This flexibility has the potential to facilitate the development of more advanced world models. Existing occupancy world models either suffer from detail loss due to discrete tokenization or rely on simplistic diffusion architectures, leading to inefficiencies and difficulties in predicting future occupancy with controllability. Our DOME exhibits two key features: (1) High-Fidelity and Long-Duration Generation. We adopt a spatial-temporal diffusion transformer to predict future occupancy frames based on historical context. This architecture efficiently captures spatial-temporal information, enabling high-fidelity details and the ability to generate predictions over long durations. (2) Fine-grained Controllability. We address the challenge of controllability in predictions by introducing a trajectory resampling method, which significantly enhances the model’s ability to generate controlled predictions. Extensive experiments on the widely used nuScenes dataset demonstrate that our method surpasses existing baselines in both qualitative and quantitative evaluations, establishing a new state-of-the-art performance on nuScenes. Specifically, our approach surpasses the baseline by 10.5% in mIoU and 21.2% in IoU for occupancy reconstruction, and by 36.0% in mIoU and 24.6% in IoU for 4D occupancy forecasting.

232Improving Probabilistic Diffusion Models With Optimal Covariance Matching

[openreview] [pdf]

Abstract The probabilistic diffusion model has become highly effective across various domains. Typically, sampling from a diffusion model involves using a denoising distribution characterized by a Gaussian with a learned mean and either fixed or learned covariances. In this paper, we leverage the recently proposed covariance moment matching technique and introduce a novel method for learning the diagonal covariances. Unlike traditional data-driven covariance approximation approaches, our method involves directly regressing the optimal analytic covariance using a new, unbiased objective named Optimal Covariance Matching (OCM). This approach can significantly reduce the approximation error in covariance prediction. We demonstrate how our method can substantially enhance the sampling efficiency, recall rate and likelihood of both diffusion models and latent diffusion models.

233Learning Actionable Counterfactual Explanations in Large State Spaces

[openreview] [pdf]

Abstract An increasing number of high-stakes domains rely on machine learning to make decisions that have significant consequences for individuals, such as in loan approvals and college admissions. The black-box nature of these processes has led to a growing demand for solutions that make individuals aware of potential ways they could improve their qualifications. Counterfactual explanations (CFEs) are one form of feedback commonly used to provide insight into decision-making systems. Specifically, contemporary CFE generators provide explanations in the form of low-level CFEs whose constituent actions precisely describe how much a negatively classified individual should add or subtract from their input features to achieve the desired positive classification. However, low-level CFE generators have several shortcomings: they are hard to scale, often misaligned with real-world conditions, constrained by information access (e.g., they cannot query the classifier), and make inadequate use of available historical data. To address these challenges, we propose three data-driven CFE generators that create generalizable CFEs with desirable characteristics for individuals and decision-makers. Through extensive empirical experiments, we compare the proposed CFE generators with a low-level CFE generator on four real-world datasets (BRFSS, Foods, and two NHANES datasets), five semi-synthetic datasets, and five variants of fully-synthetic datasets. Our problem can also be seen as learning an optimal policy in a family of large but deterministic Markov decision processes.

234DDRL: A Diffusion-Driven Reinforcement Learning Approach for Enhanced TSP Solutions

[openreview] [pdf]

Abstract The Traveling Salesman Problem (TSP) is a fundamental challenge in combinatorial optimization, known for its NP-hard complexity. Reinforcement Learning (RL) has proven effective in managing larger and more complex TSP instances, yet it encounters challenges such as training instability and the need for substantial training resources. Diffusion models, known for iteratively refining noisy inputs to generate high-quality solutions, offer scalability and exploration capabilities for TSP but may struggle with optimality in complex cases and require large, resource-intensive training datasets. To address these limitations, we propose DDRL (Diffusion-Driven Reinforcement Learning), which integrates diffusion models with RL. DDRL employs a latent vector to generate an adjacency matrix, merging image and graph learning within a unified RL framework. By utilizing a pre-trained diffusion model as a prior, DDRL exhibits strong scalability and enhanced convergence stability. We also provide theoretical analysis showing that training DDRL aligns with the diffusion policy gradient in the process of solving the TSP, demonstrating its effectiveness. Additionally, we introduce novel constraint datasets—obstacle, path, and cluster constraints—to evaluate DDRL’s generalization capabilities. We demonstrate that DDRL offers a robust solution that outperforms existing methods on both basic and constrained TSP problems.

235Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies

[openreview] [pdf]

Abstract We propose ClassroomKD, a novel multi-mentor knowledge distillation framework inspired by classroom environments to enhance knowledge transfer between a student and multiple mentors. Unlike traditional methods that rely on fixed mentor-student relationships, our framework dynamically selects and adapts the teaching strategies of diverse mentors based on their effectiveness for each data sample. ClassroomKD comprises two main modules: the Knowledge Filtering (KF) Module and the Mentoring Module. The KF Module dynamically ranks mentors based on their performance for each input, activating only high-quality mentors to minimize error accumulation and prevent information loss. The Mentoring Module adjusts the distillation strategy by tuning each mentor’s influence according to the performance gap between the student and mentors, effectively modulating the learning pace. Extensive experiments on image classification (CIFAR-100 and ImageNet) and 2D human pose estimation (COCO Keypoints and MPII Human Pose) demonstrate that ClassroomKD outperforms existing knowledge distillation methods for different network architectures. Our results highlight that a dynamic and adaptive approach to mentor selection and guidance leads to more effective knowledge transfer, paving the way for enhanced model performance through distillation.
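
A hedged sketch of the two modules as described: per sample, mentors are kept only if they outperform the student (Knowledge Filtering), and each kept mentor's KD term is weighted by the student-mentor performance gap (Mentoring). The confidence-based ranking, gap weighting, and temperature are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def classroom_kd_loss(student_logits, mentor_logits_list, labels, T=4.0):
    ce = F.cross_entropy(student_logits, labels)
    s_conf = student_logits.softmax(-1).gather(1, labels[:, None]).squeeze(1)
    kd_terms = []
    for m_logits in mentor_logits_list:
        m_conf = m_logits.softmax(-1).gather(1, labels[:, None]).squeeze(1)
        keep = (m_conf > s_conf).float()              # filter weak mentors per sample
        gap = (m_conf - s_conf).clamp(min=0.0)        # pace: larger gap, more weight
        kl = F.kl_div(F.log_softmax(student_logits / T, -1),
                      F.softmax(m_logits / T, -1),
                      reduction="none").sum(-1) * T * T
        kd_terms.append((keep * gap * kl).mean())
    return ce + torch.stack(kd_terms).sum()

B, C = 8, 10
loss = classroom_kd_loss(torch.randn(B, C),
                         [torch.randn(B, C) for _ in range(3)],
                         torch.randint(0, C, (B,)))
print(loss.item())
```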

236Prompt Optimization with Logged Bandit Data

[openreview] [pdf]

Abstract We study how to use naturally available user feedback, such as clicks, to optimize large language model (LLM) pipelines for generating personalized sentences using prompts. Naive approaches, which estimate the policy gradient in the prompt space, suffer either from variance caused by the large action space of prompts or bias caused by inaccurate reward predictions. To circumvent these challenges, we propose Direct Sentence Off-policy gradient (DSO), which estimates the policy gradient by leveraging similarity among generated sentences, substantially reducing variance while suppressing the bias. Empirical results on our newly established suite of benchmarks, called OfflinePrompts, demonstrate the effectiveness of the proposed approach in generating personalized descriptions for movie recommendations, particularly when the number of candidate prompts is large.

237Exploring the Design Space of Diffusion Bridge Models via Stochasticity Control

[openreview] [pdf]

Abstract Diffusion bridge models effectively facilitate image-to-image (I2I) translation by connecting two distributions. However, existing methods overlook the impact of noise in the sampling SDEs, the transition kernel, and the base distribution on sampling efficiency, image quality and diversity. To address this gap, we propose the Stochasticity-controlled Diffusion Bridge (SDB), a novel theoretical framework that extends the design space of diffusion bridges and provides strategies to mitigate singularities during both training and sampling. By controlling stochasticity in the sampling SDEs, our sampler achieves speeds up to $5\times$ faster than the baseline, while also producing lower FID scores. After training, SDB sets new benchmarks in image quality and sampling efficiency by managing stochasticity within the transition kernel. Furthermore, introducing stochasticity into the base distribution significantly improves image diversity, as quantified by a newly introduced metric.

238Consistency Model is an Effective Posterior Sample Approximation for Diffusion Inverse Solvers

[openreview] [pdf]

Abstract Diffusion Inverse Solvers (DIS) are designed to sample from the conditional distribution $p_{\theta}(X_0|y)$, with a pre-trained diffusion model $p_{\theta}(X_0)$, an operator $f(\cdot)$, and a measurement $y=f(x_0')$ derived from an unknown image $x_0'$. Existing DIS estimate the conditional score function by evaluating $f(\cdot)$ with an approximated posterior sample drawn from $p_{\theta}(X_0|X_t)$. However, most prior approximations rely on posterior means, which may not lie in the support of the image distribution and thus diverge from the appearance of genuine images. Such out-of-support samples may significantly degrade the performance of the operator $f(\cdot)$, particularly when it is a neural network. In this paper, we introduce a novel approach for posterior approximation that guarantees to generate valid samples within the support of the image distribution, and also enhances compatibility with neural network-based operators $f(\cdot)$. We first demonstrate that the solution of the Probability Flow Ordinary Differential Equation (PF-ODE) with an initial value $x_t$ yields an effective posterior sample from $p_{\theta}(X_0|X_t=x_t)$ with high probability. Based on this observation, we adopt the Consistency Model (CM), which is distilled from the PF-ODE, for posterior sampling. Through extensive experiments, we show that our proposed method for posterior sample approximation substantially enhances the effectiveness of DIS for neural network operators $f(\cdot)$ (e.g., in semantic segmentation). The source code is provided in the supplementary material.

239Distilling the Knowledge in Data Pruning

[openreview] [pdf]

Abstract With the increasing size of datasets used for training neural networks, data pruning has gained traction in recent years. However, most current data pruning algorithms are limited in their ability to preserve accuracy compared to models trained on the full data, especially in high pruning regimes. In this paper we explore the application of data pruning while incorporating knowledge distillation (KD) when training on a pruned subset. That is, rather than relying solely on ground-truth labels, we also use the soft predictions from a teacher network pre-trained on the complete data. By integrating KD into training, we demonstrate significant improvement across datasets, pruning methods, and on all pruning fractions. We first establish a theoretical motivation for employing self-distillation to improve training on pruned data. Then, we empirically make a compelling and highly practical observation: using KD, simple random pruning is comparable or superior to sophisticated pruning methods across all pruning regimes. On ImageNet for example, we achieve superior accuracy despite training on a random subset of only 50% of the data. Additionally, we demonstrate a crucial connection between the pruning factor and the optimal knowledge distillation weight. This helps mitigate the impact of samples with noisy labels and low-quality images retained by typical pruning algorithms. Finally, we make an intriguing observation: when using lower pruning fractions, larger teachers lead to accuracy degradation, while surprisingly, employing teachers with a smaller capacity than the student’s may improve results. Our code will be made available.
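
A minimal sketch of the training objective studied here: cross-entropy on the pruned subset blended with distillation from a teacher pretrained on the full data. The paper connects the optimal KD weight to the pruning factor; the linear schedule below is our illustrative assumption of that connection, not the paper's exact rule.

```python
import torch
import torch.nn.functional as F

def pruned_kd_loss(student_logits, teacher_logits, labels, keep_fraction, T=2.0):
    alpha = 1.0 - keep_fraction          # assumed schedule: prune more, distill more
    ce = F.cross_entropy(student_logits, labels)
    kd = F.kl_div(F.log_softmax(student_logits / T, dim=-1),
                  F.softmax(teacher_logits / T, dim=-1),
                  reduction="batchmean") * T * T
    return (1 - alpha) * ce + alpha * kd

B, C = 16, 100
loss = pruned_kd_loss(torch.randn(B, C), torch.randn(B, C),
                      torch.randint(0, C, (B,)), keep_fraction=0.5)
print(loss.item())
```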

240Continuous Ensemble Weather Forecasting with Diffusion models

[openreview] [pdf]

Abstract Weather forecasting has seen a shift in methods from numerical simulations to data-driven systems. While initial research in the area focused on deterministic forecasting, recent works have used diffusion models to produce skillful ensemble forecasts. These models are trained on a single forecasting step and rolled out autoregressively. However, they are computationally expensive and accumulate errors for high temporal resolution due to the many rollout steps. We address these limitations with Continuous Ensemble Forecasting, a novel and flexible method for sampling ensemble forecasts in diffusion models. The method can generate temporally consistent ensemble trajectories completely in parallel, with no autoregressive steps. Continuous Ensemble Forecasting can also be combined with autoregressive rollouts to yield forecasts at an arbitrary fine temporal resolution without sacrificing accuracy. We demonstrate that the method achieves competitive results for global weather forecasting with good probabilistic properties.

241Mitigating Goal Misgeneralization via Minimax Regret

[openreview] [pdf]

Abstract Robustness research in reinforcement learning often focuses on ensuring that the policy consistently exhibits capable, goal-driven behavior. However, not every capable behavior is the intended behavior.Goal misgeneralizationcan occur when the policy generalizes capably with respect to a ‘proxy goal’ whose optimal behavior correlates with the intended goal on the training distribution, but not out of distribution. Though the intended goal would be ambiguous if they were perfectly correlated in training, we show progress can be made if the goals are onlynearly ambiguous, with the training distribution containing a small proportion ofdisambiguatinglevels. We observe that the training signal from disambiguating levels could be amplified by regret-based prioritization. We formally show that approximately optimal policies on maximal-regret levels avoid the harmful effects of goal misgeneralization, which may exist without this prioritization. Empirically, we find that current regret-based Unsupervised Environment Design (UED) methods can mitigate the effects of goal misgeneralization, though do not always entirely eliminate it. Our theoretical and empirical results show that as UED methods improve they could further mitigate goal misgeneralization in practice.

242A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training

[openreview] [pdf]

Abstract Training diffusion models is always a computation-intensive task. In this paper, we introduce a novel speed-up method for diffusion model training, called SpeeD, which is based on a closer look at time steps. Our key findings are: i) Time steps can be empirically divided into acceleration, deceleration, and convergence areas based on the process increment. ii) These time steps are imbalanced, with many concentrated in the convergence area. iii) The concentrated steps provide limited benefits for diffusion training. To address this, we design an asymmetric sampling strategy that reduces the frequency of steps from the convergence area while increasing the sampling probability for steps from other areas. Additionally, we propose a weighting strategy to emphasize the importance of time steps with rapid-change process increments. As a plug-and-play and architecture-agnostic approach, SpeeD consistently achieves 3-times acceleration across various diffusion architectures, datasets, and tasks. Notably, due to its simple design, our approach significantly reduces the cost of diffusion model training with minimal overhead. Our research enables more researchers to train diffusion models at a lower cost.
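
A hedged sketch of the asymmetric sampling and weighting described above: steps in the (large) convergence area are sampled less often, and a per-step loss weight emphasizes rapid-change increments. The area boundary and weight values are illustrative assumptions.

```python
import torch

T = 1000
convergence_start = 600                  # assumed boundary of the convergence area

probs = torch.ones(T)
probs[convergence_start:] = 0.2          # suppress concentrated, low-value steps
probs /= probs.sum()

def sample_timesteps(batch_size):
    # Asymmetric sampling: draw training time steps from the skewed distribution.
    return torch.multinomial(probs, batch_size, replacement=True)

def loss_weight(t):
    # Emphasize steps with rapid-change process increments (toy weight values).
    return torch.where(t < convergence_start, torch.tensor(1.5), torch.tensor(1.0))

t = sample_timesteps(8)
print(t, loss_weight(t))
```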

243DiffMove: Human Trajectory Recovery via Conditional Diffusion Model

[openreview] [pdf]

Abstract Recovering human trajectories from incomplete or missing data is crucial for many mobility-based urban applications, e.g., urban planning, transportation, and location-based services. Existing methods mainly rely on recurrent neural networks or attention mechanisms. Though promising, they encounter limitations in capturing complex spatial-temporal dependencies in low-sampling trajectories. Recently, diffusion models have shown potential in content generation. However, most of the proposed methods generate content in continuous numerical representations, which cannot be directly adapted to human location trajectory recovery. In this paper, we introduce a conditional diffusion-based trajectory recovery method, namely DiffMove. It first transforms locations in trajectories into the embedding space, in which the embedding denoising is performed, and then missing locations are recovered by an embedding decoder. DiffMove not only improves accuracy by introducing high-quality generative methods into trajectory recovery, but also carefully models the transition, periodicity, and temporal patterns in human mobility. Extensive experiments based on two representative real-world mobility datasets are conducted, and the results show significant improvements (an average of 11% in recall) over the baselines.

244Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective

[openreview] [pdf]

Abstract Direct Preference Optimization (DPO) has gained attention as an efficient alternative to reinforcement learning from human feedback (RLHF) for aligning large language models (LLMs) with human preferences. Despite its advantages, DPO suffers from a length bias, generating responses longer than those from the reference model. Existing solutions like SimPO and SamPO address this issue but uniformly treat the contribution of rewards across sequences, overlooking temporal dynamics. To this end, we propose an enhanced preference optimization method that incorporates a temporal decay factor controlled by a gamma parameter. This dynamic weighting mechanism adjusts the influence of each reward based on its position in the sequence, prioritizing earlier tokens that are more critical for alignment. By adaptively focusing on more relevant feedback, our approach mitigates overfitting to less pertinent data and remains responsive to evolving human preferences. Experimental results on several benchmarks show that our approach consistently outperforms vanilla DPO by 5.9-8.8 points on AlpacaEval 2 and 3.3-9.7 points on Arena-Hard across different model architectures and sizes.
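
A small sketch of the temporal-decay weighting, under the assumption that per-token implicit rewards are scaled by gamma raised to the token position before being summed into a DPO-style loss; the paper's exact parameterization may differ.

```python
import torch
import torch.nn.functional as F

def decayed_reward(logp_policy, logp_ref, gamma=0.95):
    # logp_*: (seq_len,) per-token log-probs of one response
    rewards = logp_policy - logp_ref                      # implicit per-token rewards
    decay = gamma ** torch.arange(rewards.numel(), dtype=rewards.dtype)
    return (decay * rewards).sum()                        # earlier tokens weigh more

def decayed_dpo_loss(lp_chosen, lr_chosen, lp_rejected, lr_rejected, beta=0.1):
    margin = decayed_reward(lp_chosen, lr_chosen) - decayed_reward(lp_rejected, lr_rejected)
    return -F.logsigmoid(beta * margin)
```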

245Tighter Performance Theory of FedExProx

[openreview] [pdf]

Abstract We revisit FedExProx -- a recently proposed distributed optimization method designed to enhance convergence properties of parallel proximal algorithms via extrapolation. In the process, we uncover a surprising flaw: its known theoretical guarantees on quadratic optimization tasks are no better than those offered by the vanilla Gradient Descent (GD) method. Motivated by this observation, we develop a novel analysis framework, establishing a tighter linear convergence rate for non-strongly convex quadratic problems. By incorporating both computation and communication costs, we demonstrate that FedExProx can indeed provably outperform GD, in stark contrast to the original analysis. Furthermore, we consider partial participation scenarios and analyze two adaptive extrapolation strategies --- based on gradient diversity and Polyak stepsizes --- again significantly outperforming previous results. Moving beyond quadratics, we extend the applicability of our analysis to general functions satisfying the Polyak-Łojasiewicz condition, outperforming the previous strongly convex analysis while operating under weaker assumptions. Backed by empirical results, our findings point to a new and stronger potential of FedExProx, paving the way for further exploration of the benefits of extrapolation in federated learning.
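
A self-contained toy of the extrapolated parallel proximal update on quadratic workers, in the spirit of the setting above; the random quadratics, the step size, and the fixed extrapolation parameter `alpha` are illustrative assumptions (the paper also analyzes adaptive rules based on gradient diversity and Polyak stepsizes).

```python
import numpy as np

def prox(x, A, b, gamma):
    # argmin_z 0.5*z^T A z - b^T z + ||z - x||^2 / (2*gamma)
    return np.linalg.solve(A + np.eye(len(x)) / gamma, b + x / gamma)

rng = np.random.default_rng(0)
workers = [(np.diag(rng.uniform(0.5, 2.0, 3)), rng.normal(size=3)) for _ in range(5)]

x, gamma, alpha = np.zeros(3), 1.0, 1.5        # alpha > 1 extrapolates past the average
for _ in range(100):
    avg = np.mean([prox(x, A, b, gamma) for A, b in workers], axis=0)
    x = x + alpha * (avg - x)                  # FedExProx-style extrapolation step
print(x)  # converges to the fixed point of the averaged proximal operator
```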

246Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms

[openreview] [pdf]

Abstract We study the challenging exploration incentive problem in both bandit and reinforcement learning, where the rewards are scale-free and potentially unbounded, a setting driven by real-world scenarios and differing from existing work. Past works in reinforcement learning either assume costly interactions with an environment or propose algorithms that find potentially low-quality local maxima. Motivated by EXP-type methods that integrate multiple agents (experts) for exploration in bandits under the assumption that rewards are bounded, we propose new algorithms, namely EXP4.P and EXP4-RL, for exploration in the unbounded reward case, and demonstrate their effectiveness in these new settings. Unbounded rewards introduce challenges as the regret cannot be limited by the number of trials, and selecting suboptimal arms may lead to infinite regret. Specifically, we establish EXP4.P’s regret upper bounds in both bounded and unbounded linear and stochastic contextual bandits. Surprisingly, we also find that by including one sufficiently competent expert, EXP4.P can achieve global optimality in the linear case. This unbounded reward result is also applicable to a revised version of EXP3.P in the Multi-armed Bandit scenario. In EXP4-RL, we extend EXP4.P from bandit scenarios to reinforcement learning to incentivize exploration by multiple agents, including one high-performing agent, for both efficiency and excellence. This algorithm has been tested on difficult-to-explore games and shows significant improvements in exploration compared to the state-of-the-art.

247d-Linear Generation Error Bound for Distributed Diffusion Models

[openreview] [pdf]

Abstract The recent rise of distributed diffusion models has been driven by the explosive growth of data and the increasing demand for data generation. However, distributed diffusion models face unique challenges in resource-constrained environments. Existing approaches lack theoretical support, particularly with respect to generation error in such settings. In this paper, we are the first to derive the generation error bound for distributed diffusion models with arbitrary pruning, not assuming perfect score approximation. By analyzing the convergence of the score estimation model trained with arbitrary pruning in a distributed manner, we highlight the impact of complex factors such as model evolution dynamics and arbitrary pruning on the generation performance. This theoretical generation error bound is linear in the data dimension $d$, aligning with state-of-the-art results in the single-worker paradigm.

248Dual Caption Preference Optimization for Diffusion Models

[openreview] [pdf]

Abstract Recent advancements in human preference optimization, originally developed for Large Language Models (LLMs), have shown significant potential in improving text-to-image diffusion models. These methods aim to learn the distribution of preferred samples while distinguishing them from less preferred ones. However, existing preference datasets often exhibit overlap between these distributions, leading to a conflict distribution. Additionally, we identified a performance issue in previous optimization methods, where using the same prompt for preferred and less preferred images, known as the irrelevant prompt issue, restricts model performance. To address these challenges, we propose Dual Caption Preference Optimization (DCPO), a novel approach that utilizes two distinct captions to mitigate irrelevant prompts. To tackle the conflict distribution, we introduce the Pick-Double Caption dataset, a modified version of Pick-a-Pic v2 with separate captions for preferred and less preferred images. We further propose three different strategies for generating distinct captions: captioning, perturbation, and hybrid methods. Our experiments show that DCPO significantly improves image quality and relevance to prompts, outperforming Stable Diffusion (SD) 2.1, SFT-Chosen, Diffusion-DPO, and MaPO, all fine-tuned on SD 2.1 as the backbone, across multiple metrics including Pickscore, HPSv2.1, GenEval, CLIPscore, and ImageReward.

249Enhancing Adversarial Transferability Through Exploiting Multiple Randomized Trajectories for Better Global Guidance

[openreview] [pdf]

Abstract Deep neural networks are well-known for their vulnerability to adversarial examples, particularly demonstrating poor performance in white-box attack settings. However, most white-box attack methods heavily depend on the target model and often get trapped in local optima, leading to limited adversarial transferability. Techniques such as momentum, variance reduction, and gradient penalty mitigate overfitting by combining historical information with local regions around adversarial examples, but exploration of the global loss landscape remains constrained, hindering further performance improvements. In this work, we find that initialization influences the optimization of adversarial examples, often guiding them toward multiple local optima, providing an opportunity to explore the loss landscape more effectively. Based on this insight, we propose two strategies: randomized global initialization and dual examples. These strategies utilize multiple trajectories from benign samples to capture global optimization directions, enhancing adversarial transferability. Our approach integrates seamlessly with existing adversarial attack methods and significantly improves transferability, as demonstrated by empirical evaluations on the standard ImageNet dataset.

250Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets

[openreview] [pdf]

Abstract Generative Flow Networks (GFlowNets), a new family of probabilistic samplers, have demonstrated remarkable capabilities to generate diverse sets of high-reward candidates, in contrast to standard return maximization approaches (e.g., reinforcement learning) which often converge to a single optimal solution. Recent works have focused on developing goal-conditioned GFlowNets, which aim to train a single GFlowNet capable of achieving different outcomes as the task specifies. However, training such models is challenging due to extremely sparse rewards, particularly in high-dimensional problems. Moreover, previous methods suffer from the limited coverage of explored trajectories during training, which presents more pronounced challenges when only offline data is available. In this work, we propose a novel method called Retrospective Backward Synthesis (RBS) to address these critical problems. Specifically, RBS synthesizes new backward trajectories in goal-conditioned GFlowNets to enrich training trajectories with enhanced quality and diversity, thereby introducing copious learnable signals for effectively tackling the sparse reward problem. Extensive empirical results show that our method improves sample efficiency by a large margin and outperforms strong baselines on various standard evaluation benchmarks.

251α-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs

[openreview] [pdf]

Abstract Aligning large language models (LLMs) with human values and intentions is crucial for their utility, honesty, and safety. Reinforcement learning from human feedback (RLHF) is a popular approach to achieve this alignment, but it faces challenges in computational efficiency and training stability. Recent methods like Direct Preference Optimization (DPO) and Simple Preference Optimization (SimPO) have proposed offline alternatives to RLHF, simplifying the process by reparameterizing the reward function. However, DPO depends on a potentially suboptimal reference model, and SimPO’s assumption of a fixed target reward margin may lead to suboptimal decisions in diverse data settings. In this work, we propose α-DPO, an adaptive preference optimization algorithm designed to address these limitations by introducing a dynamic reward margin. Specifically, α-DPO employs an adaptive preference distribution, balancing the policy model and the reference model to achieve personalized reward margins. We provide theoretical guarantees for α-DPO, demonstrating its effectiveness as a surrogate optimization objective and its ability to balance alignment and diversity through KL divergence control. Empirical evaluations on AlpacaEval 2 and Arena-Hard show that α-DPO consistently outperforms DPO and SimPO across various model settings, establishing it as a robust approach for fine-tuning LLMs. Our method achieves significant improvements in win rates, highlighting its potential as a powerful tool for LLM alignment.
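
A hedged sketch of a preference loss with an instance-dependent margin in the spirit of the abstract; the concrete margin in α-DPO comes from its adaptive preference distribution, so the reference-gap margin used here is only an illustrative stand-in.

```python
import torch
import torch.nn.functional as F

def adaptive_margin_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected,
                         beta=0.1, alpha=0.5):
    # pi_* / ref_*: summed log-probs under the policy / reference model
    preference = pi_chosen - pi_rejected           # policy's preference strength
    margin = alpha * (ref_chosen - ref_rejected)   # illustrative adaptive margin
    return -F.logsigmoid(beta * (preference - margin))
```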

252Soup to go: mitigating forgetting during continual learning with model averaging

[openreview] [pdf]

Abstract In continual learning with pretrained large language models (LLMs), where data from instruction fine-tuning (IFT) tasks arrives in a sequence, fine-tuning on later tasks will often lead to performance degradation on earlier tasks. This is especially pronounced when the IFT tasks come from diverse domains. In this setting, how can we mitigate catastrophic forgetting of earlier tasks and retain what the LLM has learned? Inspired by a classical continual learning method, the L2 penalty on previous weights, we propose Sequential Fine-tuning with Averaging (SFA), a method that merges the model with earlier checkpoints trained on previous tasks during the course of training. SOTA approaches typically maintain a data buffer of past tasks or impose a penalty at each gradient step. However, our method achieves comparable results without the need to store past data or keep multiple copies of parameters for each gradient step. Furthermore, our method outperforms penalty methods like L2 regression and EWC, as well as other common merging techniques such as Task Arithmetic and TIES Merging. Finally, we show that using our method, a single model can simultaneously perform well on a range of fine-tuning tasks in diverse domains, including Math, Law and Code.
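
A minimal sketch of the merging step, assuming SFA periodically interpolates the current weights with a checkpoint from earlier in training; the merge coefficient and frequency are illustrative choices.

```python
import torch

@torch.no_grad()
def average_with_checkpoint(model, checkpoint_state, lam=0.5):
    # Interpolate in parameter space between the current weights and an
    # earlier checkpoint; lam = 0.5 is a plain average.
    for name, param in model.named_parameters():
        param.mul_(1.0 - lam).add_(checkpoint_state[name], alpha=lam)
```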

253Adapting Prediction Sets to Distribution Shifts Without Labels

[openreview] [pdf]

Abstract Recently there has been a surge of interest in deploying confidence set predictions rather than point predictions. Unfortunately, the effectiveness of such prediction sets is frequently impaired by distribution shifts in practice, and the challenge is often compounded by the lack of ground truth labels at test time. In this paper, we present a method for improving the quality of output prediction sets using only unlabeled data from the test domain. This is achieved by two new methods called ECP and EACP, which sit on top of existing set-valued classification methods and adjust their intervals according to the base model’s own uncertainty evaluation on the unlabeled test data. Through extensive experiments on a number of large-scale datasets and neural network architectures, we show that our methods provide consistent improvement over existing conformal prediction based baselines and nearly match the performance of fully supervised methods.
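
A rough sketch of the general idea of widening conformal sets using only the base model's uncertainty on unlabeled test inputs; the entropy-gap correction below is an assumption for illustration, and the actual ECP/EACP adjustments differ in form.

```python
import numpy as np

def avg_entropy(probs):
    return -np.mean(np.sum(probs * np.log(probs + 1e-12), axis=1))

def shift_adapted_sets(cal_probs, cal_labels, test_probs, alpha=0.1):
    # Standard split-conformal quantile from labeled calibration data...
    scores = 1.0 - cal_probs[np.arange(len(cal_labels)), cal_labels]
    q = np.quantile(scores, 1.0 - alpha)
    # ...then widen it by the model's confidence drop on the unlabeled test set.
    q += max(0.0, avg_entropy(test_probs) - avg_entropy(cal_probs))
    return [np.flatnonzero(1.0 - p <= q) for p in test_probs]
```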

254Offline Safe Policy Optimization From Human Feedback

[openreview] [pdf]

Abstract Offline preference-based reinforcement learning (PbRL) learns rewards and policies aligned with human preferences without the need for extensive reward engineering and direct interaction with human annotators. However, ensuring safety remains a critical challenge across many domains and tasks. Previous works on safe RL from human feedback (RLHF) first learn reward and cost models from offline data, and then use constrained RL to optimize a safe policy. However, inaccuracies in the reward and cost learning can impair performance when used with constrained RL methods. To address these challenges, (a) we introduce a framework that learns a policy based on pairwise preferences regarding the agent’s behavior in terms of rewards, as well as binary labels indicating the safety of trajectory segments, without access to ground-truth rewards or costs; (b) we combine the preference learning module with safety alignment in a constrained optimization problem. This optimization problem is solved using a Lagrangian method that directly learns a reward-maximizing safe policy without explicitly learning reward and cost models, avoiding the need for constrained RL; (c) to evaluate our approach, we construct new datasets with synthetic human feedback, built upon a well-established offline safe RL benchmark. Empirically, our method successfully learns safe policies with high rewards, outperforming baselines with ground-truth reward and cost, as well as state-of-the-art RLHF approaches.

255Goal Achievement Guided Exploration: Mitigating Premature Convergence in Reinforcement Learning

[openreview] [pdf]

Abstract Premature convergence to suboptimal policies remains a significant challenge in reinforcement learning (RL), particularly in tasks with sparse rewards or non-convex reward landscapes. Existing work usually utilizes reward shaping, such as curiosity-based internal rewards, to encourage exploring promising spaces. However, this may inadvertently introduce new local optima and impair the optimization for the actual target reward. To address this issue, we propose Goal Achievement Guided Exploration (GAGE), a novel approach that incorporates an agent’s goal achievement as a dynamic criterion for balancing exploration and exploitation. GAGE adaptively adjusts the exploitation level based on the agent’s current performance relative to an estimated optimal performance, thereby mitigating premature convergence. Extensive evaluations demonstrate that GAGE substantially improves learning outcomes across various challenging tasks by adapting convergence based on task success. Applicable to both continuous and discrete tasks, GAGE seamlessly integrates into existing RL frameworks, highlighting its potential as a versatile tool for enhancing exploration strategies in RL.

256Elucidating the Preconditioning in Consistency Distillation

[openreview] [pdf]

Abstract Consistency distillation is a prevalent way for accelerating diffusion models adopted in consistency (trajectory) models, in which a student model is trained to traverse backward on the probability flow (PF) ordinary differential equation (ODE) trajectory determined by the teacher model. Preconditioning is a vital technique for stabilizing consistency distillation, by linearly combining the input data and the network output with pre-defined coefficients as the consistency function. It imposes the boundary condition of consistency functions without restricting the form and expressiveness of the neural network. However, previous preconditionings are hand-crafted and may be suboptimal choices. In this work, we offer the first theoretical insights into the preconditioning in consistency distillation, by elucidating its design criteria and the connection to the teacher ODE trajectory. Based on these analyses, we further propose a principled way dubbed Analytic-Precond to analytically optimize the preconditioning according to the consistency gap (defined as the gap between the teacher denoiser and the optimal student denoiser) on a generalized teacher ODE. We demonstrate that Analytic-Precond can facilitate the learning of trajectory jumpers, enhance the alignment of the student trajectory with the teacher’s, and achieve 2× to 3× training acceleration of consistency trajectory models in multi-step generation across various datasets.
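
For context, the hand-crafted preconditioning used in consistency models writes the consistency function as f(x, t) = c_skip(t) * x + c_out(t) * F(x, t), with c_skip(eps) = 1 and c_out(eps) = 0 enforcing the boundary condition. The sketch below uses the EDM-style coefficients common in this literature; these are the kind of pre-defined coefficients that Analytic-Precond would replace with analytically optimized ones.

```python
import torch

def c_skip(t, eps=0.002, sigma_data=0.5):
    return sigma_data**2 / ((t - eps) ** 2 + sigma_data**2)

def c_out(t, eps=0.002, sigma_data=0.5):
    return sigma_data * (t - eps) / torch.sqrt(t**2 + sigma_data**2)

def consistency_fn(F_theta, x, t):
    # Boundary condition: at t = eps, c_skip = 1 and c_out = 0, so f(x, eps) = x,
    # without restricting the network F_theta itself.
    return c_skip(t) * x + c_out(t) * F_theta(x, t)
```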

257Unleashing the Potential of Diffusion Models for Incomplete Data Imputation

[openreview] [pdf]

Abstract Generative models play an important role in missing data imputation in that they aim to learn the joint distribution of full data. However, applying advanced deep generative models (such as Diffusion models) to missing data imputation is challenging due to 1) the inherent incompleteness of the training data and 2) the difficulty in performing conditional inference from unconditional generative models. To deal with these challenges, this paper introduces DiffPuter, a tailored diffusion model combined with the Expectation-Maximization (EM) algorithm for missing data imputation. DiffPuter iteratively trains a diffusion model to learn the joint distribution of missing and observed data and performs an accurate conditional sampling to update the missing values using a tailored reversed sampling strategy. Our theoretical analysis shows that DiffPuter’s training step corresponds to the maximum likelihood estimation of data density (M-step), and its sampling step represents the Expected A Posteriori estimation of missing values (E-step). Extensive experiments across ten diverse datasets and comparisons with 17 different imputation methods demonstrate DiffPuter’s superior performance. Notably, DiffPuter achieves an average improvement of 8.10% in MAE and 5.64% in RMSE compared to the most competitive existing method.
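
A toy rendering of the alternation described above, with an independent Gaussian standing in for the diffusion model so the sketch stays self-contained: the M-step fits a density model to the current completed data, and the E-step resamples the missing entries from it while observed entries stay fixed.

```python
import numpy as np

def em_impute(x, observed_mask, n_rounds=10, seed=0):
    # x: (n, d) array with arbitrary initial values at missing positions;
    # observed_mask: True where the entry is observed.
    rng = np.random.default_rng(seed)
    x = x.copy()
    for _ in range(n_rounds):
        mu, sigma = x.mean(axis=0), x.std(axis=0)    # "M-step": fit density model
        draws = rng.normal(mu, sigma, size=x.shape)  # "E-step": sample missing values
        x[~observed_mask] = draws[~observed_mask]    # observed entries stay fixed
    return x
```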

258Exploring New Frontiers in Vertical Federated Learning: the Role of Saddle Point Reformulation

[openreview] [pdf]

Abstract Distributed learning problems have gained significant popularity due to the increasing need for cluster training and the emergence of novel paradigms like Federated Learning (FL). One variant of FL, called Vertical Federated Learning (VFL), partitions data based on features across devices. The objective is to collectively train a model using the information available on each user’s device. This paper focuses on solving the VFL problem using the saddle point reformulation via the classical Lagrangian function. We first demonstrate how this formulation can be solved using deterministic methods. More importantly, the paper explores various stochastic modifications to adapt to practical scenarios, such as employing compression techniques for efficient information transmission, enabling partial participation for asynchronous communication, and utilizing coordinate selection for faster local computation. We show that the saddle point reformulation plays a key role, opening up possibilities to use the aforementioned extensions that seem impossible in the standard minimization formulation. Convergence estimates are provided for each algorithm, demonstrating their effectiveness in addressing the VFL problem. Additionally, alternative reformulations of the VFL problem are investigated, and numerical experiments are conducted to validate the proposed methods’ performance and effectiveness.

259Interactive Adjustment for Human Trajectory Prediction with Individual Feedback

[openreview] [pdf]

Abstract Human trajectory prediction is fundamental for autonomous driving and service robots. The research community has studied various important aspects of this task and made remarkable progress recently. However, there is an essential perspective which has not been well exploited in previous research, namely individual feedback. Individual feedback exists in the sequential nature of trajectory prediction, where earlier predictions of a target can be verified over time against its ground-truth trajectories to obtain feedback, which provides valuable experience for subsequent predictions on the same agent. In this paper, we show such feedback can reveal the strengths and weaknesses of the model’s predictions on a specific target and heuristically guide the model to deliver better predictions on that target. We present an interactive adjustment network to effectively model and leverage the feedback. This network first exploits the feedback from previous predictions to dynamically generate an adjuster, which then interactively makes appropriate adjustments to current predictions for more accurate ones. We propose a novel displacement expectation loss to train this interactive architecture. Through experiments on representative prediction methods and widely-used benchmarks, we demonstrate the great value of individual feedback and the superior effectiveness of the proposed interactive adjustment network. Our code will be made publicly available.

260Best-of-Both-Worlds Policy Optimization for CMDPs with Bandit Feedback

[openreview] [pdf]

Abstract We study online learning in constrained Markov decision processes (CMDPs) in which rewards and constraints may be either stochastic or adversarial. In such settings, Stradi et al. (2024b) proposed the first best-of-both-worlds algorithm able to seamlessly handle stochastic and adversarial constraints, achieving optimal regret and constraint violation bounds in both cases. This algorithm suffers from two major drawbacks. First, it only works under full feedback, which severely limits its applicability in practice. Moreover, it relies on optimizing over the space of occupancy measures, which requires solving convex optimization problems, a highly inefficient task. In this paper, we provide the first best-of-both-worlds algorithm for CMDPs with bandit feedback. Specifically, when the constraints are stochastic, the algorithm achieves $\widetilde{\mathcal{O}}(\sqrt{T})$ regret and constraint violation, while, when they are adversarial, it attains $\widetilde{\mathcal{O}}(\sqrt{T})$ constraint violation and a tight fraction of the optimal reward. Moreover, our algorithm is based on a policy optimization approach, which is much more efficient than occupancy-measure-based methods.

261A Contextual Online Learning Theory of Brokerage

[openreview] [pdf]

Abstract We study the role of contextual information in the online learning problem of brokerage between traders. At each round, two traders arrive with secret valuations about an asset they wish to trade. The broker suggests a trading price based on contextual data about the asset. Then, the traders decide to buy or sell depending on whether their valuations are higher or lower than the brokerage price. We assume the market value of traded assets is an unknown linear function of a $d$-dimensional vector representing the contextual information available to the broker. Additionally, at each time step, we model traders’ valuations as independent bounded zero-mean perturbations of the asset’s current market value, allowing for potentially different unknown distributions across traders and time steps. Consistently with the existing online learning literature, we evaluate the performance of a learning algorithm with the regret with respect to the gain from trade. If the noise distributions admit densities bounded by some constant $L$, then, for any time horizon $T$: if the agents’ valuations are revealed after each interaction, we provide an algorithm achieving $O(Ld \ln T)$ regret and show a corresponding matching lower bound of $\Omega(Ld \ln T)$; if only their willingness to sell or buy at the proposed price is revealed after each interaction, we provide an algorithm achieving $O(\sqrt{LdT \ln T})$ regret and show that this rate is optimal (up to logarithmic factors), via a lower bound of $\Omega(\sqrt{LdT})$. To complete the picture, we show that if the bounded density assumption is lifted, then the problem becomes unlearnable, even with full feedback.

262Diffusion Guided Adversarial State Perturbations in Reinforcement Learning

[openreview] [pdf]

Abstract Reinforcement learning (RL) systems, while achieving remarkable success across various domains, are vulnerable to adversarial attacks. This is especially a concern in vision-based environments where minor manipulations of high-dimensional image inputs can easily mislead the agent’s behavior. To this end, various defenses have been proposed recently, with state-of-the-art approaches achieving robust performance even under large state perturbations. Upon closer investigation, however, we found that the effectiveness of the current defenses is due to a fundamental weakness of the existing $l_p$-norm constrained attacks, which can barely alter the semantics of the input even under a relatively large perturbation budget. In this work, we propose SHIFT, a novel diffusion-based state perturbation attack to go beyond this limitation. Specifically, we train a history-conditioned diffusion model, enhanced with policy guidance and realism detection to generate perturbed states that are semantically different from the true states while remaining realistic and history-aligned to avoid detection. Evaluations show that our attack effectively breaks existing defenses, including the most sophisticated ones, and significantly lowers the agent’s cumulative reward in various Atari games by more than 50%. The results highlight the vulnerability of RL agents to semantics-aware adversarial perturbations, indicating the importance of developing more robust policies for safety-critical domains.

263Mitigating Embedding Collapse in Diffusion Models for Categorical Data

[openreview] [pdf]

Abstract Latent diffusion models have enabled continuous-state diffusion models to handle a variety of datasets, including categorical data. However, most methods rely on fixed pretrained embeddings, limiting the benefits of joint training with the diffusion model. While jointly learning the embedding (via reconstruction loss) and the latent diffusion model (via score matching loss) could enhance performance, our analysis shows that end-to-end training risks embedding collapse, degrading generation quality. To address this issue, we introduce CATDM, a continuous diffusion framework within the embedding space that stabilizes training. We propose a novel objective combining the joint embedding-diffusion variational lower bound with a Consistency-Matching (CM) regularizer, alongside a shifted cosine noise schedule and random dropping strategy. The CM regularizer ensures the recovery of the true data distribution. Experiments on benchmarks show that CATDM mitigates embedding collapse, yielding superior results on FFHQ, LSUN Churches, and LSUN Bedrooms. In particular, CATDM achieves an FID of 6.81 on ImageNet 256×256 with 50 steps. It outperforms non-autoregressive models in machine translation and is on a par with previous methods in text generation.

264Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models

[openreview] [pdf]

Abstract Classifier-free guidance (CFG) is crucial for improving both generation quality and alignment between the input condition and final output in diffusion models. While a high guidance scale is generally required to enhance these aspects, it also causes oversaturation and unrealistic artifacts. In this paper, we revisit the CFG update rule and introduce modifications to address this issue. We first decompose the update term in CFG into parallel and orthogonal components with respect to the conditional model prediction and observe that the parallel component primarily causes oversaturation, while the orthogonal component enhances image quality. Accordingly, we propose down-weighting the parallel component to achieve high-quality generations without oversaturation. Additionally, we draw a connection between CFG and gradient ascent and introduce a new rescaling and momentum method for the CFG update rule based on this insight. Our approach, termed adaptive projected guidance (APG), retains the quality-boosting advantages of CFG while enabling the use of higher guidance scales without oversaturation. APG is easy to implement and introduces practically no additional computational overhead to the sampling process. Through extensive experiments, we demonstrate that APG is compatible with various conditional diffusion models and samplers, leading to improved FID, recall, and saturation scores while maintaining precision comparable to CFG, making our method a superior plug-and-play alternative to standard classifier-free guidance.
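
A compact sketch of the projection step described above: decompose the guidance update into components parallel and orthogonal to the conditional prediction and down-weight the parallel part. The factor `eta` is illustrative, and the paper's rescaling and momentum terms are omitted.

```python
import torch

def projected_guidance(cond, uncond, scale=7.5, eta=0.3):
    # Standard CFG would return cond + (scale - 1) * (cond - uncond).
    diff = (cond - uncond).flatten(1)
    ref = cond.flatten(1)
    coeff = (diff * ref).sum(1, keepdim=True) / ref.pow(2).sum(1, keepdim=True).clamp_min(1e-12)
    parallel = coeff * ref                  # component blamed for oversaturation
    orthogonal = diff - parallel            # component that enhances quality
    update = eta * parallel + orthogonal    # down-weight only the parallel part
    return cond + (scale - 1.0) * update.view_as(cond)
```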

266Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction

[openreview] [pdf]

Abstract Understanding drivers’ decision-making is crucial for road safety. Although predicting the ego-vehicle’s path is valuable for driver-assistance systems, existing methods mainly focus on external factors like other vehicles’ motions, often neglecting the driver’s attention and intent. To address this gap, we infer the ego-trajectory by integrating the driver’s attention and the surrounding scene. We introduce RouteFormer, a novel multimodal ego-trajectory prediction network combining GPS data, environmental context, and driver field-of-view—comprising first-person video and gaze fixations. We also present the Path Complexity Index (PCI), a new metric for trajectory complexity that enables a more nuanced evaluation of challenging scenarios. To tackle data scarcity and enhance diversity, we introduce GEM, a comprehensive dataset of urban driving scenarios enriched with synchronized driver field-of-view and gaze data. Extensive evaluations on GEM and DR(eye)VE demonstrate that RouteFormer significantly outperforms state-of-the-art methods, achieving notable improvements in prediction accuracy across diverse conditions. Ablation studies reveal that incorporating driver field-of-view data yields significantly better average displacement error, especially in challenging scenarios with high PCI scores, underscoring the importance of modeling driver attention. All data, code, and models will be made publicly available.

267Data Exfiltration in Diffusion Models: A Backdoor Attack Approach

[openreview] [pdf]

Abstract As diffusion models (DMs) become increasingly susceptible to adversarial attacks, this paper investigates a novel method of data exfiltration through strategically implanted backdoors. Unlike conventional techniques that directly alter data, we pioneer the use of unique trigger embeddings for each image to enable covert data retrieval. Furthermore, we extend our exploration to text-to-image diffusion models such as Stable Diffusion by introducing the Caption Backdoor Subnet (CBS), which exploits these models for both image and caption extraction. This innovative approach not only reveals an unexplored facet of diffusion model security but also contributes valuable insights toward enhancing the resilience of generative models against sophisticated threats.

268Policy Transfer via Latent Graph Planning

[openreview] [pdf]

Abstract We introduce a transfer learning framework for deep reinforcement learning that integrates graph-based planning with self-supervised representation learning to efficiently transfer knowledge across tasks. While standard reinforcement learning aims to learn policies capable of solving long-horizon tasks, the resulting policies often fail to generalize to novel tasks and environments. Our approach addresses this limitation by decomposing long-horizon tasks into sequences of transferable short-horizon tasks modeled by goal-conditioned policies. We utilize a planning graph to generate fine-grained sub-goals that guide these short-horizon policies to solve novel long-horizon tasks. Experimental results show that our method improves sample efficiency and demonstrates an improved ability to solve sparse-reward and long-horizon tasks compared to baseline methods in challenging single-agent and multi-agent scenarios. In particular, compared to the state-of-the-art, our method achieves the same or better expected policy reward while requiring fewer training samples when learning novel tasks.

269How to distill task-agnostic representations from many teachers?

[openreview] [pdf]

Abstract Casting complex inputs onto tractable representations is a critical step in many fields. Differences in architectures, loss functions, input modalities, and datasets lead to embedding models that capture diverse information of the input. Multi-teacher distillation seeks to exploit this diversity to create richer representations but often remains task-specific. We extend this framework by proposing a task-oriented setting that introduces an objective function based on the “majority vote” principle. We demonstrate that the mutual information between the student and the teachers is an upper bound for this function, providing a task-agnostic loss for our distillation procedure. An extensive evaluation is performed in different domains --- natural language processing, computer vision, and molecular modeling --- indicating that our method effectively leverages teacher diversity to produce more informative representations. Finally, we use our method to train and release new state-of-the-art embedders, enabling improved downstream performance in NLP and molecular modeling.

270Can In-context Learning Really Generalize to Out-of-distribution Tasks?

[openreview] [pdf]

Abstract In this work, we explore the mechanism of in-context learning (ICL) on out-of-distribution (OOD) tasks that were not encountered during training. To achieve this, we conduct synthetic experiments where the objective is to learn OOD mathematical functions through ICL using a GPT-2 model. We reveal that Transformers may struggle to learn OOD task functions through ICL. Specifically, ICL performance resembles implementing a function within the pretraining hypothesis space and optimizing it with gradient descent based on the in-context examples. Additionally, we investigate ICL’s well-documented ability to learn unseen abstract labels in context. We demonstrate that such ability only manifests in the scenarios without distributional shifts and, therefore, may not serve as evidence of new-task-learning ability. Furthermore, we assess ICL’s performance on OOD tasks when the model is pretrained on multiple tasks. Both empirical and theoretical analyses demonstrate the existence of the low-test-error preference of ICL, where it tends to implement the pretraining function that yields low test error in the testing context. We validate this through numerical experiments. This new theoretical result, combined with our empirical findings, elucidates the mechanism of ICL in addressing OOD tasks.

271Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones

[openreview] [pdf]

Abstract Early Exiting (EE) is a promising technique for speeding up inference at the cost of limited performance loss. It adaptively allocates compute budget to data points based on their difficulty by exiting at earlier layers when predictions are confident. In this study, we first present a novel perspective on the EE approach, demonstrating that larger models, when deployed with EE, can achieve higher performance than smaller models while maintaining similar computational costs. As existing EE approaches rely on confidence estimation at each exit point, we further study the impact of overconfidence on the controllability of the compute/performance trade-off. We introduce Performance Control Early Exiting (PCEE), a method that enables accuracy thresholding by basing decisions not on a datapoint’s confidence but on the average accuracy of samples with similar confidence levels from a held-out validation set. In our experiments with MSDNets and Vision Transformer architectures on CIFAR-10, CIFAR-100, and ImageNet, we show that PCEE offers a simple yet computationally efficient approach that provides better control over performance than standard confidence-based approaches, and allows us to scale up model sizes to yield performance gains while reducing the computational cost.
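
A small sketch of the stated decision rule: bin held-out validation samples by confidence at a given exit, record accuracy per bin, and exit only when the bin accuracy (not the raw confidence) clears the target. The bin granularity is an illustrative choice.

```python
import numpy as np

def calibrate_bins(val_conf, val_correct, n_bins=10):
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    idx = np.clip(np.digitize(val_conf, edges) - 1, 0, n_bins - 1)
    acc = np.array([val_correct[idx == b].mean() if np.any(idx == b) else 0.0
                    for b in range(n_bins)])
    return edges, acc

def should_exit(confidence, edges, bin_acc, target_acc=0.9):
    b = min(max(np.digitize(confidence, edges) - 1, 0), len(bin_acc) - 1)
    # Exit only if similarly confident validation samples were accurate enough.
    return bin_acc[b] >= target_acc
```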

272Federated Unlearning with Diffusion Models

[openreview] [pdf]

Abstract In recent years, diffusion models have been widely adopted by individual users due to their outstanding performance in generation. During usage, individual users develop a need to forget privacy-related content, making the scenario of using diffusion models on the clients a natural federated unlearning setting. For this scenario, we propose FedDUL, a Federated UnLearning method with Diffusion models, which addresses the unlearn requests from clients using the diffusion models. On one hand, we utilize local data on the clients to perform attention-based unlearning, enabling the local diffusion model to forget the concepts specified by the clients. On the other hand, we filter and group the unlearn requests from clients, gradually aggregating reasonable requests into the global diffusion model on the server, thereby protecting client privacy within the global model. The theoretical analysis further demonstrates the inherent unity between the federated unlearning problem based on diffusion models and federated learning, and extends this unity to traditional federated unlearning methods. Extensive quantitative and visualization experiments are conducted to evaluate the unlearning of both local and global models and to discuss the communication and computation costs of our method. The results demonstrate that our method can satisfy the unlearn requests of multiple clients without compromising the generative capabilities for irrelevant concepts, providing new ideas and methods for the application of diffusion models in federated unlearning.

273Strategic Exploration for Inverse Constraint Inference with Efficiency Guarantee

[openreview] [pdf]

Abstract In many realistic applications, the constraints are not readily available, and we need to infer the constraints respected by the expert agents from their behaviors. The problem is known as Inverse Constraint Inference (ICI). A common solver, Inverse Constrained Reinforcement Learning (ICRL), seeks to recover the optimal constraints in complex environments in a data-driven manner. Existing ICRL algorithms collect training samples from an interactive environment. However, the efficacy and efficiency of these sampling strategies remain unknown. To bridge this gap, we introduce a strategic exploration framework with guaranteed efficiency. Specifically, we define a feasible constraint set for ICRL problems and investigate how expert policy and environmental dynamics influence the optimality of constraints. Motivated by our findings, we propose two exploratory algorithms to achieve efficient constraint inference via 1) dynamically reducing the bounded aggregate error of cost estimation and 2) strategically constraining the exploration policy. Both algorithms are theoretically grounded with tractable sample complexity. We empirically demonstrate the performance of our algorithms under various environments.

274Supervised and Semi-Supervised Diffusion Maps with Label-Driven Diffusion

[openreview] [pdf]

Abstract In this paper, we introduce Supervised Diffusion Maps (SDM) and Semi-Supervised Diffusion Maps (SSDM), which transform the well-known unsupervised dimensionality reduction algorithm, Diffusion Maps, into supervised and semi-supervised learning tools. The proposed methods, SDM and SSDM, are based on our new approach that treats the labels as a second view of the data. This unique framework allows us to incorporate ideas from multi-view learning. Specifically, we propose constructing two affinity kernels corresponding to the data and the labels. We then propose a multiplicative interpolation scheme of the two kernels, whose purpose is twofold. First, our scheme extracts the common structure underlying the data and the labels by defining a diffusion process driven by the data and the labels. This label-driven diffusion produces an embedding that emphasizes the properties relevant to the label-related task. Second, the proposed interpolation scheme balances the influence of the two kernels. We show on multiple benchmark datasets that the embedding learned by SDM and SSDM is more effective in downstream regression and classification tasks than existing unsupervised, supervised, and semi-supervised nonlinear dimension reduction methods.
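
A minimal sketch of the two-kernel construction, under the assumption that the multiplicative interpolation is an elementwise geometric mixture of a data kernel and a label kernel; the actual interpolation scheme in SDM/SSDM may differ.

```python
import numpy as np

def rbf_kernel(z, sigma=1.0):
    sq = ((z[:, None, :] - z[None, :, :]) ** 2).sum(-1)
    return np.exp(-sq / (2.0 * sigma**2))

def label_driven_embedding(x, y, beta=0.5, k=2):
    # x: (n, d) features; y: (n, 1) labels treated as a second view of the data.
    K = rbf_kernel(x) ** (1.0 - beta) * rbf_kernel(y) ** beta  # assumed mixture
    P = K / K.sum(axis=1, keepdims=True)        # row-normalized diffusion operator
    vals, vecs = np.linalg.eig(P)
    order = np.argsort(-vals.real)[1:k + 1]     # drop the trivial top eigenvector
    return (vecs[:, order] * vals[order]).real
```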

275Learning to Permute with Discrete Diffusion

[openreview] [pdf]

Abstract The group of permutations $S_n$, also known as the finite symmetric group, is essential in fields such as combinatorics, physics, and chemistry. However, learning a probability distribution over $S_n$ poses significant challenges due to its intractable size and discrete nature. In this paper, we introduce SymmetricDiffusers, a novel discrete diffusion model that simplifies the task of learning a complicated distribution over $S_n$ by decomposing it into learning simpler transitions of the reverse diffusion using deep neural networks. We identify the riffle shuffle as an effective forward transition and provide empirical guidelines for selecting the diffusion length based on the theory of random walks on finite groups. Additionally, we propose a generalized Plackett-Luce (PL) distribution for the reverse transition, which is provably more expressive than the PL distribution. We further introduce a theoretically grounded “denoising schedule” to improve sampling and learning efficiency. Extensive experiments show that our model achieves state-of-the-art or comparable performances on solving tasks including sorting 4-digit MNIST images, jigsaw puzzles, and traveling salesman problems.
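
For concreteness, here is the classical Gilbert-Shannon-Reeds riffle shuffle that the abstract identifies as the forward transition: cut the deck at a Binomial(n, 1/2) position, then interleave by dropping from each packet with probability proportional to its remaining size.

```python
import numpy as np

def riffle_shuffle(perm, rng):
    n = len(perm)
    cut = rng.binomial(n, 0.5)                     # binomial cut point
    left, right = list(perm[:cut]), list(perm[cut:])
    out = []
    while left or right:
        # Drop from a packet with probability proportional to its size.
        if rng.random() < len(left) / (len(left) + len(right)):
            out.append(left.pop(0))
        else:
            out.append(right.pop(0))
    return np.array(out)

rng = np.random.default_rng(0)
print(riffle_shuffle(np.arange(8), rng))   # one forward diffusion step on S_8
```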

276PROGRESSIVE KNOWLEDGE DISTILLATION (PKD): A MODULAR APPROACH FOR ARCHITECTURE-AGNOSTIC KNOWLEDGE DISTILLATION

[openreview] [pdf]

Abstract Knowledge distillation (KD) is a key technique for training lightweight deep neural networks, particularly in resource-constrained environments. While existing KD methods utilize intermediate features to improve student models, they often overlook the proper alignment between teacher-student layers and fail to select the most informative data for training each student layer. These limitations are especially pronounced in architecture-agnostic scenarios, where different network architectures complicate knowledge transfer. We propose PKD, a Progressive Knowledge Distillation framework that progressively aligns teacher and student layers through feature-based modularization. Each student module is trained using the most representative features from its corresponding teacher module, starting with the shallowest layers and progressively moving to deeper ones. This training method enables efficient, architecture-agnostic knowledge transfer across a variety of model architectures. Experiments on CIFAR-100 and ImageNet-1K demonstrate that PKD outperforms baseline models, achieving performance improvements of up to 4.54% and 6.46%, respectively, thereby validating its effectiveness in diverse neural network settings.

277What should an AI assessor optimise for?

[openreview] [pdf]

Abstract An AI assessor is an external, ideally independent system that predicts an indicator, e.g., a loss value, of another AI system. Assessors can leverage information from the test results of many other AI systems and have the flexibility of being trained on any loss function: from squared error to toxicity metrics. Here we address the question: is it always optimal to train the assessor for the target loss? Or could it be better to train for a different loss and then map predictions back to the target loss? Using ten regression problems with tabular data, we experimentally explore this question for regression losses with monotonic and nonmonotonic mappings and find that, contrary to intuition, optimising for more informative losses is not generally better. Surprisingly though, some monotonic transformations, such as the logistic loss used to minimise the absolute or squared error, are promising.

278XXLTraffic: Expanding and Extremely Long Traffic forecasting beyond test adaptation

[openreview] [pdf]

Abstract Traffic forecasting is crucial for smart cities and intelligent transportation initiatives, where deep learning has made significant progress in modeling complex spatio-temporal patterns in recent years. However, current public datasets have limitations in reflecting the distribution shift nature of real-world scenarios, characterized by continuously evolving infrastructures, varying temporal distributions, and long temporal gaps due to sensor downtimes or changes in traffic patterns. These limitations inevitably restrict the practical applicability of existing traffic forecasting datasets. To bridge this gap, we present XXLTraffic, the public traffic dataset with the longest available timespan, collected from Los Angeles, USA, and New South Wales, Australia, and curated to support research in extremely long forecasting beyond test adaptation. Our benchmark includes both typical time-series forecasting settings with hourly and daily aggregated data and novel configurations that introduce gaps and down-sample the training size to better simulate practical constraints. We anticipate the new XXLTraffic will provide a fresh perspective for the time-series and traffic forecasting communities. It would also offer a robust platform for developing and evaluating models designed to tackle the extremely long forecasting problems beyond test adaptation. Our dataset supplements existing spatio-temporal data resources and leads to new research directions in this domain.

279Training Task Experts through Retrieval Based Distillation

[openreview] [pdf]

Abstract One of the most reliable ways to create deployable models for specialized tasks is to obtain an adequate amount of high-quality task-specific data. However, for specialized tasks, often such datasets do not exist. Existing methods address this by creating such data from large language models (LLMs) and then distilling such knowledge into smaller models. However, these methods are limited by the quality of the LLMs’ output and tend to generate repetitive or incorrect data. In this work, we present Retrieval Based Distillation (ReBase), a method that first retrieves data from rich online sources and then transforms them into domain-specific data. This method greatly enhances data diversity. Moreover, ReBase generates Chain-of-Thought reasoning and distills the reasoning capacity of LLMs. We test our method on 4 benchmarks and show that it significantly improves performance by up to 10.76% on SQuAD, 1.37% on MNLI, and 1.94% on BBH.

280Improving real-world sequence design with a simple meta-heuristic for detecting distribution shift

[openreview] [pdf]

Abstract Biological sequence design is one of the most impactful areas where model-based optimization is applied. A common scenario involves using a fixed training set to train predictive models, with the goal of designing new sequences that outperform those present in the training data. This by definition results in a distribution shift, where the model is applied to samples that are substantially different from those in the training set (or otherwise they wouldn’t have a chance of being much better). While most MBO methods offer some balancing heuristic to control for false positives, finding the right balance of pushing the design distribution while maintaining model accuracy requires deep knowledge of the algorithm and artful application, limiting successful adoption by practitioners. To tackle this issue, we propose a straightforward meta-algorithm for design practitioners that detects distribution shifts when using any MBO method. Through a real-world sequence design experiment, we show that (1) real-world distribution shift is far more severe than observed in the simulated settings where most MBO algorithms are benchmarked, and (2) our approach successfully reduces the adverse effects of distribution shift. We believe this method can significantly improve design quality for sequence design tasks and potentially other domain applications where offline optimization faces harsh distribution shifts.

281Algorithm for Concept Extrapolation: Diverse Generalization via Selective Disagreement

[openreview] [pdf]

Abstract Standard deep learning approaches often struggle to handle out-of-distribution data, especially when the distributional shift breaks spurious correlations. While some approaches to handling spurious correlations under distributional shift aim to separate causal and spurious features without access to target distribution data, they rely on labeled data from different domains or contingent assumptions about the nature of neural representations. Existing methods that do make use of unlabeled target data make strict assumptions about the target data distribution. To overcome these limitations, we present the Algorithm for Concept Extrapolation (ACE). Using an exponentially-weighted disagreement loss to maximize disagreement on target instances that break spurious correlations, ACE achieves state-of-the-art performance on spurious complete correlation benchmarks. We also show ACE is robust to unlabeled target distributions where spurious and ground truth features are not statistically independent. Finally, we demonstrate the applicability of ACE for handling goal misgeneralization in deep reinforcement learning, with our “ACE agent” achieving a 16% higher level completion rate in the CoinRun goal misgeneralization problem when the coin is randomly placed in the level.

282Learning-Augmented Robust Algorithmic Recourse

[openreview] [pdf]

Abstract The widespread use of machine learning models in high-stakes domains can have a major negative impact, especially on individuals who receive undesirable outcomes. Algorithmic recourse provides such individuals with suggestions of minimum-cost improvements they can make to achieve a desirable outcome in the future. However, machine learning models often get updated over time and this can cause a recourse to become invalid (i.e., not lead to the desirable outcome). The robust recourse literature aims to choose recourses that remain valid even under adversarial model changes, but this robustness comes at a higher cost. To overcome this obstacle, we initiate the study of algorithmic recourse through the learning-augmented framework and evaluate the extent to which a designer equipped with a prediction regarding future model changes can reduce the cost of recourse when the prediction is accurate (consistency) while also limiting the cost even when the prediction is inaccurate (robustness). We propose a novel algorithm for this problem, study the robustness-consistency trade-off, and analyze how prediction accuracy affects performance.

283Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

[openreview] [pdf]

Abstract Go-Explore is a powerful family of algorithms designed to solve hard-exploration problems built on the principle of archiving discovered states, and iteratively returning to and exploring from the most promising states. This approach has led to superhuman performance across a wide variety of challenging problems including Atari games and robotic control, but requires manually designing heuristics to guide exploration (i.e. determine which states to save and explore from, and what actions to consider next), which is time-consuming and infeasible in general. To resolve this, we propose Intelligent Go-Explore (IGE) which greatly extends the scope of the original Go-Explore by replacing these handcrafted heuristics with the intelligence and internalized human notions of interestingness captured by giant pretrained foundation models (FMs). This provides IGE with a human-like ability to instinctively identify how interesting or promising any new state is (e.g. discovering new objects, locations, or behaviors), even in complex environments where heuristics are hard to define. Moreover, IGE offers the exciting and previously impossible opportunity to recognize and capitalize on serendipitous discoveries that cannot be predicted ahead of time. We evaluate our algorithm on a diverse range of language and vision-based tasks that require search and exploration. Across these tasks, IGE strongly exceeds classic reinforcement learning and graph search baselines, and also succeeds where prior state-of-the-art FM agents like Reflexion completely fail. Overall, Intelligent Go-Explore combines the tremendous strengths of FMs and the powerful Go-Explore algorithm, opening up a new frontier of research into creating more generally capable agents with impressive exploration capabilities.

284Instant Policy: In-Context Imitation Learning via Graph Diffusion

[openreview] [pdf]

Abstract Following the impressive capabilities of in-context learning with large transformers, In-Context Imitation Learning (ICIL) is a promising opportunity for robotics. We introduce Instant Policy, which learns new tasks instantly from just one or two demonstrations, achieving ICIL through two key components. First, we introduce inductive biases through a graph representation and model ICIL as a graph generation problem using a learned diffusion process, enabling structured reasoning over demonstrations, observations, and actions. Second, we show that such a model can be trained using pseudo-demonstrations – arbitrary trajectories generated in simulation – as a virtually infinite pool of training data. Our experiments, in both simulation and reality, show that Instant Policy enables rapid learning of various everyday robot tasks. We also show how it can serve as a foundation for cross-embodiment and zero-shot transfer to language-defined tasks.

285Dataset Condensation with Sharpness-Aware Trajectory Matching

[openreview] [pdf]

Abstract Dataset condensation aims to synthesise datasets with a few representative samples that can effectively represent the original datasets. This enables efficient training and produces models with performance close to those trained on the original sets. Most existing dataset condensation methods conduct dataset learning under bilevel (inner- and outer-loop) optimisation. However, due to its notoriously complicated loss landscape and expensive time-space complexity, the preceding methods either develop advanced training protocols so that the learned datasets generalise to unseen tasks or reduce the inner-loop learning cost, which increases proportionally with the number of unrolling steps. This problem worsens when the datasets are learned by matching the trajectories of networks trained on the real and synthetic datasets with a long-horizon inner loop. To address these issues, we introduce Sharpness-Aware Trajectory Matching (SATM), which enhances the generalisation capability of learned synthetic datasets by minimising sharpness in the outer loop of the bilevel optimisation. Moreover, our approach is coupled with an efficient hypergradient approximation that is mathematically well-supported and straightforward to implement, with controllable computational overhead. Empirical evaluations of SATM demonstrate its effectiveness across various applications, including standard in-domain benchmarks and out-of-domain settings. Moreover, its easy-to-implement properties afford flexibility, allowing it to integrate with other advanced sharpness-aware minimisers. We will release our code on GitHub.

286Decoupled Offline to Online finetuning via Dynamics Model

[openreview] [pdf]

Abstract Constrained by the sub-optimal datasets used in offline reinforcement learning (RL), an offline-trained agent should be finetuned online before deployment. Due to conservative offline algorithms and the unbalanced state distribution of the offline dataset, offline-to-online finetuning faces severe distribution shift. This shift disturbs policy improvement during online interaction and can even cause a performance drop. A natural yet unexplored idea is whether policy improvement can be decoupled from distribution shift. In this work, we propose a decoupled offline-to-online finetuning framework using the dynamics model from model-based methods. During online interaction, only the dynamics model is finetuned to overcome the distribution shift. The policy is then finetuned in an offline manner with the finetuned dynamics and without further interaction. As a result, the online stage only needs to deal with a simpler supervised dynamics-learning problem, rather than complex policy improvement under the interference of distribution shift. When finetuning the policy, we adopt the offline approach, which ensures the conservatism of the algorithm and fundamentally avoids sudden performance crashes. We conduct extensive evaluation on classical offline RL datasets, demonstrating the effective elimination of distribution shift, stable and superior policy finetuning performance, and exceptional interaction efficiency within our decoupled offline-to-online finetuning framework.

287Simple Policy Optimization

[openreview] [pdf]

Abstract Model-free reinforcement learning algorithms have seen remarkable progress, but key challenges remain. Trust Region Policy Optimization (TRPO) is known for ensuring monotonic policy improvement through conservative updates within a trust region, backed by strong theoretical guarantees. However, its reliance on complex second-order optimization limits its practical efficiency. Proximal Policy Optimization (PPO) addresses this by simplifying TRPO’s approach using ratio clipping, improving efficiency but sacrificing some theoretical robustness. This raises a natural question: Can we combine the strengths of both methods? In this paper, we introduce Simple Policy Optimization (SPO), a novel unconstrained first-order algorithm. SPO integrates the surrogate objective with Total Variation (TV) divergence instead of Kullback-Leibler (KL) divergence, achieving a balance between the theoretical rigor of TRPO and the efficiency of PPO. Our new objective improves upon ratio clipping, offering stronger theoretical properties and better constraining the probability ratio within the trust region. Empirical results demonstrate that SPO achieves state-of-the-art performance with a simple implementation and improved sample efficiency, particularly for training large, complex network architectures end-to-end.
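
As a rough illustration of how a TV-based objective differs from ratio clipping, the sketch below adds a total-variation penalty to a PPO-style surrogate for discrete actions. The penalty form, the trust-region radius `tv_radius`, the coefficient `lam`, and the function name are illustrative assumptions, not SPO's actual objective.

```python
import torch

def spo_style_loss(logp_new, logp_old, adv, probs_new, probs_old,
                   tv_radius=0.05, lam=10.0):
    """PPO-like surrogate that constrains TV divergence instead of
    clipping the probability ratio (illustrative sketch only)."""
    ratio = torch.exp(logp_new - logp_old)   # pi_new(a|s) / pi_old(a|s)
    surrogate = (ratio * adv).mean()         # standard policy-gradient surrogate
    # Empirical total variation between the discrete action distributions.
    tv = 0.5 * (probs_new - probs_old).abs().sum(dim=-1).mean()
    # Penalize only the part of the TV divergence beyond the trust region.
    return -(surrogate - lam * torch.clamp(tv - tv_radius, min=0.0))
```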

288Stabilize continual learning with hyperspherical replay

[openreview] [pdf]

Abstract Neural networks face catastrophic forgetting of previously learned knowledge when training on new task data. While the field of continual learning has made promising progress in reducing this forgetting, recent work has uncovered an interesting phenomenon: existing techniques often exhibit a sharp performance drop on prior tasks during the initial stages of new task training, a phenomenon known as the “stability gap.” This phenomenon not only raises safety concerns but also challenges the current understanding of neural network behavior in continual learning scenarios. Inspired by this discovery, we revisit two fundamental questions in continual learning: 1) Is past learned knowledge within deep networks lost abruptly or gradually? and 2) Is past learned knowledge ever completely erased? Our analysis reveals that abrupt forgetting occurs not only in the final fully connected layer but also permeates the feature space and most layers, sparing only the earliest layers. Alarmingly, a single gradient update can severely disrupt the learned class structure. We identify degenerate solutions in the softmax cross-entropy loss as a major contributing factor, with memory samples exhibiting higher feature norms compared to new samples. To address these issues, we propose Adaptive Angular Replay (AAR), a simple yet effective approach that learns features in hyperspherical space using feature and weight normalization. AAR demonstrates a strong ability to preserve class structure during task transitions. Additionally, we introduce an adaptive scaling strategy to further mitigate the stability gap and improve overall accuracy.
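
A minimal sketch of the feature- and weight-normalization idea described above: logits become scaled cosine similarities, so features and class prototypes live on the unit hypersphere and the degenerate high-feature-norm solutions are ruled out. The fixed `scale` value stands in for the paper's adaptive scaling strategy and is an assumption.

```python
import torch
import torch.nn.functional as F

class HypersphericalClassifier(torch.nn.Module):
    """Logits as scaled cosine similarity between normalized features
    and normalized class weights."""
    def __init__(self, feat_dim, num_classes, scale=16.0):
        super().__init__()
        self.weight = torch.nn.Parameter(torch.randn(num_classes, feat_dim))
        self.scale = scale  # fixed here; an adaptive rule would replace it

    def forward(self, feats):
        feats = F.normalize(feats, dim=-1)    # features on the unit sphere
        w = F.normalize(self.weight, dim=-1)  # prototypes on the unit sphere
        # Bounded feature norms remove the incentive to inflate logits
        # by growing the norm of memory-sample features.
        return self.scale * feats @ w.t()
```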

289Distributed Constrained Optimal Consensus Under a Directed Graph

[openreview] [pdf]

Abstract In this paper, the distributed constrained optimal consensus problem of multi-agent systems under a directed graph is investigated. We propose two projection-based distributed constrained optimal consensus algorithms: one addressing set constraints and the other tailored for general constraints. Only the relative state is exchanged among agents in these two algorithms. In the stability analysis of the case with set constraints, we transform the distributed optimization problem into a constrained leaderless consensus problem by adopting a sliding mode approach. Building on this foundational transformation, we further develop a projection-based distributed constrained optimal consensus algorithm to address general constraints. It is shown that the proposed algorithm achieves an ergodic convergence rate of O(1/k) with respect to the first-order optimality residuals. Numerical simulations are conducted to validate the effectiveness of our theoretical results.

290How new data pollutes LLM knowledge and how to dilute it

[openreview] [pdf]

Abstract Understanding how learning new texts alters the existing knowledge in a large language model is of great importance, because it is through such accumulated changes that the LLM was initially pre-trained, and it is also through such changes that continual, new learning in LLMs can proceed. As a result, both desirable alterations (i.e. generalization) and undesirable alterations (i.e. hallucination) can occur. Here, we study the learning of new texts, one at a time, and ask: how does it impact the underlying LLM knowledge? We show that learning new texts induces ‘priming’, an undesirable effect that pollutes existing knowledge where it should not. Centrally, we demonstrate that we can predict how much priming will happen after learning, using token probability before learning. This relation was empirically robust across models of various sizes (PALM-2-xs/s, Gemma-2b, Llama-2-7b) and training stages. To show this, we created a new dataset, called “Outlandish”, consisting of 1320 samples with diverse textual characteristics. Finally, we propose two strategies to mitigate the spread of priming: first, a simple text augmentation technique which we call the “stepping-stone”, and second, a novel update pruning technique (“ignore-k”). These decrease priming by a median of 50%-75% and 50%-95% respectively, depending on the model architecture, and enhance the specificity of new learning in language models. The dataset and reproducible findings can be found [LINK omitted for double blind review].
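
A sketch of the kind of pre-learning measurement the prediction relies on: the probability a model assigns to a keyword before training on the new text. The model choice (`gpt2`) and the single-token assumption for the keyword are illustrative stand-ins; the paper works with PALM-2, Gemma, and Llama models.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def keyword_probability(model_name: str, prompt: str, keyword: str) -> float:
    """Probability assigned to `keyword` as the next token after `prompt`."""
    tok = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    ids = tok(prompt, return_tensors="pt").input_ids
    # Assumes the keyword maps to a single token; longer keywords would
    # need a sum of log-probs over their tokens.
    kw_id = tok(keyword, add_special_tokens=False).input_ids[0]
    with torch.no_grad():
        next_token_logits = model(ids).logits[0, -1]
    return torch.softmax(next_token_logits, dim=-1)[kw_id].item()

# Low pre-learning keyword probability is the signal that predicts priming.
print(keyword_probability("gpt2", "The color of the sky is", " vermilion"))
```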

291Magnetic Mirror Descent Self-play Preference Optimization

[openreview] [pdf]

Abstract Standard Reinforcement Learning from Human Feedback (RLHF) methods mainly optimize preferences through the Bradley-Terry (BT) reward model, which may misalign with natural human preferences due to the strong transitivity assumption. Recent work has reframed the preference learning problem as a two-player constant-sum game, aiming to learn policies that better reflect human preferences by finding the Nash equilibrium (NE) of this game. However, existing methods under this framework either guarantee only average-iterate convergence or rely on strong first-order approximation assumptions. In this paper, we propose Mirror Descent Self-play Preference Optimization (MDSPO), a novel approach based on Magnetic Mirror Descent (MMD). By introducing an additional magnetic term, MDSPO achieves a linear convergence rate to the NE of the regularized game. Furthermore, we establish theoretical guarantees for the convergence of our algorithm to the NE of the original game by periodically updating the reference policy. This approach effectively guarantees that the final policy accurately reflects the true human preferences. To ensure our algorithm is both theoretically sound and practically viable, we provide a simple yet effective implementation that adapts the theoretical insights to the RLHF setting. We demonstrate its effectiveness on a variety of benchmarks.

292Towards Robust Concept Erasure in Diffusion Models: Unlearning Identity, Nudity and Artistic Styles

[openreview] [pdf]

Abstract Diffusion models have achieved remarkable success in generative tasks across various domains. However, the increasing demand for content moderation and the removal of specific concepts from these models has introduced the challenge of unlearning. In this work, we present a suite of robust methodologies that significantly enhance the unlearning process by employing advanced loss functions within knowledge distillation frameworks. Specifically, we utilize the Cramer-Wold distance and Jensen-Shannon (JS) divergence to facilitate more efficient and versatile concept removal. Although current unlearning techniques are effective in certain scenarios, they are typically limited to specific categories such as identity, nudity, or artistic style. In contrast, our proposed methods demonstrate robust versatility, seamlessly adapting to and performing effectively across a wide range of concept erasure categories. Our approach outperforms existing techniques, achieving consistent results across different unlearning categories and showcasing its broad applicability. Through extensive experiments, we show that our method not only surpasses previous benchmarks but also addresses key limitations of current unlearning techniques, paving the way for more responsible use of text-to-image diffusion models.

293Continuous Diffusion for Mixed-Type Tabular Data

[openreview] [pdf]

Abstract Score-based generative models (or diffusion models for short) have proven successful for generating text and image data. However, the adaptation of this model family to mixed-type tabular data has fallen short so far. In this paper, we propose CDTD, a Continuous Diffusion model for mixed-type Tabular Data. Specifically, we combine score matching and score interpolation to ensure a common continuous noise distribution for both continuous and categorical features alike. We counteract the high heterogeneity inherent to mixed-type data with distinct, adaptive noise schedules per feature or per data type. The learnable noise schedules ensure optimally allocated model capacity and balanced generative capability. We homogenize the data types further with model-specific loss calibration and initialization schemes tailored to mixed-type tabular data. Our experimental results show that CDTD consistently outperforms state-of-the-art benchmark models, captures feature correlations exceptionally well, and that heterogeneity in the noise schedule design boosts sample quality.

294High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws

[openreview] [pdf]

Abstract A growing number of machine learning scenarios rely on knowledge distillation where one uses the output of a surrogate model as labels to supervise the training of a target model. In this work, we provide a sharp characterization of this process for ridgeless, high-dimensional regression, under two settings: (i) model shift, where the surrogate model is arbitrary, and (ii) distribution shift, where the surrogate model is the solution of empirical risk minimization with out-of-distribution data. In both cases, we characterize the precise risk of the target model through non-asymptotic bounds in terms of sample size and data distribution under mild conditions. As a consequence, we identify the form of the optimal surrogate model, which reveals the benefits and limitations of discarding weak features in a data-dependent fashion. In the context of weak-to-strong (W2S) generalization, this has the interpretation that (i) W2S training, with the surrogate as the weak model, can provably outperform training with strong labels under the same data budget, but (ii) it is unable to improve the data scaling law. We validate our results with numerical experiments on both ridgeless regression and neural network architectures.
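
A toy version of the distillation pipeline studied, under assumed dimensions and noise levels: a weak surrogate is fit by min-norm (ridgeless) regression on part of the data, its predictions serve as labels for the target model, and both are compared against the ground truth.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 200, 50
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.5 * rng.normal(size=n)

# Weak surrogate: min-norm (ridgeless) fit on a subset of the labeled data.
w_weak = np.linalg.pinv(X[:100]) @ y[:100]
y_surrogate = X @ w_weak                 # surrogate labels on the full set

# Target model: ridgeless regression supervised by the surrogate's labels.
w_target = np.linalg.pinv(X) @ y_surrogate

print("weak   error:", np.linalg.norm(w_weak - w_true))
print("target error:", np.linalg.norm(w_target - w_true))
```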

295TELEPORTATION WITH NULL SPACE GRADIENT PROJECTION FOR OPTIMIZATION ACCELERATION

[openreview] [pdf]

Abstract Optimization techniques have become increasingly critical due to the ever-growing model complexity and data scale. In particular, teleportation has emerged as a promising approach, which accelerates convergence of gradient descent-based methods by navigating within the loss invariant level set to identify parameters with advantageous geometric properties. Existing teleportation algorithms have primarily demonstrated their effectiveness in optimizing Multi-Layer Perceptrons (MLPs), but their extension to more advanced architectures, such as Convolutional Neural Networks (CNNs) and Transformers, remains challenging. Moreover, they often impose significant computational demands, limiting their applicability to complex architectures. To this end, we introduce an algorithm that projects the gradient of the teleportation objective function onto the input null space, effectively preserving the teleportation within the loss invariant level set and reducing computational cost. Our approach is readily generalizable from MLPs to CNNs, transformers, and potentially other advanced architectures. We validate the effectiveness of our algorithm across various benchmark datasets and optimizers, demonstrating its broad applicability.
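
The core projection can be illustrated for a single linear layer: an update confined to the null space of the layer's inputs leaves all outputs on the current batch, and hence the loss, unchanged, so the step stays on the loss-invariant level set. This sketch assumes inputs stacked as columns of `X`; the paper extends the idea to CNNs and transformers.

```python
import numpy as np

def project_to_input_null_space(grad_W, X, rtol=1e-10):
    """Project a weight gradient onto the null space of the layer inputs.

    For a linear layer y = W @ x, any update dW with dW @ X == 0 leaves
    the outputs on batch X unchanged, so the loss is preserved.
    """
    # Orthonormal basis of the column space of X (inputs as columns).
    U, S, _ = np.linalg.svd(X, full_matrices=False)
    r = int((S > rtol * S.max()).sum())
    Ur = U[:, :r]
    # Remove the component of each gradient row lying in span(X).
    return grad_W - (grad_W @ Ur) @ Ur.T
```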

296softmax is not enough (for sharp out-of-distribution)

[openreview] [pdf]

Abstract A key property of reasoning systems is the ability to make sharp decisions on their input data. For contemporary AI systems, a key carrier of sharp behaviour is the softmax function, with its capability to perform differentiable query-key lookups. It is a common belief that the predictive power of networks leveraging softmax arises from “circuits” which sharply perform certain kinds of computations consistently across many diverse inputs. However, for these circuits to be robust, they would need to generalise well to arbitrary valid inputs. In this paper, we dispel this myth: even for tasks as simple as finding the maximum key, any learned circuitry must disperse as the number of items grows at test time. We attribute this to a fundamental limitation of the softmax function to robustly approximate sharp functions, prove this phenomenon theoretically, and propose adaptive temperature as an ad-hoc technique for improving the sharpness of softmax at inference time.
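
The effect is easy to see numerically: lowering the softmax temperature at inference time concentrates the distribution on the maximum entry. How to choose the temperature adaptively is the paper's contribution and is not reproduced in this sketch; the values below are arbitrary.

```python
import numpy as np

def tempered_softmax(logits, temperature):
    """Softmax with an inference-time temperature; T < 1 sharpens."""
    z = logits / temperature
    z = z - z.max()          # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

logits = np.array([2.0, 1.5, 0.3, 0.1])
print(tempered_softmax(logits, 1.0))   # dispersed over all items
print(tempered_softmax(logits, 0.25))  # mass concentrates on the argmax
```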

297Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts

[openreview] [pdf]

Abstract For Mixture-of-Experts (MoE) models, an unbalanced expert load will lead to routing collapse or increased computational overhead. Existing methods commonly employ an auxiliary loss to encourage load balance, but a large auxiliary loss will introduce non-negligible interference gradients into training and thus impair the model performance. In order to control load balance while not producing undesired gradients during training, we propose Loss-Free Balancing, a new load balancing strategy that operates without auxiliary losses. To be specific, before the top-K routing decision, Loss-Free Balancing will first apply an expert-wise bias to the routing scores of each expert. By dynamically updating the bias of each expert according to its recent load, Loss-Free Balancing can consistently maintain a balanced distribution of expert load. In addition, since Loss-Free Balancing does not produce any interference gradients, it also elevates the upper bound of model performance gained from MoE training. We validate the performance of Loss-Free Balancing on MoE models with up to 3B parameters trained on up to 200B tokens. Experimental results show that Loss-Free Balancing achieves both better performance and better load balance compared with traditional auxiliary-loss-controlled load balancing strategies.
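
A minimal sketch of the described routing step: an expert-wise bias is added to the scores only for the top-K decision, then nudged against each expert's recent load. The update rate and the exact sign-based rule here are assumptions based on the description above.

```python
import torch

def loss_free_routing_step(scores, bias, top_k, update_rate=1e-3):
    """One biased top-K routing decision plus a load-based bias update.

    scores: [num_tokens, num_experts] router affinities.
    bias:   [num_experts] expert-wise bias used only for routing,
            so it injects no interference gradient into training.
    """
    topk_idx = (scores + bias).topk(top_k, dim=-1).indices

    # Count how many tokens each expert received in this batch.
    load = torch.zeros_like(bias)
    load.scatter_add_(0, topk_idx.reshape(-1),
                      torch.ones_like(topk_idx, dtype=bias.dtype).reshape(-1))

    # Lower the bias of overloaded experts, raise it for underloaded ones.
    bias = bias - update_rate * torch.sign(load - load.mean())
    return topk_idx, bias
```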

298Data-Centric Graph Condensation via Diffusion Trajectory Matching

[openreview] [pdf]

Abstract This paper introduces Data-Centric Graph Condensation (named DCGC), a data-centric and model-agnostic method for condensing a large graph into a smaller one by matching the distribution between the two graphs. DCGC defines the distribution of a graph as the trajectories of its node signals (such as node features and node labels) induced by a diffusion process over the geometric structure, which accommodates multi-order structural information. Built upon this, DCGC compresses the topological knowledge of the original graph into the orders-of-magnitude smaller synthetic one by aligning their distributions in input space. Compared with existing methods that stick to particular GNN architectures and require solving complicated optimization, DCGC can be flexibly applied to arbitrary off-the-shelf GNNs and achieves graph condensation with much faster speed. Apart from its cross-architecture generalization ability and training efficiency, experiments demonstrate that DCGC yields consistently superior performance over existing methods on datasets with varying scales and condensation ratios.

299LATABLE: TOWARDS LARGE TABULAR MODELS

[openreview] [pdf]

Abstract Tabular data is one of the most ubiquitous data modalities, yet the literature on tabular generative foundation models is lagging behind its text and vision counterparts. Large Tabular Models (LTMs) could revolutionize the way tabular data is used: not as any single dataset analyzed in a vacuum, but contextualized using its metadata and with respect to related datasets. Creating an LTM is difficult due to the heterogeneous feature spaces of different tabular datasets, metadata, and prior knowledge. In this work, we propose LaTable: a novel tabular diffusion model that addresses these challenges. We show LaTable can be trained across tabular datasets. Through extensive experiments, we find that LaTable displays early signs of scaling laws previously encountered in foundation model regimes. Moreover, LaTable outperforms baselines in out-of-distribution few-shot data generation.

300Risk-Sensitive Diffusion: Robustly Optimizing Diffusion Models with Noisy Samples

[openreview] [pdf]

Abstract Diffusion models are mainly studied on image data. However, non-image data (e.g., tabular data) are also prevalent in real applications and tend to be noisy due to some inevitable factors in the stage of data collection, degrading the generation quality of diffusion models. In this paper, we consider a novel problem setting where every collected sample is paired with a vector indicating the data quality: a risk vector. This setting applies to many scenarios involving noisy data, and we propose risk-sensitive SDE, a type of stochastic differential equation (SDE) parameterized by the risk vector, to address it. With proper coefficients, the risk-sensitive SDE can minimize the negative effect of noisy samples on the optimization of diffusion models. We conduct systematic studies for both Gaussian and non-Gaussian noise distributions, providing analytical forms of the risk-sensitive SDE. To verify the effectiveness of our method, we have conducted extensive experiments on multiple tabular and time-series datasets, showing that the risk-sensitive SDE permits a robust optimization of diffusion models with noisy samples and significantly outperforms previous baselines.

301CityNav: Language-Goal Aerial Navigation Dataset Using Geographic Information

[openreview] [pdf]

Abstract Vision-and-language navigation (VLN) aims to guide autonomous agents through real-world environments by integrating visual and linguistic cues. Despite notable advancements in ground-level navigation, the exploration of aerial navigation using these modalities remains limited. This gap primarily arises from a lack of suitable resources for real-world, city-scale aerial navigation studies. To remedy this gap, we introduce CityNav, a novel dataset explicitly designed for language-guided aerial navigation in photorealistic 3D environments of real cities. CityNav comprises 32k natural language descriptions paired with human demonstration trajectories, collected via a newly developed web-based 3D simulator. Each description identifies a navigation goal, utilizing the names and locations of landmarks within actual cities. As an initial step toward addressing this challenge, we provide baseline models of navigation agents that incorporate an internal 2D spatial map representing landmarks referenced in the descriptions. We have benchmarked the latest aerial navigation methods alongside our proposed baseline model on the CityNav dataset. The findings are revealing: (i) our aerial agent model, trained on human demonstration trajectories, outperforms those trained on shortest-path trajectories by a large margin; (ii) incorporating 2D spatial map information markedly and robustly enhances navigation performance at a city scale; (iii) despite the use of map information, our challenging CityNav dataset reveals a persistent performance gap between our baseline models and human performance. To foster further research in aerial VLN, we have made the dataset and code available at https://anonymous.4open.science/w/city-nav-77E3/.

302Discrete Inversion: A Controllable Latent Space for Multinomial Diffusion and Masked Generative Models

[openreview] [pdf]

Abstract Discrete diffusion models have achieved notable success in tasks like image generation and masked language modeling, yet they face limitations in controlled content editing. This paper introduces {\bf Discrete Inversion}, the first approach to enable precise inversion for discrete diffusion models, including multinomial diffusion and masked generative models. By recording noise sequences and masking patterns during the forward diffusion process, Discrete Inversion facilitates accurate reconstruction and controlled edits without the need for predefined masks or attention map manipulation. We demonstrate the effectiveness of our method across both image and text domains, evaluating it on models like VQ-Diffusion, Paella, and RoBERTa. Our results show that Discrete Inversion not only preserves high fidelity in the original data but also enables flexible and user-friendly editing in discrete spaces, significantly advancing the capabilities of discrete generative models.

303Fast Direct: Query-Efficient Online Black-box Guidance for Diffusion-model Target Generation

[openreview] [pdf]

Abstract Guided diffusion-model generation is a promising direction for customizing the generation process of a pre-trained diffusion model to address specific downstream tasks. Existing guided diffusion models either rely on training of the guidance model with pre-collected datasets or require the objective functions to be differentiable. However, for most real-world tasks, offline datasets are often unavailable, and the objective functions are often not differentiable, such as image generation with human preferences, molecular generation for drug discovery, and material design. Thus, we need an online algorithm capable of collecting data during runtime and supporting a black-box objective function. Moreover, the query efficiency of the algorithm is also critical because the objective evaluation of the query is often expensive in real-world scenarios. In this work, we propose a novel and simple algorithm, Fast Direct, for query-efficient online black-box target generation. Our Fast Direct builds a pseudo-target on the data manifold to update the noise sequence of the diffusion model with a universal direction, which is promising to perform query-efficient guided generation. Extensive experiments on twelve high-resolution (1024×1024) image target generation tasks and six 3D-molecule target generation tasks show 6× up to 10× query efficiency improvement and 11× up to 44× query efficiency improvement, respectively.

304Hierarchical Multiscale Diffuser for Extendable Long-Horizon Planning

[openreview] [pdf]

Abstract This paper introduces the Hierarchical Multiscale Diffuser (HM-Diffuser), a novel approach for efficient long-horizon planning. Building on recent advances in diffusion-based planning, our method addresses the challenge of planning over horizons significantly longer than those available in the training data. We decompose the problem into two key subproblems. The first phase, Progressive Trajectory Extension (PTE), involves stitching short trajectories together to create datasets with progressively longer trajectories. In the second phase, we train the HM-Diffuser on these extended datasets, preserving computational efficiency while enhancing long-horizon planning capabilities. The hierarchical structure of the HM-Diffuser allows for subgoal generation at multiple temporal resolutions, enabling a top-down planning approach that aligns high-level, long-term goals with low-level, short-term actions. Experimental results demonstrate that the combined PTE and HM-Diffuser approach effectively generates long-horizon plans, extending far beyond the originally provided trajectories.

305Multi-expert collaboration: Enhancing heterogeneous knowledge independence and alignment in knowledge distillation

[openreview] [pdf]

Abstract Heterogeneous multi-teacher knowledge distillation attempts to learn a versatile student neural network from multiple pre-trained heterogeneous teachers. However, current methods suffer from a lack of independence and alignment among the heterogeneous knowledge sources. To address this issue, we propose a novel method called Multi-Expert Collaboration (MEC). Our approach aggregates multiple expert classifiers within the student model, replacing the conventional single-head architecture. By ensuring that each expert's independent classifier operates without interfering with the others, we enhance the independence of the heterogeneous knowledge. Inspired by Helmholtz Free Energy (HFE) theory, we introduce an anchor-based HFE self-normalization strategy to align the heterogeneous knowledge effectively. This method ensures consistent energy levels across all classifiers, allowing the appropriate classifier to achieve the highest confidence for in-distribution data. Extensive experiments on CIFAR-100 and ImageNet-100 datasets demonstrate that MEC significantly outperforms existing heterogeneous multi-teacher knowledge distillation methods, achieving an average accuracy improvement of over 10%.

306Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction

[openreview] [pdf]

Abstract Generative modeling of discrete data underlies important applications spanning text-based agents like ChatGPT to the design of the very building blocks of life in protein sequences. However, application domains need to exert control over the generated data by steering the generative process—typically via RLHF—to satisfy a specified property, reward, or affinity metric. In this paper, we study the problem of steering Masked Diffusion Models (MDMs), a recent class of discrete diffusion models that offer a compelling alternative to traditional autoregressive models. We introduce Discrete Denoising Posterior Prediction (DDPP), a novel framework that casts the task of steering pretrained MDMs as a problem of probabilistic inference by learning to sample from a target Bayesian posterior. Our DDPP framework leads to a family of three novel objectives that are all simulation-free, and thus scalable while applying to general non-differentiable reward functions. Empirically, we instantiate DDPP by steering MDMs to perform class-conditional pixel-level image modeling, RLHF-based alignment of MDMs using text-based rewards, and finetuning protein language models to generate more diverse secondary structures and shorter proteins. We substantiate our designs via wet-lab validation, where we observe transient expression of reward-optimized protein sequences.

307Risk Informed Policy Learning for Safer Exploration

[openreview] [pdf]

Abstract Reinforcement learning algorithms typically necessitate extensive exploration of the state space to find optimal policies. However, in safety-critical applications, the risks associated with such exploration can lead to catastrophic consequences. Existing safe exploration methods mitigate this by imposing constraints, but these often result in overly conservative behaviours and inefficient learning. Overfitting on negative experiences hampers the agent’s ability to learn accurate risk representations, limiting its exploration of risky yet potentially high-reward regions of the state space. To address this, we introduce a method that explicitly learns state-conditioned risk representations by incorporating an inductive bias. By augmenting state features with these risk representations, our approach naturally encourages safer exploration without being excessively cautious, resulting in more efficient and safer policy learning. Empirical evaluations across diverse environments show that our method significantly improves task performance while reducing constraint violations during training, underscoring its effectiveness in balancing exploration with safety.

308OccProphet: Pushing the Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with an Observer-Forecaster-Refiner Framework

[openreview] [pdf]

Abstract Predicting variations in complex traffic environments is crucial for the safety of autonomous driving. Recent advancements in occupancy forecasting have enabled forecasting future 3D occupied status in driving environments by observing historical 2D images. However, high computational demands make occupancy forecasting less efficient during training and inference stages, hindering its feasibility for deployment on edge agents. In this paper, we propose a novel framework, i.e., OccProphet, to efficiently and effectively learn occupancy forecasting with significantly lower computational requirements while maintaining forecasting accuracy. OccProphet comprises three lightweight components: Observer, Forecaster, and Refiner. The Observer extracts spatio-temporal features from 3D using the proposed Efficient 4D Aggregation with Tripling-Attention Fusion, while the Forecaster and Refiner conditionally predict and refine future occupancy inferences. Experimental results on nuScenes, Lyft-Level5, and nuScenes-Occupancy datasets demonstrate that OccProphet is both training- and inference-friendly. OccProphet reduces 58%–78% of the computational cost with a 2.6× speedup compared with the state-of-the-art Cam4DOcc. Moreover, it achieves 4%–18% relatively higher forecasting accuracy. The code will be publicly available.

309Enhancing Training Robustness through Influence Measure

[openreview] [pdf]

Abstract In the field of machine learning, the pursuit of robust and accurate models is ongoing. A key aspect of achieving robustness lies in identifying which data points in the training set should be excluded and which high-quality, potentially unlabeled data points outside the training set should be incorporated to improve the model’s performance on unseen data. To accomplish this, an effective metric is needed to evaluate the contribution of each data point toward enhancing overall model performance. This paper proposes the use of an influence measure as a metric to assess the impact of training data on test set performance. Additionally, we introduce a data selection method to optimize the training set as well as a dynamic active learning algorithm driven by the influence measure. The effectiveness of these methods is demonstrated through extensive simulations and real-world datasets.

310PHI-S: Distribution Balancing for Agglomerative Models

[openreview] [pdf]

Abstract Various visual foundation models have distinct strengths and weaknesses, both of which can be improved through heterogeneous multi-teacher knowledge distillation without labels, termed “agglomerative models.” We build upon this body of work by studying the effect of the teachers’ activation statistics, particularly the impact of the loss function on the resulting student model quality. We explore a standard toolkit of statistical normalization techniques to better align the different distributions and assess their effects. Further, we examine the impact on downstream teacher-matching metrics, which motivates the use of Hadamard matrices. With these matrices, we demonstrate useful properties, showing how they can be used for isotropic standardization, where each dimension of a multivariate distribution is standardized using the same scale. We call this technique “PHI Standardization” (PHI-S) and empirically demonstrate that it produces the best student model across the suite of methods studied.
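
A sketch of the isotropic standardization idea with a Hadamard rotation, assuming the feature dimension is a power of two (`scipy.linalg.hadamard` requires this): the orthonormal rotation spreads variance evenly across dimensions, after which one shared scale standardizes them all. Details of how PHI-S is applied to teacher activations are simplified here.

```python
import numpy as np
from scipy.linalg import hadamard

def phi_standardize(X):
    """Isotropic standardization sketch: rotate with a normalized
    Hadamard matrix, then divide every dimension by one shared scale."""
    d = X.shape[1]                   # assumes d is a power of two
    H = hadamard(d) / np.sqrt(d)     # orthonormal Hadamard rotation
    Xc = X - X.mean(axis=0)          # center the activations
    Xr = Xc @ H.T                    # rotation evens out per-dim variance
    return Xr / Xr.std()             # a single isotropic scale for all dims

X = np.random.default_rng(0).normal(size=(1000, 64)) * [1.0] * 32 + [0.0] * 32
print(phi_standardize(X).std(axis=0)[:4])  # per-dim stds are near-equal
```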

311Generative bandit optimization via diffusion posterior sampling

[openreview] [pdf]

Abstract Many real-world discovery problems, including drug and material design, can be modeled within the bandit optimization framework, where an agent selects a sequence of experiments to efficiently optimize an unknown reward function. However, classic bandit algorithms operate on fixed finite or continuous action sets, making discovering novel designs impossible in the former case, and often leading to the curse of dimensionality in the latter, thus rendering these methods impractical. In this work, we first formalize the generative bandit setting, where an agent wishes to maximize an unknown reward function over the support of a data distribution, often called the data manifold, which implicitly encodes complex constraints (e.g., the geometry of valid molecules), and from which (unlabeled) sample data is available (e.g., a dataset of valid molecules). We then propose Diffusion Posterior Sampling (DiffPS), an algorithm that tackles the exploration-exploitation problem directly on the learned data manifold by leveraging a conditional diffusion model. We formally show that the statistical complexity of DiffPS adapts to the intrinsic dimensionality of the data, overcoming the curse of dimensionality in high-dimensional settings. Our experimental evaluation supports the theoretical claims and demonstrates promising performance in practice.

312CoDiCast: Conditional Diffusion Model for Weather Prediction with Uncertainty Quantification

[openreview] [pdf]

Abstract Accurate weather forecasting is critical for science and society. Yet, existing methods have not demonstrated high accuracy, low uncertainty, and high computational efficiency simultaneously. On one hand, to quantify the uncertainty in weather predictions, the strategy of ensemble forecast (i.e., generating a set of diverse predictions) is often employed. However, traditional ensemble numerical weather prediction (NWP) is computationally intensive. On the other hand, even though most existing machine learning-based weather prediction (MLWP) approaches are efficient and accurate, they are deterministic and cannot capture the uncertainty of weather forecasting. To tackle these challenges, we propose CoDiCast, a conditional diffusion model to generate accurate global weather prediction, while achieving uncertainty quantification and modest computational cost. The key idea behind the prediction task is to generate realistic weather scenarios at a future time point, conditioned on observations from the recent past. Due to the probabilistic nature of diffusion models, they can be properly applied to capture the uncertainty of weather predictions. Therefore, we accomplish uncertainty quantification by repeatedly sampling from stochastic Gaussian noise for each initial weather state and running the denoising process multiple times. Experimental results demonstrate that CoDiCast outperforms several existing MLWP methods in accuracy, and is faster than NWP models in inference speed. CoDiCast can generate 3-day global weather forecasts, at 6-hour steps and 5.625° latitude-longitude resolutions, for over 5 variables, in about 12 minutes on a commodity A100 GPU machine with 80GB memory. The anonymous code is provided at https://anonymous.4open.science/r/CoDiCast/.
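
The ensemble mechanism described above reduces to repeated reverse diffusion from independent noise draws. In this sketch, `sample_fn` stands for one full conditional denoising run and is an assumed interface, as is the choice that forecasts share the condition's shape.

```python
import numpy as np

def ensemble_forecast(sample_fn, condition, n_members=20, seed=0):
    """Ensemble forecast via repeated sampling from a conditional diffusion
    model; the spread across members quantifies forecast uncertainty."""
    rng = np.random.default_rng(seed)
    members = np.stack([
        # Each member starts the reverse process from fresh Gaussian noise.
        sample_fn(condition, rng.standard_normal(condition.shape))
        for _ in range(n_members)
    ])
    return members.mean(axis=0), members.std(axis=0)  # point forecast, spread
```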

313Boosting Offline Multi-Objective Reinforcement Learning via Preference Conditioned Diffusion Models

[openreview] [pdf]

Abstract Multi-objective reinforcement learning (MORL) addresses sequential decision-making problems with multiple objectives by learning policies optimized for diverse preferences. While traditional methods necessitate costly online interaction with the environment, recent approaches leverage static datasets containing pre-collected trajectories, making offline MORL the preferred choice for real-world applications. However, existing offline MORL techniques suffer from limited expressiveness and poor generalization on out-of-distribution (OOD) preferences. To overcome these limitations, we propose Diffusion-based Multi-Objective Reinforcement Learning (DiffMORL), a generalizable diffusion-based planning framework for MORL. Leveraging the strong expressiveness and generation capability of diffusion models, DiffMORL further boosts its generalization through offline data mixup, which mitigates the memorization phenomenon and facilitates feature learning by data augmentation. By training on the augmented data, DiffMORL is able to condition on a given preference, whether in-distribution or OOD, to plan the desired trajectory and extract the corresponding action. Experiments conducted on the D4MORL benchmark demonstrate that DiffMORL achieves state-of-the-art results across nearly all tasks. Notably, it surpasses the best baseline on most tasks, underscoring its remarkable generalization ability in offline MORL scenarios.

314A Trajectory Probability Network for City-Scale Road Volume Prediction

[openreview] [pdf]

Abstract City-scale road volume prediction is a fundamental task in traffic management. However, the observation data are often incomplete and biased, posing a challenge for accurate prediction. Existing methods address this issue through interpolation techniques or manual priors, but they typically provide only a deterministic restoration, overlooking the influence of other potential scenarios. To overcome these limitations, we propose a novel neural network-based probabilistic model, the Trajectory Probability Network (TraPNet), which predicts traffic volume through the aggregation of the joint distribution of potential trajectories. TraPNet makes full use of current observations, historical data, and road network information to offer a comprehensive inference of road volumes. Unlike autoregressive methods, TraPNet makes predictions in a single step, substantially reducing computational time while maintaining high predictive accuracy. Experiments on real-world road networks demonstrate that TraPNet outperforms state-of-the-art methods, and can keep this advantage with only a 20% observation ratio. The code will be made publicly available.

315Disentangling data distribution for Federated Learning

[openreview] [pdf]

Abstract Federated Learning (FL) facilitates collaborative training of a global model whose performance is boosted by private data owned by distributed clients, without compromising data privacy. Yet the wide applicability of FL is hindered by entanglement of data distributions across different clients. This paper demonstrates for the first time that by disentangling data distributions FL can in principle achieve efficiencies comparable to those of distributed systems, requiring only one round of communication. To this end, we propose a novel FedDistr algorithm, which employs stable diffusion models to decouple and recover data distributions. Empirical results on the CIFAR100 and DomainNet datasets show that FedDistr significantly enhances model utility and efficiency in both disentangled and near-disentangled scenarios while ensuring privacy, outperforming traditional federated learning methods.

316Cross-Domain Offline Policy Adaptation with Optimal Transport and Dataset Constraint

[openreview] [pdf]

Abstract Offline reinforcement learning (RL) often struggles with limited data. This work explores cross-domain offline RL where offline datasets (with possibly sufficient data) from another domain can be accessed to facilitate policy learning. However, the underlying environments of the two datasets may have dynamics mismatches, incurring inferior performance when simply merging the data of two domains. Existing methods mitigate this issue by training domain classifiers, using contrastive learning methods, etc. Nevertheless, they still rely on a large amount of target domain data to function well. Instead, we address this problem by establishing a concrete performance bound of a policy given datasets from two domains. Motivated by the theoretical insights, we propose to align transitions in the two datasets using optimal transport and selectively share source domain samples, without training any neural networks. This enables reliable data filtering even given a few target domain data. Additionally, we introduce a dataset regularization term that ensures the learned policy remains within the scope of the target domain dataset, preventing it from being biased towards the source domain data. Consequently, we propose the Optimal Transport Data Filtering (dubbed OTDF) method and examine its effectiveness by conducting extensive experiments across various dynamics shift conditions (e.g., gravity shift, morphology shift), given limited target domain data. It turns out that OTDF exhibits superior performance on many tasks and dataset qualities, often surpassing prior strong baselines by a large margin.
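
A sketch of OT-based filtering on toy 2-D "transitions" using the POT library: compute the optimal coupling between source and target samples, score each source sample by its transport cost under the plan, and keep the cheapest fraction. The scoring rule and `keep_frac` are assumptions; OTDF's actual selection criterion and transition representation may differ.

```python
import numpy as np
import ot  # POT: Python Optimal Transport

def ot_filter_source(source, target, keep_frac=0.5):
    """Keep the source samples that transport most cheaply onto the target."""
    M = ot.dist(source, target)                  # pairwise squared distances
    a = np.full(len(source), 1.0 / len(source))  # uniform source marginal
    b = np.full(len(target), 1.0 / len(target))  # uniform target marginal
    G = ot.emd(a, b, M)                          # optimal transport plan
    # Per-source-sample expected transport cost under the optimal plan.
    cost = (G * M).sum(axis=1) * len(source)
    return np.argsort(cost)[: int(keep_frac * len(source))]

rng = np.random.default_rng(0)
src = rng.normal(0.0, 1.0, size=(200, 2))   # source-domain samples
tgt = rng.normal(0.5, 1.0, size=(50, 2))    # few target-domain samples
print(ot_filter_source(src, tgt, keep_frac=0.25).shape)
```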

317Federated Adapter on Foundation Models: An Out-Of-Distribution Approach

[openreview] [pdf]

Abstract As foundation models gain increasing attention from both academic and industrial communities, Federated Foundation Models (FedFM) have emerged as a privacy-preserving approach for collaboratively fine-tuning models in federated learning (FL) frameworks using distributed datasets across multiple clients. A key challenge for FedFM, given the versatile nature of foundation models, is addressing out-of-distribution (OOD) generalization, where unseen tasks or clients may exhibit distribution shifts leading to suboptimal performance. Although numerous studies have explored OOD generalization in conventional FL, these methods are inadequate for FedFM due to the challenges posed by large parameter scales and increased data heterogeneity: large parameter scales result in high computational and communication costs, while increased data heterogeneity (e.g., cross-domain data) leads to suboptimal performance of the aggregated global model on individual client distributions. To bridge this gap, we propose a new method, called FedOA, to enhance the OOD generalization of FedFM under these conditions. Specifically, our method employs adapter-based parameter-efficient fine-tuning for efficient learning, and introduces an additional personalized model with feature distance-based regularization to ensure distribution alignment and provide OOD generalization guarantees for each client. Theoretically, we demonstrate that the conventional aggregated global model in FedFM inherently retains OOD generalization capabilities, and our proposed method enhances the personalized model's OOD generalization through regularization informed by the global model, with proven convergence under general non-convex settings. Empirically, the effectiveness of the proposed method is validated on benchmark datasets across various NLP tasks.

318Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control

[openreview] [pdf]

Abstract Diffusion models excel at capturing complex data distributions, such as those of natural images and proteins. While diffusion models are trained to represent the distribution in the training dataset, we often are more concerned with other properties, such as the aesthetic quality of the generated images or the functional properties of generated proteins. Diffusion models can be finetuned in a goal-directed way by maximizing the value of some reward function (e.g., the aesthetic quality of an image). However, these approaches may lead to reduced sample diversity, significant deviations from the training data distribution, and even poor sample quality due to the exploitation of an imperfect reward function. The last issue often occurs when the reward function is a learned model meant to approximate a ground-truth “genuine” reward, as is the case in many practical applications. These challenges, collectively termed “reward collapse,” pose a substantial obstacle. To address this reward collapse, we frame the finetuning problem as entropy-regularized control against the pretrained diffusion model, i.e., directly optimizing entropy-enhanced rewards with neural SDEs. We present theoretical and empirical evidence that demonstrates our framework is capable of efficiently generating diverse samples with high genuine rewards, mitigating the overoptimization of imperfect reward models.

319Understanding Impact of Human Feedback via Influence Functions

[openreview] [pdf]

Abstract In Reinforcement Learning from Human Feedback (RLHF), it is crucial to learn suitable reward models from human feedback to align large language models (LLMs) with human intentions. However, human feedback can often be noisy, inconsistent, or biased, especially when evaluating complex responses. Such feedback can lead to misaligned reward signals, potentially causing unintended side effects during the RLHF process. To address these challenges, we explore the use of influence functions to measure the impact of human feedback on the performance of reward models. We propose a compute-efficient approximation method that enables the application of influence functions to LLM-based reward models and large-scale preference datasets. In our experiments, we demonstrate two key applications of influence functions: (1) detecting common forms of labeler bias in human feedback datasets and (2) guiding labelers to refine their strategies to align more closely with expert feedback. By quantifying the impact of human feedback on reward models, we believe that influence functions can enhance feedback interpretability and contribute to scalable oversight in RLHF, helping labelers provide more accurate and consistent feedback.
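
At its simplest, an influence estimate asks how a training example's gradient aligns with the gradient of the validation loss. The sketch below uses the identity-Hessian approximation, which is far cruder than the compute-efficient estimator the paper proposes; it only illustrates the kind of score being computed per feedback example.

```python
import torch

def influence_scores(train_grads, val_grad):
    """Approximate influence of each training example on validation loss.

    Exact influence is -grad_val^T H^{-1} grad_z; taking H ~ I reduces it
    to a gradient inner product (a common Hessian-free simplification).
    """
    return torch.stack([-(g * val_grad).sum() for g in train_grads])

# Example usage (assumed per-example gradients): flag the feedback whose
# influence on the validation loss is most harmful.
# harmful = influence_scores(grads, val_grad).topk(10).indices
```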

320Diffusion Models Are Real-Time Game Engines

[openreview] [pdf]

Abstract We present GameNGen, the first game engine powered entirely by a neural model that also enables real-time interaction with a complex environment over long trajectories at high quality. When trained on the classic game DOOM, GameNGen extracts gameplay and uses it to generate a playable environment that can interactively simulate new trajectories. GameNGen runs at 20 frames per second on a single TPU and remains stable over extended multi-minute play sessions. Next frame prediction achieves a PSNR of 29.4, comparable to lossy JPEG compression. Human raters are only slightly better than random chance at distinguishing short clips of the game from clips of the simulation, even after 5 minutes of auto-regressive generation. GameNGen is trained in two phases: (1) an RL-agent learns to play the game and the training sessions are recorded, and (2) a diffusion model is trained to produce the next frame, conditioned on the sequence of past frames and actions. Conditioning augmentations help ensure stable auto-regressive generation over long trajectories, and decoder fine-tuning improves the fidelity of visual details and text.

321On the Convergence of FedProx with Extrapolation and Inexact Prox

[openreview] [pdf]

Abstract Enhancing the FedProx federated learning algorithm (Li et al., 2020) with server-side extrapolation, Li et al. (2024a) recently introduced the FedExProx method. Their theoretical analysis, however, relies on the assumption that each client computes a certain proximal operator exactly, which is impractical since this is virtually never possible to do in real settings. In this paper, we investigate the behavior of FedExProx without this exactness assumption in the smooth and globally strongly convex setting. We establish a general convergence result, showing that inexactness leads to convergence to a neighborhood of the solution. Additionally, we demonstrate that, with careful control, the adverse effects of this inexactness can be mitigated. By linking inexactness to biased compression (Beznosikov et al., 2023), we refine our analysis, highlighting the robustness of extrapolation to inexact proximal updates. We also examine the local iteration complexity required by each client to achieve the required level of inexactness using various local optimizers. Our theoretical insights are validated through comprehensive numerical experiments.
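
A schematic of the setting, with step sizes, iteration counts, and the gradient-descent inner solver as assumptions: each client only approximates its proximal point with a few local steps, and the server extrapolates through the average of those approximate points.

```python
import numpy as np

def inexact_prox(grad_f, x_t, gamma, steps=10, lr=0.1):
    """Approximate prox_{gamma f}(x_t) with a few gradient steps.
    Fewer steps means a more inexact proximal point."""
    z = x_t.copy()
    for _ in range(steps):
        # Gradient of f(z) + (1 / (2*gamma)) * ||z - x_t||^2.
        z -= lr * (grad_f(z) + (z - x_t) / gamma)
    return z

def fedexprox_style_round(client_grads, x_t, gamma, alpha):
    """One round: average the clients' inexact proximal points, then
    extrapolate from the server iterate (alpha > 1 gives extrapolation)."""
    prox_points = [inexact_prox(g, x_t, gamma) for g in client_grads]
    avg = np.mean(prox_points, axis=0)
    return x_t + alpha * (avg - x_t)
```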

322Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning

[openreview] [pdf]

Abstract In offline reinforcement learning, it is necessary to manage out-of-distribution actions to prevent overestimation of value functions. One class of methods, policy-regularized methods, addresses this problem by constraining the target policy to stay close to the behavior policy. Although several approaches suggest representing the behavior policy as an expressive diffusion model to boost performance, it remains unclear how to regularize the target policy given a diffusion-modeled behavior sampler. In this paper, we propose Diffusion Actor-Critic (DAC), which formulates the Kullback-Leibler (KL) constrained policy iteration as a diffusion noise regression problem, enabling direct representation of target policies as diffusion models. Our approach follows the actor-critic learning paradigm, in which we alternately train a diffusion-modeled target policy and a critic network. The actor training loss includes a soft Q-guidance term from the Q-gradient. The soft Q-guidance is grounded in the theoretical solution of the KL-constrained policy iteration, which prevents the learned policy from taking out-of-distribution actions. We demonstrate that such diffusion-based policy constraint, along with the coupling of the lower confidence bound of the Q-ensemble as value targets, not only preserves the multi-modality of target policies but also contributes to stable convergence and strong performance in DAC. Our approach is evaluated on the D4RL benchmarks and outperforms the state-of-the-art in nearly all environments.

323Denoising Diffusion Causal Discovery

[openreview] [pdf]

Abstract A common theme across multiple disciplines of science is to understand the underlying dependencies between variables from observational data. Such dependencies are often modeled as Bayesian Network (BNs), which by definition are Directed Acyclic Graphs (DAGs). Recent advancements, such as NOTEARS and DAG-GNN, have focused on formulating continuous DAG constraints and learning DAGs via continuous optimization. However, these methods often have scalability issues and face challenges when applied to real world data. In this paper, we propose Denoising Diffusion Causal Discovery (DDCD), a new learning framework that leverages Denoising Diffusion Probabilistic Models (DDPMs) for causal structural learning. Using the denoising objective, our method allows the model to explore a wider range of noise in the data and effectively captures both linear and nonlinear dependencies. It also has reduced complexity and is more suitable for inference of larger networks. To accommodate potential feedback loops in biological networks, we propose a k-hop DAG constraint. Additionally, we suggest using fixed-size bootstrap sampling to ensure similar training performance across varying dataset sizes. Our experiments on synthetic data demonstrate that DDCD achieves consistent competitive performance compared to existing methods while noticeably reducing computation time. We also show that DDCD can generate trustworthy networks from real-world datasets.
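
One natural reading of a k-hop DAG constraint, shown as a sketch: truncate the NOTEARS penalty tr(exp(W∘W)) − d, whose i-th series term counts weighted cycles of length i, at order k, so only short cycles are penalized and longer feedback loops are tolerated. The exact form used by DDCD is an assumption here.

```python
import torch

def k_hop_dag_penalty(W, k):
    """Acyclicity penalty restricted to cycles of length <= k (sketch).

    Standard NOTEARS uses tr(exp(W*W)) - d, penalizing cycles of every
    length; truncating the exponential series at k relaxes the constraint
    for cycles longer than k hops.
    """
    d = W.shape[0]
    A = W * W                    # non-negative edge strengths
    P = torch.eye(d, dtype=A.dtype)
    penalty = torch.zeros((), dtype=A.dtype)
    fact = 1.0
    for i in range(1, k + 1):
        P = P @ A                # P = A^i after this step
        fact *= i
        penalty = penalty + torch.trace(P) / fact  # tr(A^i)/i! counts i-cycles
    return penalty
```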

324MGD3: Mode-Guided Dataset Distillation using Diffusion Models

[openreview] [pdf]

Abstract Dataset distillation aims to synthesize a smaller training set from a large dataset such that a model trained on this distilled set performs comparably to one trained on the entire dataset. For image classification, earlier methods proposed optimization strategies in the input space to synthesize a distilled dataset, but they are computationally expensive and difficult to scale to higher resolutions. Also, the datasets synthesized by these methods lack intra-class diversity as they ignore the modes of the data distribution. Recent works propose using generative models, among which diffusion models have shown promising results as they are known to capture the data distribution effectively. However, diffusion models tend to over-sample from the prominent modes of the data distribution, resulting in limited diversity in the generated samples. To address these limitations, we propose a mode-guided diffusion model in this work. Unlike existing works that fine-tune diffusion models for dataset distillation, we propose to use a pre-trained model without the need for fine-tuning. Our novel approach consists of three stages: Mode Discovery, Mode Guidance, and Stop Guidance. In the first stage, we discover distinct modes in the data distribution of a class to build a representative set. In the second stage, we use a pre-trained diffusion model and guide the diffusion process toward the discovered modes to generate distinct samples, ensuring intra-class diversity. However, mode-guided sampling can introduce artifacts in the synthetic samples, which affect performance. To control the fidelity of the synthetic dataset, we introduce Stop Guidance. We evaluate our method on multiple benchmark datasets, including ImageNette, ImageIDC, ImageNet-100, and ImageNet-1K; our method improves over the current state-of-the-art by 4.4%, 2.9%, 1.6%, and 1.6% on the respective datasets. In addition, our method does not require retraining of the diffusion model, which reduces computational requirements. We also demonstrate that our approach is effective with general-purpose diffusion models such as text-to-image Stable Diffusion, eliminating the need for a model pre-trained on the target dataset.

325Direct Judgement Preference Optimization

[openreview] [pdf]

Abstract Auto-evaluation is crucial for assessing response quality and offering feedback for model development. Recent studies have explored training large language models (LLMs) as generative judges to both evaluate model responses and generate natural language critiques. However, existing models have been trained almost exclusively with supervised fine-tuning (SFT), often only on a small number of datasets, resulting in poor generalization across different evaluation settings and tasks. In this paper, we investigate how learning from both positive and negative data with direct preference optimization (DPO) enhances the evaluation capabilities of LLM judges across three evaluation tasks: pairwise comparison, single rating, and binary classification. We achieve this by creating three forms of DPO data from a diverse collection of human and synthetic judgements on contemporary model outputs, with the goal of training our model to generate meaningful critiques, make accurate judgements, and understand what constitutes good and bad responses for a given user input. To demonstrate the effectiveness of our method, we train judge models at three sizes (8B, 12B, and 70B parameters) and conduct a comprehensive study over 13 benchmarks (7 pairwise, 4 single rating, and 2 classification), measuring agreement with human and GPT-4 annotations. Our models exhibit the best aggregate performance, with even our 8B model outperforming strong baselines like GPT-4o and specialized judge models, such as OffsetBias-8B, Auto-J-13B, Prometheus-2-8x7B, and Skywork-Critic-70B, in pairwise benchmarks. Further analysis shows that our judge model robustly counters biases such as position and length bias, flexibly adapts to practitioner-specified evaluation protocols, and provides helpful language feedback for improving downstream generator models.
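
For context, the generic pairwise DPO objective that such judge training builds on can be written in a few lines; the sketch below uses dummy sequence log-probabilities rather than anything from the paper's data pipeline.

```python
# Generic DPO loss: prefer the chosen judgement over the rejected one,
# regularized toward a frozen reference model. Dummy log-probs stand in
# for sums of token log-likelihoods over each response.
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    # Implicit reward: how much more the policy prefers each response
    # than the reference model does.
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)
    return -F.logsigmoid(chosen_reward - rejected_reward).mean()

# Toy batch of summed sequence log-probabilities.
loss = dpo_loss(torch.tensor([-12.0, -9.5]), torch.tensor([-14.0, -9.0]),
                torch.tensor([-13.0, -10.0]), torch.tensor([-13.5, -9.8]))
print(loss.item())
```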

326FedSUV: Validity and Utility-guided Client Selection for Federated Learning

[openreview] [pdf]

Abstract Federated Learning faces significant challenges arising from two critical uncertainties: the validity of a client’s participation, which can be compromised by network and system heterogeneity, and the utility of the data contributed by each client, which varies due to heterogeneous statistical data. Traditional client selection methods often treat these uncertainties as a whole, leading to suboptimal performance. To address this issue, we propose FedSUV, an innovative client selection framework that decouples validity and utility uncertainties. FedSUV approaches client selection from a multi-objective optimization perspective, employing advanced bandit algorithms: a confidence bound-based linear contextual bandit for assessing validity and a Gaussian Process bandit for evaluating utility. We validate the effectiveness of FedSUV through both theoretical analysis and large-scale experiments conducted within our physical cluster.

327Fast constrained sampling in pre-trained diffusion models

[openreview] [pdf]

Abstract Diffusion models have dominated the field of large, generative image models, with the prime examples of Stable Diffusion and DALL-E 3 being widely adopted. These models have been trained to perform text-conditioned generation on vast numbers of image-caption pairs and as a byproduct, have acquired general knowledge about natural image statistics. However, when confronted with the task of constrained sampling, e.g. generating the right half of an image conditioned on the known left half, applying these models is a delicate and slow process, with previously proposed algorithms relying on expensive iterative operations that are usually orders of magnitude slower than text-based inference. This is counter-intuitive, as image-conditioned generation should rely less on the difficult-to-learn semantic knowledge that links captions and imagery, and should instead be achievable by lower-level correlations among image pixels. In practice, inverse models are trained or tuned separately for each inverse problem, e.g. by providing parts of images during training as an additional condition, to allow their application in realistic settings. However, we argue that this is not necessary and propose an algorithm for fast constrained sampling in large pre-trained diffusion models (Stable Diffusion) that requires no expensive backpropagation operations through the model and produces results comparable even to the state-of-the-art tuned models. Our method is based on a novel optimization perspective on sampling under constraints and employs a numerical approximation to the expensive gradients, previously computed using backpropagation, incurring significant speed-ups.
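
The abstract does not spell out the numerical gradient approximation, so the sketch below uses one standard derivative-free estimator (SPSA-style random perturbations) to enforce a known-left-half constraint without backpropagation; treat it as a stand-in for the paper's scheme, not its actual algorithm.

```python
# Derivative-free guidance sketch: approximate the gradient of a constraint
# loss w.r.t. the sample with SPSA (random-perturbation finite differences),
# avoiding backpropagation entirely. Stand-in estimator, toy 1-D "image".
import numpy as np

rng = np.random.default_rng(0)
y = np.ones(16)            # known left half (the constraint)
mask = np.arange(32) < 16  # which coordinates are constrained

def constraint_loss(x):
    return 0.5 * np.sum((x[mask] - y) ** 2)

def spsa_grad(f, x, eps=1e-3, n_dirs=8):
    """Average of simultaneous-perturbation two-point gradient estimates."""
    g = np.zeros_like(x)
    for _ in range(n_dirs):
        d = rng.choice([-1.0, 1.0], size=x.shape)  # Rademacher direction
        g += (f(x + eps * d) - f(x - eps * d)) / (2 * eps) * d
    return g / n_dirs

x = rng.normal(size=32)
for _ in range(200):  # gradient steps in place of a full guided sampling loop
    x -= 0.05 * spsa_grad(constraint_loss, x)
print("constraint residual:", np.abs(x[mask] - y).max())
```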

328Can We Ignore Labels in Out of Distribution Detection?

[openreview] [pdf]

Abstract Out-of-distribution (OOD) detection methods have recently become more prominent, serving as a core element in safety-critical autonomous systems. One major purpose of OOD detection is to reject invalid inputs that could lead to unpredictable errors and compromise safety. Due to the cost of labeled data, recent works have investigated the feasibility of self-supervised learning (SSL) OOD detection, unlabeled OOD detection, and zero-shot OOD detection. In this work, we identify a set of conditions for a theoretical guarantee of failure in unlabeled OOD detection algorithms from an information-theoretic perspective. These conditions are present in all OOD tasks dealing with real-world data: I) we provide theoretical proof of unlabeled OOD detection failure when there exists zero mutual information between the learning objective and the in-distribution labels, a.k.a. ‘label blindness’, II) we define a new OOD task – Adjacent OOD detection – that tests for label blindness and accounts for a previously ignored safety gap in all OOD detection benchmarks, and III) we perform experiments demonstrating that existing unlabeled OOD methods fail under conditions suggested by our label blindness theory and analyze the implications for future research in unlabeled OOD methods.

329Rectified Diffusion Guidance for Conditional Generation

[openreview] [pdf]

Abstract Classifier-Free Guidance (CFG), which combines the conditional and unconditional score functions with two coefficients summing to one, serves as a practical technique for diffusion model sampling. Theoretically, however, denoising with CFG cannot be expressed as a reciprocal diffusion process, which may consequently leave some hidden risks during use. In this work, we revisit the theory behind CFG and rigorously confirm that the improper configuration of the combination coefficients (i.e., the widely used summing-to-one version) brings about an expectation shift of the generative distribution. To rectify this issue, we propose ReCFG with a relaxation on the guidance coefficients such that denoising with ReCFG strictly aligns with the diffusion theory. We further show that our approach enjoys a closed-form solution given the guidance strength. That way, the rectified coefficients can be readily pre-computed via traversing the observed data, leaving the sampling speed barely affected. Empirical evidence on real-world data demonstrates the compatibility of our post-hoc design with existing state-of-the-art diffusion models, including both class-conditioned ones (e.g., EDM2 on ImageNet) and text-conditioned ones (e.g., SD3 on CC12M), without any retraining. We will open-source the code to facilitate further research.
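
A minimal illustration of the relaxation: standard CFG ties the two coefficients to sum to one, while a ReCFG-style combination leaves both free. The numeric coefficients below are placeholders; the paper derives the rectified values in closed form from observed data.

```python
# Classifier-free guidance combines conditional and unconditional scores.
# Standard CFG constrains the coefficients to sum to one; the relaxed form
# does not. Placeholder numbers only, not the paper's derived coefficients.
import numpy as np

def cfg(eps_cond, eps_uncond, w):
    # Standard CFG: coefficients (1 + w) and -w sum to one.
    return (1 + w) * eps_cond - w * eps_uncond

def recfg(eps_cond, eps_uncond, gamma_c, gamma_u):
    # Relaxed combination: gamma_c + gamma_u need not equal one.
    return gamma_c * eps_cond + gamma_u * eps_uncond

eps_c, eps_u = np.array([0.8, -0.2]), np.array([0.5, 0.1])
print(cfg(eps_c, eps_u, w=3.0))        # summing-to-one form
print(recfg(eps_c, eps_u, 3.7, -2.9))  # placeholder rectified coefficients
```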

330Quality Diversity Imitation Learning

[openreview] [pdf]

Abstract Imitation learning (IL) has shown great potential in various applications, such as robot control. However, traditional IL methods are usually designed to learn only one specific type of behavior since demonstrations typically correspond to a single expert. In this work, we introduce the first generic framework for Quality Diversity Imitation Learning (QD-IL), which enables the agent to learn a broad range of skills from limited demonstrations. Our framework integrates the principles of quality diversity with adversarial imitation learning (AIL) methods, and can potentially improve any inverse reinforcement learning (IRL) method. Empirically, our framework significantly improves the QD performance of GAIL and VAIL on challenging continuous control tasks derived from MuJoCo environments. Moreover, our method even achieves 2x expert performance in the most challenging Humanoid environment.

331C2INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention

[openreview] [pdf]

Abstract Trajectory prediction for multi-agents in complex scenarios is crucial for applications like autonomous driving. However, existing methods often overlook environmental biases, which leads to poor generalization. Additionally, hardware constraints limit the use of large-scale data across environments, and continual learning settings exacerbate the challenge of catastrophic forgetting. To address these issues, we propose the Continual Causal Intervention (C2INet) method for generalizable multi-agent trajectory prediction within a continual learning framework. Using variational inference, we align the environment-related prior with the posterior estimator of confounding factors in the latent space, thereby intervening in the causal correlations that affect trajectory representation. Furthermore, we store optimal variational priors across various scenarios using a memory queue, ensuring continuous debiasing during incremental task training. The proposed C2INet enhances adaptability to diverse tasks while preserving previous task information to prevent catastrophic forgetting. It also incorporates pruning strategies to mitigate overfitting. Comparative evaluations on three real and synthetic complex datasets against state-of-the-art methods demonstrate that our proposed method consistently achieves reliable prediction performance, effectively mitigating confounding factors unique to different scenarios. This highlights the practical value of our method for real-world applications.

332Neural Approximate Mirror Maps for Constrained Diffusion Models

[openreview] [pdf]

Abstract Diffusion models excel at creating visually convincing images, but they often struggle to meet subtle constraints inherent in the training data. Such constraints could be physics-based (e.g., satisfying a PDE), geometric (e.g., respecting symmetry), or semantic (e.g., including a particular number of objects). When the training data all satisfy a certain constraint, enforcing this constraint on a diffusion model makes it more reliable for generating valid synthetic data and solving constrained inverse problems. However, existing methods for constrained diffusion models are restricted in the constraints they can handle. For instance, recent work proposed to learn mirror diffusion models (MDMs), but analytical mirror maps only exist for convex constraints and can be challenging to derive. We propose neural approximate mirror maps (NAMMs) for general, possibly non-convex constraints. Our approach only requires a differentiable distance function from the constraint set. We learn an approximate mirror map that transforms data into an unconstrained space and a corresponding approximate inverse that maps data back to the constraint set. A generative model, such as an MDM, can then be trained in the learned mirror space and its samples restored to the constraint set by the inverse map. We validate our approach on a variety of constraints, showing that compared to an unconstrained diffusion model, a NAMM-based MDM substantially improves constraint satisfaction. We also demonstrate how existing diffusion-based inverse-problem solvers can be easily applied in the learned mirror space to solve constrained inverse problems.

333Uncertainty-Regularized Diffusional Subgoals for Hierarchical Reinforcement Learning

[openreview] [pdf]

Abstract Hierarchical reinforcement learning (HRL) aims to solve complex tasks by making decisions across multiple levels of temporal abstraction. However, off-policy training of hierarchical policies faces non-stationarity issues because the low-level policy is constantly changing, which makes it difficult for the high-level policy that generates subgoals to adapt. In this paper, we propose a conditional diffusion model-based approach for subgoal generation to mitigate these non-stationarity challenges. Specifically, we employ a Gaussian Process (GP) prior on subgoal generation as a surrogate distribution to regularize the diffusion policy and inform the diffusion process about uncertain areas in the action space. We introduce adaptive inducing states to facilitate sparse GP-based subgoal generation, enhancing sample efficiency and promoting better exploration in critical regions of the state space. Building on this framework, we develop an exploration strategy that identifies promising subgoals based on the learned predictive distribution of the diffusional subgoals. Experimental results demonstrate significant improvements in both sample efficiency and performance on challenging continuous control benchmarks compared to prior HRL methods.

334Scaling Optimal LR Across Token Horizons

[openreview] [pdf]

Abstract State-of-the-art LLMs are powered by scaling -- scaling model size, dataset size, and cluster size. It is economically infeasible to extensively tune hyperparameters for the largest runs. Instead, approximately optimal hyperparameters must be inferred or transferred from smaller experiments. Hyperparameter transfer across model sizes has been studied in Yang et al. However, hyperparameter transfer across dataset size -- or token horizon -- has not been studied yet. To remedy this we conduct a large-scale empirical study on how the optimal learning rate (LR) depends on the token horizon in LLM training. We first demonstrate that the optimal LR changes significantly with token horizon -- longer training necessitates a smaller LR. Secondly, we demonstrate that the optimal LR follows a scaling law and that the optimal LR for longer horizons can be accurately estimated from shorter horizons via such scaling laws. We also provide a rule-of-thumb for transferring LR across token horizons with zero overhead over current practices. Lastly, we provide evidence that LLama-1 used too high an LR, and argue that hyperparameter transfer across data size is an overlooked component of LLM training.
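
A rule of this kind can be applied by fitting a power law to tuned LRs from short-horizon runs and extrapolating in log-log space. The sketch below does exactly that on made-up numbers; neither the horizons nor the LRs are measurements from the paper.

```python
# Fit a power-law rule lr*(T) = a * T^b (with b < 0) to optimal LRs measured
# at short token horizons, then extrapolate to a longer horizon.
import numpy as np

horizons = np.array([1e9, 2e9, 4e9, 8e9])            # training tokens
best_lr = np.array([3e-3, 2.2e-3, 1.6e-3, 1.2e-3])   # tuned LR at each horizon

# Linear fit in log-log space: log lr = b * log T + log a.
b, log_a = np.polyfit(np.log(horizons), np.log(best_lr), 1)
a = np.exp(log_a)
print(f"lr*(T) ~= {a:.3g} * T^({b:.3f})")

target = 1e11  # extrapolate to a 100B-token run
print(f"predicted optimal LR at {target:.0e} tokens: {a * target ** b:.2e}")
```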

335Can foundation models actively gather information in interactive environments to test hypotheses?

[openreview] [pdf]

Abstract While problem solving is a standard evaluation task for foundation models, a crucial component of problem solving---actively and strategically gathering information to test hypotheses---has not been closely investigated. To assess the information gathering abilities of foundation models in interactive environments, we introduce a framework in which a model must determine the factors influencing a hidden reward function by iteratively reasoning about its previously gathered information and proposing its next exploratory action to maximize information gain at each step. We implement this framework in both a text-based environment, which offers a tightly controlled setting and enables high-throughput parameter sweeps, and in an embodied 3D environment, which requires addressing complexities of multi-modal interaction more relevant to real-world applications. We further investigate whether approaches such as self-correction and increased inference time improve information gathering efficiency. In a relatively simple task that requires identifying a single rewarding feature, we find that Gemini’s information gathering capability is close to optimal. However, when the model must identify a conjunction of rewarding features, performance is suboptimal. The drop in performance is due partly to the model translating the task description into a policy and partly to the model’s effectiveness in using its in-context memory. Performance is comparable in both text and 3D embodied environments, although imperfect visual object recognition reduces accuracy in drawing conclusions from gathered information in the 3D embodied case. For single-feature-based rewards, we find that smaller models curiously perform better; for conjunction-based rewards, incorporating self-correction into the model improves performance.

336Bootstrapped Model Predictive Control

[openreview] [pdf]

Abstract Model Predictive Control (MPC) has been demonstrated to be effective in continuous control tasks. When a world model and a value function are available, planning a sequence of actions ahead of time leads to a better policy. Existing methods typically obtain the value function and the corresponding policy in a model-free manner. However, we find that such an approach struggles with complex tasks, resulting in poor policy learning and inaccurate value estimation. To address this problem, we leverage the strengths of MPC itself. In this work, we introduce Bootstrapped Model Predictive Control (BMPC), a novel algorithm that performs policy learning in a bootstrapped manner. BMPC learns a network policy by imitating an MPC expert, and in turn, uses this policy to guide the MPC process. Combined with model-based TD-learning, our policy learning yields better value estimation and further boosts the efficiency of MPC. We also introduce a lazy reanalyze mechanism, which enables computationally efficient imitation learning. Our method achieves superior performance over prior works on diverse continuous control tasks. In particular, on challenging high-dimensional locomotion tasks, BMPC significantly improves data efficiency while also enhancing asymptotic performance and training stability, with comparable training time and smaller network sizes. Code is available at https://github.com/bmpc-anonymous/bmpc.

337Exploring Diffusion Models’ Corruption Stage in Few-Shot Fine-tuning and Mitigating with Bayesian Neural Networks

[openreview] [pdf]

Abstract Few-shot fine-tuning of Diffusion Models (DMs) is a key advancement, significantly reducing training costs and enabling personalized AI applications. However, we explore the training dynamics of DMs and observe an unanticipated phenomenon: during the training process, image fidelity initially improves, then unexpectedly deteriorates with the emergence of noisy patterns, only to recover later with severe overfitting. We term the stage with generated noisy patterns the corruption stage. To understand this corruption stage, we begin by heuristically modeling the one-shot fine-tuning scenario, and then extend this modeling to more general cases. Through this modeling, we identify the primary cause of this corruption stage: a narrowed learning distribution inherent in the nature of few-shot fine-tuning. To tackle this, we apply Bayesian Neural Networks (BNNs) on DMs with variational inference to implicitly broaden the learned distribution, and show that the learning target of the BNNs can be naturally regarded as an expectation of the diffusion loss and a further regularization with the pretrained DMs. This approach is highly compatible with current few-shot fine-tuning methods in DMs and does not introduce any extra inference costs. Experimental results demonstrate that our method significantly mitigates corruption, and improves the fidelity, quality and diversity of the generated images in both object-driven and subject-driven generation tasks. The code is available at an anonymous link.

338FairCoT: Enhancing Fairness in Diffusion Models via Chain of Thought Reasoning of Multimodal Language Models

[openreview] [pdf]

Abstract In the domain of text-to-image generative models, biases inherent in training datasets often propagate into generated content, posing significant ethical challenges, particularly in socially sensitive contexts. We introduce FairCoT, a novel framework that enhances fairness in diffusion models through Chain-of-Thought (CoT) reasoning within multimodal generative large language models (LLMs). FairCoT employs iterative CoT refinement and attire-based attribute prediction to systematically mitigate biases, ensuring diverse and equitable representation in generated images. By integrating iterative reasoning processes, FairCoT addresses the limitations of zero-shot CoT in sensitive scenarios, balancing creativity with ethical responsibility. Experimental evaluations across multiple models, including DALL-E and various Stable Diffusion variants, demonstrate that FairCoT significantly improves fairness and diversity metrics without compromising image quality or relevance. Our approach advances ethical AI practices in generative modeling, promoting socially responsible content generation and setting new standards for fairness in AI-generated imagery.

339Win Rate is All that Can Matter from Preference Data Alone

[openreview] [pdf]

Abstract The surging interest in learning from preference data has resulted in an elaborate landscape of methods and evaluations. This work offers a framework to simplify this landscape. We start with the insight that the only fixed information represented in preference data is the preference classifier, and thus the only evaluation of a model grounded in the data is win rate under this classifier. In other words, win rate is all that can matter from preference data alone. This insight unlocks many follow-up results. First, we introduce a family of objectives to directly optimize for win rate, called Direct Win Rate Optimization (DWRO) objectives. We show that Reinforcement Learning From Human Feedback (RLHF) is a KL-regularized DWRO objective while SFT on preferred samples is not. We then compare the target distributions of various preference learning objectives and explain how different design choices affect the sharpness of the resulting distribution. Furthermore, we provide closed-form solutions for the expected win rate improvement of common preference learning algorithms and explain the intuitions they provide. Our analysis and accompanying experiments not only elucidate the design space of preference learning algorithms but also offer guidance on future directions to advance preference learning.
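
The central quantity is easy to state in code: given any preference classifier, a model's win rate is the mean probability that its samples beat a baseline's. The classifier below is a toy Bradley-Terry stand-in, not anything specific to the paper.

```python
# Win rate of model samples against baseline samples under a preference
# classifier p(y1 beats y2). Toy scalar "generations" for illustration.
import numpy as np

rng = np.random.default_rng(0)

def pref_prob(y1, y2):
    # Toy Bradley-Terry preference: higher scalar "quality" tends to win.
    return 1.0 / (1.0 + np.exp(-(y1 - y2)))

model_samples = rng.normal(0.5, 1.0, 1000)     # stand-in generations
baseline_samples = rng.normal(0.0, 1.0, 1000)

# Expected win rate: mean preference probability over sampled pairs.
win_rate = pref_prob(model_samples, baseline_samples).mean()
print(f"win rate vs baseline: {win_rate:.3f}")
```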

340Jump Your Steps: Optimizing Sampling Schedule of Discrete Diffusion Models

[openreview] [pdf]

Abstract Diffusion models have seen notable success in continuous domains, leading to the development of discrete diffusion models (DDMs) for discrete variables. Despite recent advances, DDMs face the challenge of slow sampling speeds. While parallel sampling methods like τ-leaping accelerate this process, they introduce Compounding Decoding Error (CDE), where discrepancies arise between the true distribution and the approximation from parallel token generation, leading to degraded sample quality. In this work, we present Jump Your Steps (JYS), a novel approach that optimizes the allocation of discrete sampling timesteps by minimizing CDE without extra computational cost. More precisely, we derive a practical upper bound on CDE and propose an efficient algorithm for searching for the optimal sampling schedule. Extensive experiments across image, music, and text generation show that JYS significantly improves sampling quality, establishing it as a versatile framework for enhancing DDM performance for fast sampling.

341Diffusion-based Prompt Generation for Lifelong Continual Adaptation

[openreview] [pdf]

Abstract Continual Test-time Adaptation (TTA) addresses sequential out-of-distribution scenarios with unlabeled data but overlooks long-term and recurring in-distribution aspects of the real world. Therefore, we introduce Lifelong Continual Adaptation, which enables models to efficiently retrieve domain-specific knowledge when encountering in-distribution data streams with sequential and recurring domains. We found that optimization-based Continual TTA methods underperform on the proposed problem due to two major pitfalls: updating the model’s parameters is expensive and impractical for resource-constrained devices, and these methods exhibit instability when adapting to long-term recurring domains. To address these challenges, we propose a diffusion-based prompt generation method (DiffPrompt). Specifically, instead of continually optimizing the foundation model, we generate domain-specific prompts for it to adapt. We use a conditional diffusion model to learn a prompt-space distribution for various domains. During testing, the diffusion model generates prompts for the current domain based on the incoming batch of data, facilitating the continual adaptation of the foundation model. Our experiments demonstrate that DiffPrompt enables stable and efficient deployment in practical scenarios involving sequential and recurring domains.

342Knowledge Localization: Mission Not Accomplished? Enter Query Localization!

[openreview] [pdf]

Abstract Large language models (LLMs) store extensive factual knowledge, but the mechanisms behind how they store and express this knowledge remain unclear. The Knowledge Neuron (KN) thesis is a prominent theory for explaining these mechanisms. This theory is based on the Knowledge Localization (KL) assumption, which suggests that a fact can be localized to a few knowledge storage units, namely knowledge neurons. However, this assumption has two limitations: first, it may be too rigid regarding knowledge storage, and second, it neglects the role of the attention module in knowledge expression. In this paper, we first re-examine the KL assumption and demonstrate that its limitations do indeed exist. To address these, we then present two new findings, each targeting one of the limitations: one focusing on knowledge storage and the other on knowledge expression. We summarize these findings as the Query Localization (QL) assumption and argue that the KL assumption can be viewed as a simplification of the QL assumption. Based on the QL assumption, we further propose the Consistency-Aware KN modification method, which improves the performance of knowledge modification, further validating our new assumption. We conduct 39 sets of experiments, along with additional visualization experiments, to rigorously confirm our conclusions. Code will be made public soon.

343Boosting Latent Diffusion with Perceptual Objectives

[openreview] [pdf]

Abstract Latent diffusion models (LDMs) power state-of-the-art high-resolution generative image models. LDMs learn the data distribution in the latent space of an autoencoder (AE) and produce images by mapping the generated latents into RGB image space using the AE decoder. While this approach allows for efficient model training and sampling, it induces a disconnect between the training of the diffusion model and the decoder, resulting in a loss of detail in the generated images. To remediate this disconnect, we propose to leverage the internal features of the decoder to define a latent perceptual loss (LPL). This loss encourages the models to create sharper and more realistic images. Our loss can be seamlessly integrated with common autoencoders used in latent diffusion models, and can be applied to different generative modeling paradigms such as DDPM with epsilon and velocity prediction, as well as flow matching. Extensive experiments with models trained on three datasets at 256 and 512 resolution show improved quantitative -- with boosts between 6% and 20% in FID -- and qualitative results when using our perceptual loss.

344Alternating Projections With Volume Sampling

[openreview] [pdf]

Abstract The method of Alternating Projections (AP) is a fundamental iterative technique with applications to problems in machine learning, optimization and signal processing. Examples include the Gauss-Seidel algorithm which is used to solve large-scale regression problems and the Kaczmarz and projections onto convex sets (POCS) algorithms that are fundamental to iterative reconstruction. Progress has been made with regard to the questions of efficiency and rate of convergence of the AP method in the randomized setting. Here, we extend these results with volume sampling to block (batch) sizes greater than 1 and provide explicit formulas that relate the convergence rate bounds to the spectrum of the underlying system. These results, together with a trace formula and associated volume sampling, prove that convergence rates monotonically improve with larger block sizes, a feature that cannot be guaranteed in general with uniform sampling (e.g., in SGD).
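
A small sketch of the setting: randomized block Kaczmarz (a special case of AP) with blocks of size k drawn by volume sampling, i.e. with probability proportional to det(A_S A_S^T). Exact enumeration of all blocks, as done here, is only feasible for tiny systems and is purely illustrative of the sampling scheme.

```python
# Block Kaczmarz for a consistent system Ax = b, with k-row blocks drawn by
# volume sampling: P(S) proportional to det(A_S A_S^T). Illustrative only.
import numpy as np
from itertools import combinations

rng = np.random.default_rng(0)
m, n, k = 20, 10, 3
A = rng.normal(size=(m, n))
x_true = rng.normal(size=n)
b = A @ x_true

# Enumerate all k-row blocks and their squared volumes det(A_S A_S^T).
blocks = list(combinations(range(m), k))
vols = np.array([np.linalg.det(A[list(S)] @ A[list(S)].T) for S in blocks])
probs = vols / vols.sum()

x = np.zeros(n)
for _ in range(300):
    S = list(blocks[rng.choice(len(blocks), p=probs)])
    A_S, b_S = A[S], b[S]
    # Project x onto the affine set {z : A_S z = b_S}.
    x += A_S.T @ np.linalg.solve(A_S @ A_S.T, b_S - A_S @ x)
print("residual:", np.linalg.norm(A @ x - b))
```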

345Event-Driven Online Vertical Federated Learning

[openreview] [pdf]

Abstract Online learning is more adaptable to real-world scenarios in Vertical Federated Learning (VFL) compared to offline learning. However, integrating online learning into VFL presents challenges due to the unique nature of VFL, where clients possess non-intersecting feature sets for the same sample. In real-world scenarios, the clients may not receive data streaming for the disjoint features for the same entity synchronously. Instead, the data are typically generated by an event relevant to only a subset of clients. We are the first to identify these challenges in online VFL, which have been overlooked by previous research. To address these challenges, we propose an event-driven online VFL framework. In this framework, only a subset of clients is activated during each event, while the remaining clients passively collaborate in the learning process. Furthermore, we incorporate dynamic local regret (DLR) into VFL to address the challenges posed by online learning problems with non-convex models within a non-stationary environment. We conduct a comprehensive regret analysis of our proposed framework, specifically examining the DLR under non-convex conditions with event-driven online VFL. Extensive experiments demonstrate that our proposed framework is more stable than the existing online VFL framework under non-stationary data conditions while also significantly reducing communication and computation costs.

346Improving Generalization of Meta Reinforcement Learning via Explanation

[openreview] [pdf]

Abstract Meta reinforcement learning learns a meta-prior (e.g., meta-policy) from a set of training tasks, such that the learned meta-prior can efficiently adapt to all the tasks in a task distribution. However, it has been observed in the literature that the learned meta-prior usually has imbalanced generalization, i.e., it adapts well to some tasks but adapts poorly to some other tasks. This paper aims to explain why certain tasks are poorly adapted and, more importantly, use this explanation to improve generalization. Our methodology has two parts. The first part identifies "critical" training tasks that are most important to achieve good performance on those poorly-adapted tasks. An explanation of the poor generalization is that the meta-prior does not pay enough attention to the critical training tasks. To improve generalization, the second part formulates a bi-level optimization problem where the upper level learns how to augment the critical training tasks such that the meta-prior can pay more attention to the critical tasks, and the lower level computes the meta-prior distribution corresponding to the current augmentation. We propose an algorithm to solve the bi-level optimization problem and theoretically guarantee that (1) the algorithm converges at the rate of O(1/\sqrt{K}), (2) the learned augmentation makes the meta-prior focus more on the critical training tasks, and (3) the generalization improves after the task augmentation. We use two real-world experiments and three MuJoCo experiments to show that our algorithm improves the generalization and outperforms state-of-the-art baselines.

347DiffPath: Generating Road Network based Path with Latent Diffusion Model

[openreview] [pdf]

Abstract With the increasing use of GPS technology, path data have become essential for applications such as navigation, urban planning, and traffic optimization. However, obtaining real-world path data presents challenges due to privacy concerns and the difficulty of collecting large datasets. Existing methods, including count-based and deep learning approaches, struggle with two main challenges: handling complex distributions of path segments and ensuring global coherence in generated paths. To address these, we introduce DiffPath, a path generation model based on Latent Diffusion Models (LDMs). By embedding paths into a continuous latent space and leveraging a transformer architecture, DiffPath captures both local transitions and global dependencies, ensuring the generation of realistic paths. Experimental results demonstrate that our model outperforms existing approaches in generating paths that adhere to real-world road network structures while maintaining privacy.

348Average Certified Radius is a Poor Metric for Randomized Smoothing

[openreview] [pdf]

Abstract Randomized smoothing is a popular approach for providing certified robustness guarantees against adversarial attacks, and has become a very active area of research. Over the past years, the average certified radius (ACR) has emerged as the single most important metric for comparing methods and tracking progress in the field. However, in this work, we show that ACR is an exceptionally poor metric for evaluating robustness guarantees provided by randomized smoothing. We theoretically show not only that a trivial classifier can have arbitrarily large ACR, but also that ACR is much more sensitive to improvements on easy samples than on hard ones. Empirically, we confirm that existing training strategies that improve ACR reduce the model’s robustness on hard samples. Further, we show that by focusing on easy samples, we can effectively replicate the increase in ACR. We develop strategies, including explicitly discarding hard samples, reweighing the dataset with certified radius, and extreme optimization for easy samples, to achieve state-of-the-art ACR, although these strategies ignore robustness for the general data distribution. Overall, our results suggest that ACR has introduced a strong undesired bias to the field, and better metrics are required to holistically evaluate randomized smoothing.
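
For reference, ACR is computed as below: each correctly classified point contributes a certified radius of σ·Φ⁻¹(p), and misclassified points contribute zero. The toy numbers illustrate the paper's point that a handful of very easy samples can dominate the average even when many samples have no robustness at all.

```python
# Average certified radius (ACR) under randomized smoothing. The skewed
# split scores higher than the balanced one despite failing on half the data.
import numpy as np
from scipy.stats import norm

sigma = 0.5

def acr(p_correct):
    """p_correct: per-sample probability the smoothed classifier is right."""
    p = np.clip(np.asarray(p_correct), 1e-9, 1 - 1e-9)
    radii = np.where(p > 0.5, sigma * norm.ppf(p), 0.0)
    return radii.mean()

balanced = [0.7] * 10                     # moderately robust everywhere
skewed = [0.9999] * 5 + [0.4] * 5         # very easy half, failing half
print(f"balanced ACR: {acr(balanced):.3f}")
print(f"skewed   ACR: {acr(skewed):.3f}")  # higher, despite 5 failures
```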

349Ensemble Kalman Diffusion Guidance: A Derivative-free Method for Inverse Problems

[openreview] [pdf]

Abstract When solving inverse problems, it is increasingly popular to use pre-trained diffusion models as plug-and-play priors. This framework can accommodate different forward models without re-training while preserving the generative capability of diffusion models. Despite their success in many imaging inverse problems, most existing methods rely on privileged information such as derivatives, pseudo-inverses, or full knowledge of the forward model. This reliance poses a substantial limitation that restricts their use in a wide range of problems where such information is unavailable, such as many scientific applications. To address this, we propose Ensemble Kalman Diffusion Guidance (EnKG) for diffusion models, a derivative-free approach that can solve inverse problems by only accessing forward model evaluations and a pre-trained diffusion model. We study the empirical effectiveness of our method across various inverse problems, including scientific settings such as inferring fluid flows and astronomical objects, which are highly non-linear inverse problems that often only permit black-box access to the forward model.

350Improved Sampling Algorithms for Lévy-Itô Diffusion Models

[openreview] [pdf]

Abstract Lévy-Itô denoising diffusion models relying on isotropic α-stable noise instead of Gaussian distribution have recently been shown to improve performance of conventional diffusion models in image generation on imbalanced datasets while performing comparably in the standard settings. However, the stochastic algorithm of sampling from such models consists in solving the stochastic differential equation describing only an approximate inverse of the process of adding α-stable noise to data which may lead to suboptimal performance. In this paper, we derive a parametric family of stochastic differential equations whose solutions have the same marginal densities as those of the forward diffusion and show that the appropriate choice of the parameter values can improve quality of the generated images when the number of reverse diffusion steps is small. Also, we demonstrate that Lévy-Itô diffusion models are applicable to diverse domains and show that a well-trained text-to-speech Lévy-Itô model may have advantages over standard diffusion models on highly imbalanced datasets.

351Fast and Slow Streams for Online Time Series Forecasting Without Information Leakage

[openreview] [pdf]

Abstract Current research in online time series forecasting suffers from information leakage: models predict and then evaluate on historical time steps that have been backpropagated for parameter updates. This setting also misaligns with the real-world conception of forecasting, which typically emphasizes looking ahead and anticipating future uncertainties. This paper redefines online time series forecasting to focus on predicting unknown future steps and evaluates performance solely based on these predictions. Following this new setting, challenges arise in leveraging incomplete pairs of ground truth and prediction for backpropagation, as well as generalizing accurate information without overfitting to noises from recent data streams. To address these challenges, we propose a novel dual-stream framework for online forecasting (DSOF): a slow stream that updates with complete data using experience replay, and a fast stream that adapts to recent data through temporal difference learning. This dual-stream approach updates a teacher-student model learned through a residual learning strategy, generating predictions in a coarse-to-fine manner. Extensive experiments demonstrate its improvement in forecasting performance in changing environments.

352Flexible Fairness-Aware Learning via Inverse Conditional Permutation

[openreview] [pdf]

Abstract Equalized odds, as a popular notion of algorithmic fairness, aims to ensure that sensitive variables, such as race and gender, do not unfairly influence the algorithm’s prediction when conditioning on the true outcome. Despite rapid advancements, current research primarily focuses on equalized odds violations caused by a single sensitive attribute, leaving the challenge of simultaneously accounting for multiple attributes largely unaddressed. We bridge this gap by introducing an in-processing fairness-aware learning approach, FairICP, which integrates adversarial learning with a novel inverse conditional permutation scheme. FairICP offers a theoretically justified, flexible, and efficient scheme to promote equalized odds under fairness conditions described by complex and multi-dimensional sensitive attributes. The efficacy and adaptability of our method are demonstrated through both simulation studies and empirical analyses of real-world datasets.

353ASOR: Anchor State Oriented Regularization for Policy Optimization under Dynamics Shift

[openreview] [pdf]

Abstract To train neural policies in environments with diverse dynamics, Imitation from Observation (IfO) approaches aim at recovering expert state trajectories. Their success is built upon the assumption that the stationary state distributions induced by optimal policies remain similar despite dynamics shift. However, such an assumption does not hold in many real-world scenarios, especially when certain states become inaccessible during environment dynamics change. In this paper, we propose the concept of anchor states which appear in all optimal trajectories under dynamics shift, thereby maintaining consistent state accessibility. Instead of direct imitation, we incorporate anchor state distributions into policy regularization to mitigate the issue of inaccessible states, leading to the ASOR algorithm. By formally characterizing the difference of state accessibility under dynamics shift, we show that the anchor state-based regularization approach provides strong lower-bound performance guarantees for efficient policy optimization. We perform extensive experiments across various online and offline RL benchmarks, including Gridworld, MuJoCo, MetaDrive, D4RL, and a Fall Guys-like game environment, featuring multiple sources of dynamics shift. Experimental results indicate ASOR can be effectively integrated with several state-of-the-art cross-domain policy transfer algorithms, substantially enhancing their performance.

354Imputation for prediction: beware of diminishing returns.

[openreview] [pdf]

Abstract Missing values are prevalent across various fields, posing challenges for training and deploying predictive models. In this context, imputation is a common practice, driven by the hope that accurate imputations will enhance predictions. However, recent theoretical and empirical studies indicate that simple constant imputation can be consistent and competitive. This empirical study aims at clarifying if and when investing in advanced imputation methods yields significantly better predictions. Relating imputation and predictive accuracies across combinations of imputation and predictive models on 19 datasets, we show that imputation accuracy matters less i) when using expressive models and ii) when incorporating missingness indicators as complementary inputs, and iii) that it matters much more for generated linear outcomes than for real-data outcomes. Interestingly, we also show that the use of the missingness indicator is beneficial to prediction performance, even in MCAR scenarios. Overall, on real data with powerful models, imputation quality has only a minor effect on prediction performance. Thus, investing in better imputations for improved predictions often offers limited benefits.
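
The competitive simple baseline the study describes is a one-liner in scikit-learn: constant imputation with an appended missingness indicator, feeding an expressive model. The data below is synthetic MCAR, for illustration only.

```python
# Constant imputation plus a missingness indicator: with an expressive model,
# this simple pipeline is often competitive with sophisticated imputers.
import numpy as np
from sklearn.ensemble import HistGradientBoostingRegressor
from sklearn.impute import SimpleImputer
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 8))
y = X[:, 0] + np.sin(X[:, 1]) + rng.normal(0, 0.1, 500)
X[rng.random(X.shape) < 0.3] = np.nan   # 30% MCAR missingness

# add_indicator=True appends one binary column per feature with missing values.
model = make_pipeline(
    SimpleImputer(strategy="constant", fill_value=0.0, add_indicator=True),
    HistGradientBoostingRegressor(random_state=0),
)
print("R^2:", cross_val_score(model, X, y, cv=5).mean())
```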

355DSPO: Direct Score Preference Optimization for Diffusion Model Alignment

[openreview] [pdf]

Abstract Diffusion-based Text-to-Image (T2I) models have achieved impressive success in generating high-quality images from textual prompts. While large language models (LLMs) effectively leverage Direct Preference Optimization (DPO) for fine-tuning on human preference data without the need for reward models, diffusion models have not been extensively explored in this area. Current preference learning methods applied to T2I diffusion models directly adapt existing techniques from LLMs. However, this adaptation introduces a mismatch between the pretraining and the fine-tuning objectives specific to T2I diffusion models. This inconsistency can potentially lead to suboptimal performance. In this work, we propose Direct Score Preference Optimization (DSPO), a novel algorithm that aligns the pretraining and fine-tuning objectives of diffusion models by leveraging score matching, the same objective used during pretraining. It introduces a new perspective on preference learning for diffusion models. Specifically, DSPO distills the score function of human-preferred image distributions into pretrained diffusion models, fine-tuning the model to generate outputs that align with human preferences. We theoretically show that DSPO shares the same optimization direction as reinforcement learning algorithms in diffusion models under certain conditions. Our experimental results demonstrate that DSPO outperforms preference learning baselines for T2I diffusion models in human preference evaluation tasks and enhances both visual appeal and prompt alignment of generated images.

356Exploring Local Memorization in Diffusion Models via Bright Ending Attention

[openreview] [pdf]

Abstract In this paper, we identify and leverage a novel ‘bright ending’ (BE) anomaly in diffusion models prone to memorizing training images to address a new task: locating localized memorization regions within these models. BE refers to a distinct cross-attention pattern observed in text-to-image generations using diffusion models. Specifically, memorized image patches exhibit significantly greater attention to the end token during the final inference step compared to non-memorized patches. This attention map effectively highlights regions where the generated image replicates training data. Furthermore, driven by our observation that local memorization significantly underperforms in existing tasks of measuring, detecting, and mitigating memorization in diffusion models compared to global memorization, we propose a simple yet effective method to integrate BE and the results of the new localization task into these existing frameworks. This integration effectively improves their performances by narrowing the performance gap caused by local memorization. Our results not only demonstrate the successful execution of the new localization task but also establish new state-of-the-art performance across all existing tasks, underscoring the significance of the BE phenomenon.
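
A schematic of the detection heuristic, run on a dummy attention map: compare each patch's final-step attention to the prompt's end token against the other patches and flag outliers. The thresholding rule below is an assumption for illustration, not the paper's exact criterion.

```python
# Bright-ending heuristic sketch: patches with unusually high final-step
# cross-attention on the end token are flagged as possibly memorized.
# A random tensor stands in for a real cross-attention map.
import torch

torch.manual_seed(0)
# (num_patches, num_text_tokens) attention at the last inference step.
attn = torch.rand(64, 77)
attn = attn / attn.sum(dim=-1, keepdim=True)  # normalize over text tokens
end_token_attn = attn[:, -1]                  # attention mass on the end token

# Illustrative outlier rule: mean + 2 standard deviations across patches.
threshold = end_token_attn.mean() + 2 * end_token_attn.std()
flagged = (end_token_attn > threshold).nonzero().flatten()
print("patches flagged as possibly memorized:", flagged.tolist())
```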

357Backdoor Attacks for LLMs with Weak-To-Strong Knowledge Distillation

[openreview] [pdf]

Abstract Despite being widely applied due to their exceptional capabilities, Large Language Models (LLMs) have been proven to be vulnerable to backdoor attacks. These attacks introduce targeted vulnerabilities into LLMs by poisoning training samples and full-parameter fine-tuning. However, such attacks are limited because they require significant computational resources, especially as the size of LLMs increases. Parameter-efficient fine-tuning (PEFT) offers an alternative, but its restricted parameter updates may impede the alignment of triggers with target labels. In this study, we first verify that backdoor attacks with PEFT may encounter challenges in achieving feasible performance. To address these issues and improve the effectiveness of backdoor attacks with PEFT, we propose a novel weak-to-strong backdoor attack algorithm based on feature alignment-enhanced knowledge distillation (W2SAttack). Specifically, we poison small-scale language models through full-parameter fine-tuning to serve as the teacher model. The teacher model then covertly transfers the backdoor to the large-scale student model through feature alignment-enhanced knowledge distillation, which employs PEFT. Theoretical analysis reveals that W2SAttack has the potential to augment the effectiveness of backdoor attacks. We demonstrate the superior performance of W2SAttack on classification tasks across four language models, four backdoor attack algorithms, and two different architectures of teacher models. Experimental results indicate success rates close to 100% for backdoor attacks targeting PEFT.

358Stochastic Diffusion: A Diffusion Based Model for Stochastic Time Series Forecasting

[openreview] [pdf]

Abstract Recent successes in diffusion probabilistic models have demonstrated their strength in modelling and generating different types of data, paving the way for their application in generative time series forecasting. However, most existing diffusion-based approaches rely on sequential models and unimodal latent variables to capture global dependencies and model entire observable data, resulting in difficulties when it comes to highly stochastic time series data. In this paper, we propose a novel Stochastic Diffusion (StochDiff) model that integrates the diffusion process into the time series modelling stage and utilizes the representational power of stochastic latent spaces to capture the variability of highly stochastic time series data. Specifically, the model applies a diffusion module at each time step within the sequential framework and learns a step-wise, data-driven prior for the generative diffusion process. These features enable the model to effectively capture complex temporal dynamics and the multi-modal nature of highly stochastic time series data. Through extensive experiments on real-world datasets, we demonstrate the effectiveness of our proposed model for probabilistic time series forecasting, particularly in scenarios with high stochasticity. Additionally, with a real-world surgical use case, we highlight the model’s potential in medical applications.

359A Modified Proximal-Perturbed Lagrangian for Non-Convex Non-Smooth Representatives of Fairness Constraints

[openreview] [pdf]

Abstract We study classification problems under fairness constraints and introduce an algorithmic framework designed to prevent discrimination against different groups. These problems are often reformulated as continuous constrained optimization problems and are typically solved using continuous relaxations (surrogates) of the fairness constraints. However, many current algorithms do not provide theoretical guarantees, possibly because the resulting fairness constraints are both non-convex and non-smooth. We propose a novel primal-dual algorithm, based on a newly developed Lagrangian, that converges to a stationary solution of the reformulated problem. Our algorithm is not only efficient and robust, but it also enjoys strong performance guarantees on the fairness of its solutions. Furthermore, experimental results demonstrate that our algorithm is highly effective in terms of computational cost and fairness guarantees, outperforming related algorithms that use regularization (penalization) techniques and/or standard Lagrangian relaxation.

360Adaptive Source Localization on Complex Networks via Conditional Diffusion Model

[openreview] [pdf]

Abstract Network propagation issues like the spread of misinformation, cyber threats, or infrastructure breakdowns are prevalent and have significant societal impacts. Identifying the source of such propagation by analyzing snapshots of affected networks is crucial for managing crises like disease outbreaks and enhancing network security. Traditional methods rely on metrics derived from network topology and are limited to specific propagation models, while deep learning models face the challenge of data scarcity. We propose ASLDiff (Adaptive Source Localization Diffusion Model), a novel adaptive source localization diffusion model that achieves accurate and robust source localization across different network topologies and propagation modes by fusing the principles of information propagation and restructuring the label propagation process within the conditioning module. Our approach not only adapts easily to real-world patterns without abundant fine-tuning data but also generalizes to different network topologies. Evaluations on various datasets demonstrate ASLDiff’s superior effectiveness, accuracy, and adaptability in real-world applications, showcasing its robust performance across different localization scenarios. The code can be found at https://anonymous.4open.science/r/ASLDiff-4FE0.

361UTSD: Unified Time Series Diffusion Model

[openreview] [pdf]

Abstract Transformer-based architectures have achieved unprecedented success in time series analysis. However, when facing the challenge of cross-domain modeling, existing approaches that utilize statistical priors as prompt engineering fail under the large distribution shifts among various domains. In this paper, a Unified Time Series Diffusion (UTSD) model is established for the first time to model the multi-domain probability distribution, utilizing the powerful distribution modeling ability of diffusion models. Unlike autoregressive models that capture the conditional probabilities of the prediction horizon given the historical sequence, we use a diffusion denoising process to model the mixture distribution of the cross-domain data and generate the prediction sequence for the target domain directly via conditional sampling. The proposed UTSD contains three pivotal designs: (1) a condition network that captures the multi-scale fluctuation patterns from the observation sequence, which are utilized as context representations to guide the denoising network in generating the prediction sequence; (2) an adaptor-based fine-tuning strategy, in which the multi-domain universal representation learned in the pretraining stage is utilized for downstream tasks in the target domains; and (3) a diffusion and denoising process on the actual sequence space, combined with improved classifier-free guidance as the conditional generation strategy, which greatly improves the stability and accuracy of the downstream task. We conduct extensive experiments on mainstream benchmarks, and the pre-trained UTSD outperforms existing foundation models on all data domains, exhibiting superior zero-shot generalization ability. After training from scratch, UTSD achieves comparable performance against domain-specific proprietary models. In particular, UTSD shows stable and reliable time series generation, and the empirical results validate the potential of UTSD as a time series foundation model. The source code of UTSD is publicly available at https://anonymous.4open.science/r/UTSD-1BFF.

362ContraDiff: Planning Towards High Return States via Contrastive Learning

[openreview] [pdf]

Abstract The performance of offline reinforcement learning (RL) is sensitive to the proportion of high-return trajectories in the offline dataset. However, in many simulation environments and real-world scenarios, there are large ratios of low-return trajectories rather than high-return trajectories, which makes learning an efficient policy challenging. In this paper, we propose a method called Contrastive Diffuser (ContraDiff) to make full use of low-return trajectories and improve the performance of offline RL algorithms. Specifically, ContraDiff groups the states of trajectories in the offline dataset into high-return states and low-return states and treats them as positive and negative samples, respectively. Then, it designs a contrastive mechanism to pull the planned trajectory of an agent toward high-return states and push it away from low-return states. Through this contrast mechanism, trajectories with low returns serve as negative examples that provide a "counteracting force", guiding the agent to avoid areas associated with low returns and achieve better performance. Experiments on 27 sub-optimal datasets demonstrate the effectiveness of our proposed method. Our code is publicly available at https://anonymous.4open.science/r/ContraDiff.
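
A triplet-style stand-in for the pull/push mechanism: penalize planned states that sit closer to low-return anchor states than to high-return ones. The exact loss in the paper may differ; this only illustrates the contrast idea on random tensors.

```python
# Contrastive pull/push sketch for planned trajectories: pull toward nearby
# high-return states, push away from low-return ones. Illustrative loss only.
import torch

def contrastive_plan_loss(plan, pos_states, neg_states, tau=1.0):
    """plan: (T, d) planned states; pos/neg: (N, d) anchor states."""
    d_pos = torch.cdist(plan, pos_states).min(dim=1).values  # nearest positive
    d_neg = torch.cdist(plan, neg_states).min(dim=1).values  # nearest negative
    # Small when the plan hugs high-return states and avoids low-return ones.
    return torch.relu(d_pos - d_neg + tau).mean()

torch.manual_seed(0)
plan = torch.randn(16, 4, requires_grad=True)
loss = contrastive_plan_loss(plan,
                             torch.randn(50, 4) + 2.0,   # high-return anchors
                             torch.randn(50, 4) - 2.0)   # low-return anchors
loss.backward()  # this gradient would steer the planner during training
print(loss.item(), plan.grad.norm().item())
```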

363DiffuSolve: Diffusion-Based Solver for Non-Convex Trajectory Optimization

[openreview] [pdf]

Abstract Optimal trajectory design is computationally expensive for nonlinear and high-dimensional dynamical systems. The challenge arises from the non-convex nature of the optimization problem with multiple local optima, which usually requires a global search. Traditional numerical solvers struggle to find diverse solutions efficiently without appropriate initial guesses. In this paper, we introduce DiffuSolve, a general diffusion model-based solver for non-convex trajectory optimization. An expressive diffusion model is trained on pre-collected locally optimal solutions and efficiently samples initial guesses, which then warm-starts numerical solvers to fine-tune the feasibility and optimality. We also present DiffuSolve+, a novel constrained diffusion model with an additional loss in training that further reduces the problem constraint violations of diffusion samples. Experimental evaluations on three tasks verify the improved robustness, diversity, and a 2× to 11× increase in computational efficiency with our proposed method, which generalizes well to trajectory optimization problems of varying challenges.

364f-Divergence Policy Optimization in Fully Decentralized Cooperative MARL

[openreview] [pdf]

Abstract Independent learning is a straightforward solution for fully decentralized learning in cooperative multi-agent reinforcement learning (MARL). The study of independent learning has a history of decades, and representative methods, such as independent Q-learning and independent PPO, can obtain good performance in some benchmarks. However, most independent learning algorithms lack convergence guarantees or theoretical support. In this paper, we propose a general formulation of independent policy optimization, f-divergence policy optimization. We show the generality of such a formulation and analyze its limitations. Based on this formulation, we further propose a novel independent learning algorithm, TVPO, that theoretically guarantees convergence. Empirically, we show that TVPO outperforms state-of-the-art fully decentralized learning methods in three popular cooperative MARL benchmarks, which verifies the efficacy of TVPO.

365Generalization in VAE and Diffusion Models: A Unified Information-Theoretic Analysis

[openreview] [pdf]

Abstract Despite the empirical success of Diffusion Models (DMs) and Variational Autoencoders (VAEs), their generalization performance remains theoretically underexplored, particularly lacking a full consideration of the shared encoder-generator structure. Leveraging recent information-theoretic tools, we propose a unified theoretical framework that guarantees the generalization of both the encoder and generator by treating them as randomized mappings. This framework further enables (1) a refined analysis for VAEs, accounting for the generator’s generalization, which was previously overlooked; (2) an explicit trade-off in generalization terms for DMs that depends on the diffusion time $T$; and (3) estimable bounds for DMs based solely on the training data, allowing the selection of the optimal $T$ and the integration of such bounds into the optimization process to improve model performance. Empirical results on both synthetic and real datasets illustrate the validity of the proposed theory.

366Right Time to Learn: Promoting Generalization via Bio-inspired Spacing Effect in Knowledge Distillation

[openreview] [pdf]

Abstract Knowledge distillation (KD) is a powerful strategy for training deep neural networks (DNNs). While it was originally proposed to train a more compact “student” model from a large “teacher” model, many recent efforts have focused on adapting it to promote generalization of the model itself, as in online KD and self KD. Here, we propose an easy-to-use and compatible strategy named Spaced KD to improve the effectiveness of both online KD and self KD, in which the student model distills knowledge from a teacher model trained a space interval ahead. This strategy is inspired by the spacing effect, a prominent theory in biological learning and memory positing that appropriate intervals between learning trials can significantly enhance learning performance. We provide an in-depth theoretical and empirical analysis showing that the benefits of the proposed spacing effect in KD stem from seeking flat minima during stochastic gradient descent (SGD). We perform extensive experiments to demonstrate the effectiveness of our Spaced KD in improving the learning performance of DNNs (e.g., the additional performance gain is up to 2.31% and 3.34% on Tiny-ImageNet over online KD and self KD, respectively).
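A minimal sketch of the spaced schedule, assuming a plain classification setup: the teacher is a clone of the student trained `interval` steps further, so the student always distills from a slightly more-trained copy of itself. Function names and the loss mix are assumptions.

```python
# Hedged sketch of Spaced KD's "teacher trained a space interval ahead".
import copy
import torch
import torch.nn.functional as F

def refresh_teacher(student, loader, interval, lr=1e-3):
    """Clone the student and train the clone `interval` extra steps; the clone
    then serves as the spaced teacher until the next refresh."""
    teacher = copy.deepcopy(student)
    opt = torch.optim.SGD(teacher.parameters(), lr=lr)
    for i, (x, y) in enumerate(loader):
        if i >= interval:
            break
        opt.zero_grad()
        F.cross_entropy(teacher(x), y).backward()
        opt.step()
    return teacher.eval()

def spaced_kd_step(student, teacher, batch, opt, alpha=0.5, T=4.0):
    x, y = batch
    s_logits = student(x)
    with torch.no_grad():
        t_logits = teacher(x)
    kd = F.kl_div(F.log_softmax(s_logits / T, dim=-1),
                  F.softmax(t_logits / T, dim=-1),
                  reduction="batchmean") * T * T
    loss = alpha * kd + (1 - alpha) * F.cross_entropy(s_logits, y)
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()
```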

367Combating inherent noise for direct preference optimization

[openreview] [pdf]

Abstract Direct Preference Optimization (DPO) has recently gained traction as a promising approach to align large models with human feedback. It is notable for its effectiveness and ease of application across various models, including Large Language Models (LLMs) and Diffusion Models (DMs). However, the quality of preference data used in DPO training has been largely overlooked. Current datasets, whether annotated by deep learning metrics or crowd-sourced human judgments, often contain noisy labels. This noise can adversely affect the performance of DPO. To address this issue, we propose a novel approach that incorporates a noise-aware metric into the DPO objective. This metric, which includes intra-annotator confidence and inter-annotator stability, helps identify and mitigate the impact of noisy data. We introduce an Adaptive-DPO loss function that improves the DPO loss in two ways: it reduces the influence of noisy samples and amplifies the impact of clean ones. Our experiments demonstrate that this method effectively handles both synthetic and natural noisy data, leading to improved performance in visual and textual generation tasks. This underscores the practical value of our approach in enhancing model robustness amidst noisy preference data.
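The two-way reweighting can be sketched as a per-sample weight on the standard DPO term. The snippet below is a hedged illustration; the paper's exact weighting built from intra-annotator confidence and inter-annotator stability may differ.

```python
# Hedged sketch of a noise-aware DPO loss: w in [0, 1] downweights likely-noisy
# preference pairs and amplifies clean ones. The weight construction is assumed.
import torch
import torch.nn.functional as F

def adaptive_dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected,
                      w, beta=0.1):
    """Inputs are summed log-probs of chosen/rejected responses under the
    policy and the frozen reference model; w is a (B,) noise-aware weight."""
    margin = beta * ((pi_chosen - ref_chosen) - (pi_rejected - ref_rejected))
    per_sample = -F.logsigmoid(margin)        # standard DPO objective per pair
    return (w * per_sample).mean()            # clean pairs count more
```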

368Attaining Human’s Desirable Outcomes in Indirect Human-AI Interaction via Multi-Agent Influence Diagrams

[openreview] [pdf]

Abstract In human-AI interaction, one of the cutting-edge research questions is how AI agents can assist a human to attain their desirable outcomes. Most related work investigated the paradigm where a human is required to physically interact with AI agents, which we call direct human-AI interaction. However, this paradigm is inapplicable when the scenarios are hazardous to humans, such as mine rescue and recovery. To alleviate this shortcoming, we consider indirect human-AI interaction in this paper. More specifically, a human relies on additional AI agents, which we call AI proxies, to interact with other AI agents and attain the human’s desirable outcomes. We model this interactive process as multi-agent influence diagrams (MAIDs), an augmentation of Bayesian networks to describe games, with Nash equilibrium (NE) as a solution. Nonetheless, in a MAID there may exist multiple NEs, and only one NE is associated with a human’s desirable outcomes. To reach this optimal NE, we propose pre-strategy intervention, an action that provides AI proxies with more information to make decisions toward a human’s desirable outcomes. Furthermore, we demonstrate that a team reward Markov game can be rendered as a MAID. This connection not only interprets the successes and failures of prevailing multi-agent reinforcement learning (MARL) paradigms, but also underpins the implementation of pre-strategy intervention in MARL. In practice, we incorporate pre-strategy intervention into MARL for the team reward Markov game to model scenarios where all agents are required to achieve a common goal, with a subset of agents working as AI proxies to attain a human’s desirable outcomes. During training, these AI proxies receive an additional reward encoding the human’s desirable outcomes, whose feasibility is justified in theory. We evaluate the resulting algorithm ProxyAgent in benchmark MARL environments for teamwork, with additional goals as a human’s desirable outcomes.

369ET-SEED: EFFICIENT TRAJECTORY-LEVEL SE(3) EQUIVARIANT DIFFUSION POLICY

[openreview] [pdf]

Abstract Imitation learning, e.g., diffusion policy, has been proven effective in various robotic manipulation tasks. However, extensive demonstrations are required for policy robustness and generalization. To reduce the demonstration reliance, we leverage spatial symmetry and propose ET-SEED, an efficient trajectory-level SE(3) equivariant diffusion model for generating action sequences in complex robot manipulation tasks. However, previous equivariant diffusion models require per-step equivariance in the Markov process, making it difficult to learn a policy under such strong constraints. We theoretically extend equivariant Markov kernels and simplify the condition for an equivariant diffusion process, thereby significantly improving training efficiency for trajectory-level SE(3) equivariant diffusion policy in an end-to-end manner. We evaluate ET-SEED on representative robotic manipulation tasks involving rigid-body, articulated, and deformable objects. Experiments demonstrate the superior data efficiency and manipulation proficiency of our proposed method, as well as its ability to generalize to unseen configurations with only a few demonstrations. Website: https://et-seed.github.io/

370Expand and Compress: Exploring Tuning Principles for Continual Spatio-Temporal Graph Forecasting

[openreview] [pdf]

Abstract The widespread deployment of sensing devices leads to a surge in data for spatio-temporal forecasting applications such as traffic flow, air quality, and wind energy. Although spatio-temporal graph neural networks (STGNNs) have achieved success in modeling various static spatio-temporal forecasting scenarios, real-world spatio-temporal data are typically received in a streaming manner, and the network continuously expands with the installation of new sensors. Thus, spatio-temporal forecasting in streaming scenarios faces dual challenges: the inefficiency of retraining models over newly-arrived data and the detrimental effects of catastrophic forgetting over long-term history. To address these challenges, we propose a novel prompt tuning-based continuous forecasting method, EAC, following two fundamental tuning principles guided by empirical and theoretical analysis: expand and compress, which effectively resolve the aforementioned problems with lightweight tuning parameters. Specifically, we integrate the base STGNN with a continuous prompt pool, utilizing stored prompts (i.e., a few learnable parameters) in memory, and jointly optimize them with the base STGNN. This method ensures that the model sequentially learns from the spatio-temporal data stream to accomplish tasks for corresponding periods. Extensive experimental results on multiple real-world datasets demonstrate the multi-faceted superiority of EAC over the state-of-the-art baselines, including effectiveness, efficiency, universality, etc.

371Regret-Optimal List Replicable Bandit Learning: Matching Upper and Lower Bounds

[openreview] [pdf]

Abstract This paper investigates list replicability [Dixon et al., 2023] in the context of multi-armed (also linear) bandits (MAB). We define an algorithm $A$ for MAB to be $(\ell,\delta)$-list replicable if with probability at least $1-\delta$, $A$ has at most $\ell$ traces in independent executions even with different random bits, where a trace means the sequence of arms played during an execution. For $k$-armed bandits, although the total number of traces can be $\Omega(k^T)$ for a time horizon $T$, we present several surprising upper bounds that are either independent of or logarithmic in $T$: (1) a $(2^{k},\delta)$-list replicable algorithm with near-optimal regret, $\widetilde{O}(\sqrt{kT})$, (2) a $(O(k/\delta),\delta)$-list replicable algorithm with regret $\widetilde{O}\left(\frac{k}{\delta}\sqrt{kT}\right)$, (3) a $((k+1)^{B-1}, \delta)$-list replicable algorithm with regret $\widetilde{O}(k^{\frac{3}{2}}T^{\frac{1}{2}+2^{-\Omega(B)}})$ for any integer $B>1$. We show that result (3) is nearly tight by establishing that there is no $(k-1,\delta)$-list replicable algorithm with $o(T)$-regret, almost exactly matching the $k$-list replicable upper bound for $B=2$. We further show that for linear bandits with $d$-dimensional features, there is a $\widetilde{O}(d^2T^{1/2+2^{-\Omega(B)}})$-regret algorithm with $((2d+1)^{B-1},\delta)$-list replicability, for $B>1$, even when the number of possible arms can be infinite.

372Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data

[openreview] [pdf]

Abstract Diffusion Transformer, the backbone of Sora for video generation, successfully scales the capacity of diffusion models, pioneering new avenues for high-fidelity sequential data generation. Unlike static data such as images, sequential data consists of consecutive data frames indexed by time, exhibiting rich spatial and temporal dependencies. These dependencies represent the underlying dynamic model and are critical to validate the generated data. In this paper, we make the first theoretical step towards bridging diffusion transformers for capturing spatial-temporal dependencies. Specifically, we establish score approximation and distribution estimation guarantees of diffusion transformers for learning Gaussian process data with covariance functions of various decay patterns. We highlight how the spatial-temporal dependencies are captured and affect learning efficiency. Our study proposes a novel transformer approximation theory, where the transformer acts to unroll an algorithm. We support our theoretical results by numerical experiments, providing strong evidence that spatial-temporal dependencies are captured within attention layers, aligning with our approximation theory.

373Inv-PnCO: Invariant Predict-and-Combinatorial Optimization under Distribution Shifts

[openreview] [pdf]

Abstract Machine learning has been well introduced to solve combinatorial optimization (CO) problems over the past decade, while most works only consider the deterministic setting. Yet in real-world applications, decisions often have to be made in uncertain environments, which is typically reflected by the stochasticity of the coefficients of the problem at hand, considered as a special case of the more general and emerging “predict-and-optimize” (PnO) paradigm in the sense that the prediction and optimization are jointly learned and performed. In this paper, we consider the problem of learning to solve CO under the above uncertain setting and formulate it as “predict-and-combinatorial optimization” (PnCO), particularly in a challenging yet practical out-of-distribution (OOD) setting, where there is a distribution shift between training and testing CO instances. We propose the Invariant Predict-and-Combinatorial Optimization (Inv-PnCO) framework to alleviate this challenge. Inv-PnCO derives a learning objective that reduces the distance between the distribution of solutions and the true distribution, and uses a regularization term to learn invariant decision-oriented factors that are stable under various environments, thereby enhancing the generalizability of predictions and subsequent optimizations. We also provide a theoretical analysis of how the proposed loss reduces OOD error. The empirical evaluation across three distinct tasks on knapsack, visual shortest path planning, and the traveling salesman problem, covering array, image, and graph inputs, underscores the efficacy of Inv-PnCO in enhancing generalizability, both for predict-then-optimize and predict-and-optimize approaches.

374A Causal Lens for Learning Long-term Fair Policies

[openreview] [pdf]

Abstract Fairness-aware learning studies the development of algorithms that avoid discriminatory decision outcomes despite biased training data. While most studies have concentrated on immediate bias in static contexts, this paper highlights the importance of investigating long-term fairness in dynamic decision-making systems while simultaneously considering instantaneous fairness requirements. In the context of reinforcement learning, we propose a general framework where long-term fairness is measured by the difference in the average expected qualification gain that individuals from different groups could obtain. Then, through a causal lens, we decompose this metric into three components that represent the direct impact, the delayed impact, as well as the spurious effect the policy has on the qualification gain. We analyze the intrinsic connection between these components and an emerging fairness notion called benefit fairness that aims to control the equity of outcomes in decision-making. Finally, we develop a simple yet effective approach for balancing various fairness notions.

375VideoGuide: Improving Video Diffusion Models without Training Through a Teacher’s Guide

[openreview] [pdf]

Abstract Text-to-image (T2I) diffusion models have revolutionized visual content creation, but extending these capabilities to text-to-video (T2V) generation remains a challenge, particularly in preserving temporal consistency. Existing methods that aim to improve consistency often cause trade-offs such as reduced imaging quality and impractical computational time. To address these issues, we introduce VideoGuide, a novel framework that enhances the temporal consistency of pretrained T2V models without the need for additional training or fine-tuning. Instead, VideoGuide leverages any pretrained video diffusion model (VDM) or itself as a guide during the early stages of inference, improving temporal quality by interpolating the guiding model’s denoised samples into the sampling model’s denoising process. The proposed method brings about significant improvement in temporal consistency and image fidelity, providing a cost-effective and practical solution that synergizes the strengths of various video diffusion models. Furthermore, we demonstrate prior distillation, revealing that base models can achieve enhanced text coherence by utilizing the superior data prior of the guiding model through the proposed method. Project Page: https://videoguide2025.github.io/
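The interpolation step can be sketched compactly. The loop below blends one-step outputs of the sampling model and the guiding model during an early fraction of inference; `step_fn` and `guide_step_fn` are hypothetical denoising hooks, and blending one-step outputs (rather than the paper's denoised-sample interpolation with re-noising) is a simplification.

```python
# Hedged sketch of VideoGuide-style early-stage guidance during sampling.
def guided_sampling(x, step_fn, guide_step_fn, timesteps,
                    guide_frac=0.5, lam=0.3):
    """step_fn / guide_step_fn: map the latent x_t at time t to x_{t-1} under
    the sampling model and the guiding model, respectively."""
    cutoff = int(len(timesteps) * guide_frac)
    for i, t in enumerate(timesteps):
        x_next = step_fn(x, t)
        if i < cutoff:                         # guide only the early steps
            x_next = (1 - lam) * x_next + lam * guide_step_fn(x, t)
        x = x_next
    return x
```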

376A Defense of One-Step Learning: Examining Single-Batch Distillations

[openreview] [pdf]

Abstract Dataset distillation produces a compressed synthetic dataset that approximates a large dataset or other learning task. A model can be trained on a distillation in a single gradient descent step. Conventional wisdom suggests that single-step learning is not generalizable and should yield poor performance; yet, distillation defies these expectations with good approximations of full direct-task training for a large distribution of models. In order to understand how distilled datasets can perform one-shot learning, we examine the distilled data instances and the cost surfaces produced by the distilled datasets. We demonstrate that the distilled dataset not only mimics features of the true dataset but also produces cost surfaces such that one-step training leads models from the initialization space into local minima of the true task’s cost surface. This shows how one-step learning’s counter-intuitive success is not only reasonable but also the expected outcome of dataset distillation.

377Learning by Causality to Improve Channel Dependency Modeling in Multivariate Time Series Forecasting

[openreview] [pdf]

Abstract Beyond conventional long-term temporal dependency modeling, multivariate time series (MTS) forecasting has rapidly shifted toward channel dependency (CD) modeling. This shift significantly improves modeling quality by fully leveraging both multivariate relationships and temporal dependencies. Recent methods primarily model channel dependency through correlation learning (e.g., cross-attention) or non-trainable statistical techniques (e.g., cross-correlation). However, these approaches struggle to fully capture the intrinsic relationships within MTS, particularly those stemming from directed cause-effect relations (i.e., causality) and non-stationary variates originating from diverse sources. In addition, causality may arise from signals with different temporal behaviors, such as varying periodicity or discrete event sequences, which has not been sufficiently discussed before. In this paper, we propose CALAS (Causality-enhanced Attention with Learnable and Adaptive Spacing), the first end-to-end learning method for MTS forecasting that uncovers causality among variates without relying on statistical measures or prior knowledge. To model the underlying causality, which consists of causal strength and propagation delay, we design a hypernetwork-based 1D convolution mechanism. Inspired by dilated convolution with learnable spacings (DCLS) and spiking neural networks (SNNs), we extend discrete time delays into a continuous Gaussian kernel. Combining the hypernetwork-generated Gaussian kernel and convolutional weights (i.e., attention or causal strength), we achieve an end-to-end dynamic causality modeling mechanism. This mechanism enhances the model’s ability to capture time-varying causality across multi-source variates, ultimately improving prediction accuracy, quality, and interpretability. For evaluation, we conduct extensive experiments on six real-world datasets and qualitative analyses to demonstrate CALAS’s superiority in capturing varying causality in a data-agnostic manner. The results indicate that CALAS significantly improves MTS forecasting accuracy compared to state-of-the-art methods by dynamically modeling causality among variates.
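The continuous-delay construction has a compact core: a strength, a delay, and a width (which a hypernetwork would emit per variate pair) define a Gaussian kernel over discrete lag positions, which then acts as 1D-convolution weights. The sketch below assumes this DCLS-style reading; shapes and names are illustrative.

```python
# Hedged sketch of a Gaussian-kernel continuous delay for causal 1D convolution.
import torch
import torch.nn.functional as F

def gaussian_delay_kernel(a, mu, sigma, K):
    """a: causal strength, mu: delay in steps (continuous), sigma: width.
    Returns (K,) conv weights concentrated around lag mu."""
    lags = torch.arange(K, dtype=torch.float32)
    w = torch.exp(-0.5 * ((lags - mu) / sigma) ** 2)
    return a * w / w.sum()                    # normalized over lag positions

# e.g. one source->target pair: strength 0.8, delay 3.4 steps, width 1.2
k = gaussian_delay_kernel(torch.tensor(0.8), torch.tensor(3.4),
                          torch.tensor(1.2), K=8)
x = torch.randn(1, 1, 64)                     # (batch, channel, time)
y = F.conv1d(x, k.view(1, 1, -1))             # delayed, strength-scaled signal
```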

378EraseDiff: Erasing Data Influence in Diffusion Models

[openreview] [pdf]

Abstract We introduce EraseDiff, an unlearning algorithm designed for diffusion models to address concerns related to data memorization. Our approach formulates the unlearning task as a constrained optimization problem, aiming to preserve the utility of the diffusion model on retained data while removing the information associated with the data to be forgotten. This is achieved by altering the generative process to deviate away from the ground-truth denoising procedure. To manage the computational complexity inherent in the diffusion process, we develop a first-order method for solving the optimization problem, which has shown empirical benefits. Extensive experiments and thorough comparisons with state-of-the-art algorithms demonstrate that EraseDiff effectively preserves the model’s utility, efficacy, and efficiency.

379Process-Driven Autoformalization in Lean 4

[openreview] [pdf]

Abstract Autoformalization, the conversion of natural language mathematics into formal languages, offers significant potential for advancing mathematical reasoning. However, existing efforts are limited to formal languages with substantial online corpora and struggle to keep pace with rapidly evolving languages like Lean 4. To bridge this gap, we propose a large-scale dataset, Formalization for Lean 4 (FormL4), designed to comprehensively evaluate the autoformalization capabilities of large language models (LLMs), encompassing both statements and proofs in natural and formal languages. Additionally, we introduce the Process-Driven Autoformalization (PDA) framework, which leverages precise feedback from Lean 4 compilers to enhance autoformalization. Extensive experiments demonstrate that PDA improves autoformalization, enabling higher compiler accuracy and human-evaluation scores using less filtered training data. Moreover, when fine-tuned with data containing detailed process information, PDA exhibits enhanced data utilization, resulting in more substantial improvements in autoformalization for Lean 4.

380Bayesian Active Learning By Distribution Disagreement

[openreview] [pdf]

Abstract Active Learning (AL) for regression has been systematically under-researched due to the increased difficulty of measuring uncertainty in regression models. Since normalizing flows offer a full predictive distribution instead of a point forecast, they facilitate direct usage of known heuristics for AL like Entropy or Least-Confident sampling. However, we show that most of these heuristics do not work well for normalizing flows in pool-based AL, and more sophisticated algorithms are needed to distinguish between aleatoric and epistemic uncertainty. In this work, we propose BALSA, an adaptation of the BALD algorithm tailored for regression with normalizing flows. With this work we extend current research on uncertainty quantification with normalizing flows to real-world data and pool-based AL with multiple acquisition functions and query sizes. We report SOTA results for BALSA across 4 different datasets and 2 different architectures.
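A BALD-style score for density models can be estimated purely from samples, since flows expose log-probabilities but no closed-form entropy. The sketch below assumes a hypothetical `sample`/`log_prob` interface over an ensemble of conditional flows; BALSA's actual acquisition may differ.

```python
# Hedged sketch: sample-based mutual-information (BALD-style) acquisition.
import math
import torch

def bald_score(flows, x, n_samples=64):
    """flows: M conditional models, each with .sample(x, n) -> (n,) and
    .log_prob(y, x) -> (n,). Higher score = more epistemic disagreement."""
    M = len(flows)
    ys = torch.cat([f.sample(x, n_samples) for f in flows])   # mixture samples
    logs = torch.stack([f.log_prob(ys, x) for f in flows])    # (M, M*n)
    log_mix = torch.logsumexp(logs, dim=0) - math.log(M)
    H_mix = -log_mix.mean()                                   # mixture entropy
    H_members = torch.stack(
        [-f.log_prob(f.sample(x, n_samples), x).mean() for f in flows]).mean()
    return (H_mix - H_members).item()          # approx. I(y; model | x)
```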

381LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models

[openreview] [pdf]

Abstract Customization generation techniques have significantly advanced the synthesis of specific concepts across varied contexts. Multi-concept customization emerges as a particularly challenging task within this domain. Existing approaches often rely on training a fusion matrix of multiple Low-Rank Adaptations (LoRAs) to merge various concepts into a single image. However, we identify that this straightforward method faces two major challenges: 1) concept confusion, where the model struggles to preserve distinct individual characteristics, and 2) concept vanishing, where the model fails to generate the intended subjects. To address these issues, we introduce LoRA-Composer, a training-free framework designed for seamlessly integrating multiple LoRAs, thereby enhancing the harmony among different concepts within generated images. LoRA-Composer addresses concept vanishing through concept injection constraints, enhancing concept visibility via an expanded cross-attention mechanism. To combat concept confusion, concept isolation constraints are introduced, refining the self-attention computation. Furthermore, latent re-initialization is proposed to effectively stimulate concept-specific latents within designated regions. Our extensive testing showcases a notable enhancement in LoRA-Composer’s performance compared to standard baselines, especially when eliminating image-based conditions such as canny edges or pose estimation.

382ϕ-Update: A Class of Policy Update Methods with Policy Convergence Guarantee

[openreview] [pdf]

Abstract Inspired by the similar update pattern of softmax natural policy gradient and Hadamard policy gradient, we propose to study a general policy update rule called ϕ-update, where ϕ refers to a scaling function applied to advantage functions. Under very mild conditions on ϕ, the global asymptotic convergence of state values under ϕ-update is first established. We then show that the policy produced by ϕ-update indeed converges, even when there are multiple optimal policies. This is in stark contrast to existing results, where explicit regularizations are required to guarantee the convergence of the policy. Since softmax natural policy gradient is an instance of ϕ-update, this provides an affirmative answer to the question of whether the policy produced by softmax natural policy gradient converges. The exact asymptotic convergence rate of state values is further established based on the policy convergence. Lastly, we establish the global linear convergence of ϕ-update.
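In the tabular case the rule is one line: reweight the current policy by ϕ of the advantage and renormalize. Below is a hedged sketch, with ϕ(x) = exp(η x) recovering the softmax natural policy gradient instance mentioned above; the admissible conditions on ϕ are in the paper.

```python
# Tabular sketch of the ϕ-update rule.
import numpy as np

def phi_update(pi, advantage, phi):
    """pi: (S, A) current policy; advantage: (S, A); phi: elementwise callable."""
    new_pi = pi * phi(advantage)
    return new_pi / new_pi.sum(axis=1, keepdims=True)

pi = np.full((3, 2), 0.5)                                # uniform start
adv = np.array([[0.2, -0.2], [1.0, -1.0], [0.0, 0.0]])
pi = phi_update(pi, adv, phi=lambda a: np.exp(0.5 * a))  # softmax-NPG instance
```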

383Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better

[openreview] [pdf]

Abstract Diffusion Models (DM) and Consistency Models (CM) are two types of popular generative models with good generation quality on various tasks. When training DM and CM, intermediate weight checkpoints are not fully utilized and only the last converged checkpoint is used. In this work, we find that proper checkpoint merging can significantly improve training convergence and final performance. Specifically, we propose LCSC, a simple but effective and efficient method to enhance the performance of DM and CM by combining checkpoints along the training trajectory with coefficients deduced from evolutionary search. We demonstrate the value of LCSC through two use cases: (a) Reducing training cost. With LCSC, we only need to train DM/CM with fewer iterations and/or lower batch sizes to obtain comparable sample quality with the fully trained model. For example, LCSC achieves considerable training speedups for CM (23× on CIFAR-10 and 15× on ImageNet-64). (b) Enhancing pre-trained models. When full training is already done, LCSC can further improve the generation quality or efficiency of the final converged models. For example, LCSC achieves better FID using a single network function evaluation (NFE) than the base model with 2 NFEs on consistency distillation, and decreases the NFE of DM from 15 to 9 while maintaining the generation quality. Applying LCSC to large text-to-image models, we also observe clearly enhanced generation quality.
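Both ingredients, the linear combination and the coefficient search, fit in a short sketch. The (1+1)-style evolutionary loop below is a generic stand-in for the paper's search scheme, and `score_fn` abstracts the quality metric (e.g., FID on a small sample set); float parameter tensors are assumed.

```python
# Hedged sketch of LCSC: combine saved checkpoints, search the coefficients.
import copy
import torch

def combine_checkpoints(ckpts, coeffs):
    """ckpts: list of state_dicts along the trajectory; coeffs: list of floats."""
    out = copy.deepcopy(ckpts[0])
    for key in out:
        out[key] = sum(c * ck[key] for c, ck in zip(coeffs, ckpts))
    return out

def evolve_coeffs(ckpts, score_fn, iters=100, sigma=0.05):
    coeffs = torch.full((len(ckpts),), 1.0 / len(ckpts))
    best = score_fn(combine_checkpoints(ckpts, coeffs.tolist()))
    for _ in range(iters):
        cand = coeffs + sigma * torch.randn_like(coeffs)    # mutate
        score = score_fn(combine_checkpoints(ckpts, cand.tolist()))
        if score < best:                                    # lower = better (FID)
            best, coeffs = score, cand
    return coeffs, best
```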

384Long-tailed Adversarial Training with Self-Distillation

[openreview] [pdf]

Abstract Adversarial training significantly enhances adversarial robustness, yet superior performance is predominantly achieved on balanced datasets. Addressing adversarial robustness in the context of unbalanced or long-tailed distributions is considerably more challenging, mainly due to the scarcity of tail data instances. Previous research on adversarial robustness within long-tailed distributions has primarily focused on combining traditional long-tailed natural training with existing adversarial robustness methods. In this study, we provide an in-depth analysis of why adversarial training struggles to achieve high performance on tail classes in long-tailed distributions. Furthermore, we propose a simple yet effective solution to advance adversarial robustness on long-tailed distributions through a novel self-distillation technique. Specifically, this approach leverages a balanced self-teacher model, which is trained using a balanced dataset sampled from the original long-tailed dataset. Our extensive experiments demonstrate state-of-the-art performance in both clean and robust accuracy for long-tailed adversarial robustness, with significant improvements in tail class performance on various datasets. We improve the accuracy against PGD attacks for tail classes by 20.3, 7.1, and 3.8 percentage points on CIFAR-10, CIFAR-100, and Tiny-ImageNet, respectively, while achieving the highest robust accuracy.
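The balanced self-teacher can be sketched in two parts: a class-balanced index sampler and a distillation term added to the student's adversarial training loss. Sampler details and loss weights below are assumptions.

```python
# Hedged sketch of the balanced-subset sampling behind the self-teacher.
from collections import defaultdict
import numpy as np

def balanced_subset(labels, per_class, seed=0):
    """Return indices containing at most `per_class` examples of each class."""
    buckets = defaultdict(list)
    for i, y in enumerate(labels):
        buckets[y].append(i)
    rng = np.random.default_rng(seed)
    idx = [i for ys in buckets.values()
           for i in rng.choice(ys, min(per_class, len(ys)), replace=False)]
    return np.array(sorted(idx))

# Teacher: adversarially train a model on data[balanced_subset(labels, k)].
# Student: robust CE on the full long-tailed data
#          + lambda * KL(student(x_adv) || teacher(x_adv)).
```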

385Guided-BFNs: Towards Visualizing and Understanding Bayesian Flow Networks in the Context of Trajectory Planning

[openreview] [pdf]

Abstract Bayesian Flow Networks (BFNs) represent an emerging class of generative models that exhibit promising capabilities in modeling continuous, discretized, and discrete data. In this paper, we develop Guided-BFNs to integrate BFNs with conditional guidance and gradient guidance to facilitate the effective application of such models in trajectory planning tasks. Based on our developments, we can better comprehend BFNs by inspecting the generation dynamics of the planning trajectories. Through extensive parameter tuning and rigorous ablation experiments, we systematically delineate the functional roles of various parameters and elucidate the pivotal components within the structure of BFNs. Furthermore, we conduct a comparative analysis of the planning results between diffusion models and BFNs, to discern their similarities and differences. Additionally, we undertake efforts to augment the performance of BFNs, including developing a faster and training-free sampling algorithm for sample generation. Our objectives encompass not only a comprehensive exploration of BFNs’ structural insights but also the enhancement of their practical utility.

386DET: Learn to Solve the Tunnel Traveling Salesmen Problem using Double-Encoder Transformer

[openreview] [pdf]

Abstract We delve into a challenging variant of the Traveling Salesman Problem (TSP), namely tunnel TSP, which incorporates a new important constraint requiring the traversal of a prescribed set of tunnels. While traditional deep reinforcement learning (DRL) based neural TSP algorithms excel in optimizing routes without tunnel restrictions, they often struggle to achieve optimal performance in tunnel TSP due to the neglect of the crucial role of tunnel attributes during solution generation. To address this challenge, we propose a simple but effective and flexible technique, called Double-Encoder Transformer (DET), which can be seamlessly integrated into various existing autoregressive neural TSP solvers. DET processes node and tunnel location information separately and encodes them in two distinct feature spaces. Following an efficient fusion strategy, DET then integrates the encoded information from nodes and tunnels, harnessing their intricate interactions. Experimental validation demonstrates that integrating DET into existing autoregressive neural solvers significantly improves performance, enabling us to reduce the average optimality gap for tunnel TSP from 12.58% (of the previous Single-Encoder model) to 7.35%.

387Generating Model Parameters for Controlling: Parameter Diffusion for Controllable Multi-Task Recommendation

[openreview] [pdf]

Abstract Commercial recommender systems face the challenge that task requirements from platforms or users often change dynamically (e.g., varying preferences for accuracy or diversity). Ideally, the model should be re-trained after resetting a new objective function, adapting to these changes in task requirements. However, in practice, the high computational costs associated with retraining make this process impractical for models already deployed to online environments. This raises a new challenging problem: how to efficiently adapt the learning model to different task requirements by controlling model parameters after deployment, without the need for retraining. To address this issue, we propose a novel controllable learning approach via Parameter Diffusion for controllable multi-task Recommendation (PaDiRec), which allows the customization and adaptation of recommendation model parameters to new task requirements without retraining. Specifically, we first obtain the optimized model parameters through adapter tuning based on the feasible task requirements. Then, we utilize the diffusion model as a parameter generator, employing classifier-free guidance in conditional training to learn the distribution of optimized model parameters under various task requirements. Finally, the diffusion model is applied to effectively generate model parameters in a test-time adaptation manner given task requirements. As a model-agnostic approach, PaDiRec can leverage existing recommendation models as backbones to enhance their controllability. Extensive experiments on public datasets and a dataset from a commercial app indicate that PaDiRec can effectively enhance controllability through efficient model parameter generation. The code is released at https://anonymous.4open.science/r/PaDiRec-DD13e.

388Characterizing Context Influence and Hallucination in Summarization

[openreview] [pdf]

Abstract Although Large Language Models (LLMs) have achieved remarkable performance in numerous downstream tasks, their ubiquity has raised two significant concerns. One is that LLMs can hallucinate by generating content that contradicts relevant contextual information; the other is that LLMs can inadvertently leak private information due to input regurgitation. Many prior works have extensively studied each concern independently, but none have investigated them simultaneously. Furthermore, auditing the influence of provided context during open-ended generation with a privacy emphasis is understudied. To this end, we comprehensively characterize the influence and hallucination of contextual information during summarization. We introduce a definition for context influence and Context-Influence Decoding (CID), and then we show that amplifying the context (by factoring out prior knowledge) and the context being out of distribution with respect to prior knowledge both increase the context’s influence on an LLM. Moreover, we show that context influence gives a lower bound on the private information leakage of CID. We corroborate our analytical findings with experimental evaluations that show that improving the F1 ROUGE-L score on CNN-DM for LLaMA 3 by 10% over regular decoding also leads to 1.5x more influence by the context. Moreover, we empirically evaluate how context influence and hallucination are affected by (1) model capacity, (2) context size, (3) the length of the current response, and (4) different token $n$-grams of the context.
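The step of amplifying the context by factoring out prior knowledge admits a short sketch in the style of contrastive decoding: score next tokens with and without the context and subtract a scaled prior. An HF-style `.logits` interface is assumed, and the exact CID formula may differ.

```python
# Hedged sketch of context amplification for next-token scoring.
import torch

@torch.no_grad()
def cid_next_logits(model, ids_with_ctx, ids_without_ctx, alpha=0.5):
    l_ctx = model(ids_with_ctx).logits[:, -1, :]       # p(y | context, query)
    l_prior = model(ids_without_ctx).logits[:, -1, :]  # p(y | query) prior
    return (1 + alpha) * l_ctx - alpha * l_prior       # factor out the prior
```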

389Which Algorithms Have Tight Generalization Bounds?

[openreview] [pdf]

Abstract We study which machine learning algorithms have tight generalization bounds. First, we present conditions that preclude the existence of tight generalization bounds. Specifically, we show that algorithms that have certain inductive biases that cause them to be unstable do not admit tight generalization bounds. Next, we show that algorithms that are sufficiently stable do have tight generalization bounds. We conclude with a simple characterization that relates the existence of tight generalization bounds to the conditional variance of the algorithm’s loss.

390Frequency-Decoupled Cross-Modal Knowledge Distillation

[openreview] [pdf]

Abstract Knowledge distillation (KD) has proven highly effective for compressing large models and enhancing the performance of smaller ones. However, its effectiveness diminishes in cross-modal scenarios, such as vision-to-language distillation, where inconsistencies in representation across modalities make knowledge transfer difficult. To address this challenge, we propose frequency-decoupled cross-modal knowledge distillation, a method designed to decouple and balance knowledge transfer across modalities by leveraging frequency-domain features. We observe that low-frequency features tend to capture modality-agnostic, generalizable information, while high-frequency features are more modality-specific. Accordingly, we apply distinct losses to these features: enforcing strong alignment in the low-frequency domain and introducing relaxed alignment for high-frequency features. Additionally, we propose a scale consistency loss to address distributional shifts between modalities, and employ a shared classifier to unify feature spaces. Extensive experiments across multiple benchmark datasets show that our method substantially outperforms traditional KD and state-of-the-art cross-modal KD approaches.
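The frequency split itself is a few lines of FFT masking, after which the low band gets a strong alignment loss and the high band a relaxed one. The cutoff and weights below are assumptions.

```python
# Hedged sketch of frequency-decoupled feature alignment.
import torch

def freq_split(feat, cutoff=0.25):
    """feat: (B, C, H, W). Returns (low, high) parts via a centered FFT mask."""
    spec = torch.fft.fftshift(torch.fft.fft2(feat), dim=(-2, -1))
    B, C, H, W = feat.shape
    yy, xx = torch.meshgrid(torch.arange(H), torch.arange(W), indexing="ij")
    mask = (((yy - H // 2).abs() < H * cutoff) &
            ((xx - W // 2).abs() < W * cutoff)).float()
    low = torch.fft.ifft2(torch.fft.ifftshift(spec * mask, dim=(-2, -1))).real
    return low, feat - low

def fd_kd_loss(student_f, teacher_f, w_low=1.0, w_high=0.1):
    s_lo, s_hi = freq_split(student_f)
    t_lo, t_hi = freq_split(teacher_f)
    mse = torch.nn.functional.mse_loss
    return w_low * mse(s_lo, t_lo) + w_high * mse(s_hi, t_hi)  # strong / relaxed
```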

391Learning Transferable Sub-goals by Hypothesizing Generalizing Features

[openreview] [pdf]

Abstract Although transfer is a key promise of hierarchical reinforcement learning, current methods discover nontransferable skills. Typically, skills are defined over all state features simultaneously, preventing generalization as some state features reliably support generalization while others do not. For an agent to effectively transfer a skill it must identify features that generalize and define the skill over this subset. However, this task is under-specified as the agent has no prior knowledge of what future tasks may be introduced. Since successful transfer requires a skill to reliably achieve a sub-goal from different states, we focus our attention on ensuring sub-goals are represented in a transferable way. For each sub-goal, we train an ensemble of classifiers while explicitly incentivizing them to use minimally overlapping features. Each ensemble member represents a unique hypothesis about the transferable features of a sub-goal that the agent can use to learn a skill in previously unseen portions of the environment. Environment reward then determines which hypothesis is most transferable for the given task, based on the intuition that transferable sub-goals lead to better reward maximization. We apply these reusable sub-goals to MiniGrid and Montezuma’s Revenge, allowing us to relearn previously defined skills in unseen parts of the state-space.

392A Super-Aligned Driving Generalist Is Your Cockpit

[openreview] [pdf]

Abstract The intelligent driving cockpit, an important part of intelligent driving, needs to match different users’ comfort, interaction, and safety needs. This paper aims to build a super-aligned and generalist driving agent, Sage Deer. Sage Deer achieves three highlights: (1) Super-aligned: it reacts differently according to different people’s preferences and biases. (2) Generalist: it can understand the user’s physiological indicators, facial emotions, hand movements, body movements, driving scenarios, and behavioral decisions. (3) Multimodal: it can understand RGB, NIR, and depth video to build more robust perception, understanding, and reasoning. To achieve these requirements, we design a retrieval-enhanced multimodal framework. We collected multiple datasets and built a large-scale benchmark that measures Sage Deer’s perceptual decision-making ability and super-alignment accuracy.

393SATCH: Specialized Assistant Teacher Distillation to Reduce Catastrophic Forgetting

[openreview] [pdf]

Abstract Continual learning enables models to learn new tasks sequentially without forgetting previously learned knowledge. Knowledge distillation reduces forgetting by using a single teacher model to transfer previous knowledge to the student model. However, existing methods face challenges, specifically loss of task-specific knowledge, limited diversity in the transferred knowledge, and delays in teacher availability. These issues stem from self-distillation, where the teacher is a mere snapshot of the student after learning a new task, inheriting the student’s biases and becoming available only after learning a task. We propose Specialized Assistant TeaCHer distillation (SATCH), a novel method that uses a smaller assistant teacher trained exclusively on the current task. By incorporating the assistant teacher early in the learning process, SATCH provides task-specific guidance, improves the diversity of transferred knowledge, and preserves critical task-specific insights. Our method integrates seamlessly with existing knowledge distillation techniques, and experiments on three standard continual learning benchmarks show that SATCH improves accuracy by up to 12% when combined with four state-of-the-art methods. Code is available in supplementary materials.

394Scaling Diffusion Language Models via Adaptation from Autoregressive Models

[openreview] [pdf]

Abstract Diffusion Language Models (DLMs) have emerged as a promising new paradigm for text generative modeling, potentially addressing limitations of autoregressive (AR) models. However, current DLMs have been studied at a smaller scale compared to their AR counterparts and lack fair comparison on language modeling benchmarks. Additionally, training diffusion models from scratch at scale remains challenging. Given the prevalence of open-source AR language models, we propose adapting these models to build text diffusion models. We demonstrate connections between AR and diffusion modeling objectives and introduce a simple continual pre-training approach for training diffusion models. Through systematic evaluation on language modeling, reasoning, and commonsense benchmarks, we show that we can convert AR models ranging from 127M to 7B parameters (GPT2 and LLaMA) into diffusion models DiffuGPT and DiffuLLaMA, using less than 200B tokens for training. Our experimental results reveal that these models outperform earlier DLMs and are competitive with their AR counterparts. We release a suite of DLMs (with 127M, 355M, and 7B parameters) capable of generating fluent text, performing in-context learning, filling in the middle without prompt re-ordering, and following instructions.

395Hindsight Preference Learning for Offline Preference-based Reinforcement Learning

[openreview] [pdf]

Abstract Offline preference-based reinforcement learning (RL), which focuses on optimizing policies using human preferences between pairs of trajectory segments selected from an offline dataset, has emerged as a practical avenue for RL applications. Existing works rely on extracting step-wise reward signals from trajectory-wise preference annotations, assuming that preferences correlate with the cumulative Markovian rewards. However, such methods fail to capture the holistic perspective of data annotation: Humans often assess the desirability of a sequence of actions by considering the overall outcome rather than the immediate rewards. To address this challenge, we propose to model human preferences using rewards conditioned on future outcomes of the trajectory segments, i.e. the hindsight information. For downstream RL optimization, the reward of each step is calculated by marginalizing over possible future outcomes, the distribution of which is approximated by a variational auto-encoder trained using the offline dataset. Our proposed method, Hindsight Preference Learning (HPL), can facilitate credit assignment by taking full advantage of vast trajectory data available in massive unlabeled datasets. Comprehensive empirical studies demonstrate the benefits of HPL in delivering robust and advantageous rewards across various domains.

396Latent Diffusion with LLMs for Reasoning

[openreview] [pdf]

Abstract Despite the widespread adoption of large language models with hundreds of billions of parameters, these models still struggle on complex reasoning benchmarks. In this paper, we argue that the autoregressive nature of current language models is not suited for reasoning due to fundamental limitations, and that reasoning requires slow accumulation of knowledge through time. We show that combining latent diffusion models with an encoder-decoder transformer architecture provides a scalable way to address some of the fundamental shortcomings posed by autoregressive models. Diffusion models can arrive at predictions through many forward passes in latent space, and their reasoning is not handicapped by the order of the tokens in the dataset. Through our experiments, we show that latent diffusion language models are a feasible approach towards scalable language models that have general complex reasoning abilities.

397Adversarial Generative Flow Network for Solving Vehicle Routing Problems

[openreview] [pdf]

Abstract Recent research into solving vehicle routing problems (VRPs) has gained significant traction, particularly through the application of deep (reinforcement) learning for end-to-end solution construction. However, many current construction-based neural solvers predominantly utilize Transformer architectures, which can face scalability challenges and struggle to produce diverse solutions. To address these limitations, we introduce a novel framework beyond Transformer-based approaches, i.e., Adversarial Generative Flow Networks (AGFN). This framework integrates the generative flow network (GFlowNet)—a probabilistic model inherently adept at generating diverse solutions (routes)—with a complementary model for discriminating (or evaluating) the solutions. These models are trained alternately in an adversarial manner to improve the overall solution quality, followed by a proposed hybrid decoding method to construct the solution. We apply the AGFN framework to solve the capacitated vehicle routing problem (CVRP) and travelling salesman problem (TSP), and our experimental results demonstrate that AGFN surpasses the popular construction-based neural solvers, showcasing strong generalization capabilities on synthetic and real-world benchmark instances.

398Diffusion Trajectory-guided Policy: A Novel Framework for Long-Horizon Robot Manipulation

[openreview] [pdf]

Abstract Recently, Vision-Language Models (VLMs) have made substantial progress in robot imitation learning, benefiting from increased amounts of demonstration data. However, the high cost of data collection remains a significant bottleneck, and the scarcity of demonstrations often results in poor generalization of the imitation policy, especially in long-horizon robotic manipulation tasks. To address these challenges, we propose the Diffusion Trajectory-guided Policy (DTP) framework, which generates task-relevant trajectories through a diffusion model to guide policy learning for long-horizon tasks. Furthermore, we demonstrate that our DTP method offers a useful interface for prompt engineering, providing a novel way to connect robot manipulation skills with interactions involving LLMs or humans. Our approach employs a two-stage training process: we first train a generative vision-language model to create diffusion-based task-relevant trajectories, then refine the imitation policy using these trajectories. We validate that the DTP method achieves substantial performance improvements in extensive experiments on the CALVIN simulation benchmark, starting from scratch without any external pretraining. Our approach outperforms state-of-the-art baselines by an average of 25% in success rate across various settings.

399Length Desensitization in Direct Preference Optimization

[openreview] [pdf]

Abstract Direct Preference Optimization (DPO) is widely utilized in the Reinforcement Learning from Human Feedback (RLHF) phase to align Large Language Models (LLMs) with human preferences, thereby enhancing both their harmlessness and efficacy. However, it has been observed that DPO tends to over-optimize for verbosity, which can detrimentally affect both performance and user experience. In this paper, we conduct an in-depth theoretical analysis of DPO’s optimization objective and reveal a strong correlation between its implicit reward and data length. This correlation misguides the optimization direction, resulting in length sensitivity during DPO training and leading to verbosity. To address this issue, we propose a length-desensitization improvement method for DPO, termed LD-DPO. The proposed method aims to desensitize DPO to data length by decoupling explicit length preference, which is relatively insignificant, from the other implicit preferences, thereby enabling more effective learning of the intrinsic preferences. We utilized two settings (Base and Instruct) of Llama2-13B, Llama3-8B, and Qwen2-7B for experimental validation on various benchmarks including MT-Bench and AlpacaEval 2. The experimental results indicate that LD-DPO consistently outperforms DPO and other baseline methods, achieving more concise responses with a 10-40% reduction in length compared to DPO. We conducted in-depth experimental analyses to demonstrate that LD-DPO can indeed achieve length desensitization and align the model more closely with human-like preferences. “Brevity is the Soul of Wit.” (William Shakespeare)
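One plausible reading of the decoupling is to count token log-probs up to the shorter response's length in full and down-weight the excess (the explicitly length-driven part) before forming the usual DPO margin. The sketch below follows that reading and is not the paper's verbatim formulation.

```python
# Hedged sketch of length-desensitized log-likelihoods for a DPO-style loss.
import torch
import torch.nn.functional as F

def length_adjusted_logp(token_logps, ref_len, gamma=0.3):
    """token_logps: (L,) per-token log-probs of one response; tokens beyond
    ref_len contribute only a gamma fraction."""
    return token_logps[:ref_len].sum() + gamma * token_logps[ref_len:].sum()

def ld_dpo_loss(lp_c, lp_r, ref_lp_c, ref_lp_r, beta=0.1, gamma=0.3):
    n = min(len(lp_c), len(lp_r))          # shared-length 'intrinsic' part
    margin = beta * ((length_adjusted_logp(lp_c, n, gamma)
                      - length_adjusted_logp(ref_lp_c, n, gamma))
                     - (length_adjusted_logp(lp_r, n, gamma)
                        - length_adjusted_logp(ref_lp_r, n, gamma)))
    return -F.logsigmoid(margin)
```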

400Understanding Generalization of Preference Optimization Under Noisy Feedback

[openreview] [pdf]

Abstract As large language models (LLMs) advance their capabilities, aligning these models with human preferences has become crucial. Preference optimization, which trains models to distinguish between preferred and non-preferred responses based on human feedback, has become a crucial component for aligning LLMs. However, most existing works assume noise-free feedback, which is unrealistic given the inherent errors and inconsistencies in human judgments. This paper addresses the impact of noisy feedback on preference optimization, providing generalization guarantees under these conditions. Unlike traditional analyses that assume convergence, our work focuses on finite-step preference optimization, offering new insights that are more aligned with practical LLM training. We establish generalization guarantees for noisy preference learning under a broad family of preference optimization losses such as DPO, IPO, SLiC, etc. Our analysis provides the basis for a general model that closely describes how the generalization decays with the noise rate. Empirical validation on contemporary LLMs confirms the practical relevance of our findings, offering valuable insights for developing AI systems that align with human preferences.

401FDN: Interpretable Spatiotemporal Forecasting with Future Decomposition Networks

[openreview] [pdf]

Abstract Spatiotemporal systems comprise a collection of spatially distributed yet interdependent entities, each generating unique dynamic signals. Highly sophisticated methods proposed in recent years deliver state-of-the-art (SOTA) forecasts, but few have focused on interpretability. To address this, we propose the Future Decomposition Network (FDN), a novel forecast model capable of (a) providing interpretable predictions through classification, (b) revealing latent activity patterns in the target time series, and (c) delivering forecasts competitive with SOTA methods at a fraction of their memory and runtime cost. We conduct comprehensive analyses of FDN on multiple datasets from hydrologic, traffic, and energy systems, demonstrating its improved accuracy and interpretability.

402Reflect-then-Plan: Offline Model-Based Planning through a Doubly Bayesian Lens

[openreview] [pdf]

Abstract Offline reinforcement learning (RL) is essential when online exploration is costly or unsafe, but it often struggles with high epistemic uncertainty due to limited data. Existing methods learn fixed conservative policies, which limits adaptivity and generalization. To tackle these challenges, we propose Reflect-then-Plan (RefPlan), a novel doubly Bayesian approach for offline model-based (MB) planning that enhances offline-learned policies for improved adaptivity and generalization. RefPlan integrates uncertainty modeling and MB planning in a unified probabilistic framework, recasting planning as Bayesian posterior estimation. During deployment, it updates a belief distribution over environment dynamics based on real-time observations. By incorporating this uncertainty into MB planning via marginalization, RefPlan derives plans that account for unknowns beyond the agent’s limited knowledge. Empirical results on standard benchmarks show that RefPlan significantly improves the performance of conservative offline RL policies. In particular, RefPlan maintains robust performance under high epistemic uncertainty and limited data, while demonstrating resilience to changing environment dynamics, improving the flexibility, generalizability, and robustness of offline-learned policies.

403Anomaly Detection through Conditional Diffusion Probability Modeling on Graphs

[openreview] [pdf]

Abstract Existing Graph Neural Network-based anomaly detection methods suffer from over-smoothing issues during feature aggregation. Moreover, most existing methods are discriminative models that learn the boundaries between anomalous and normal data points, allowing malicious nodes in a dynamic adversarial environment to bypass detection boundaries. Existing methods also primarily focus on enhancing the discriminative boundary for each individual node, rather than considering the interdependencies of node anomalies from a holistic graph perspective. We propose an advanced Conditional Graph Anomaly Diffusion Model (CGADM) to model and capture the joint distribution of anomalies on the whole graph, thereby enabling generative graph anomaly detection. To avoid starting the diffusion process from a random state, CGADM introduces a prior-guided denoising diffusion probability model. To circumvent the need for iterative denoising samplings for each node on large-scale graphs, we adopt a prior confidence-aware mechanism to dynamically adjust the reverse sampling steps for each node, significantly reducing the computational burden on large-scale graphs. We conducted experiments on CGADM using standard benchmarks, and the results demonstrated excellent performance in graph anomaly detection tasks. Additional ablation studies confirmed our framework’s computational advantages.

404Targeted Attack Improves Protection against Unauthorized Diffusion Customization

[openreview] [pdf]

Abstract Diffusion models set a new milestone for image generation yet raise public concerns, as they can be fine-tuned on unauthorized images for customization. Protection based on adversarial attacks has emerged to counter this unauthorized diffusion customization, by adding protective watermarks to images and poisoning diffusion models. However, current protection, leveraging untargeted attacks, does not appear to be effective enough. In this paper, we propose a simple yet effective improvement for the protection against unauthorized diffusion customization by introducing targeted attacks. We show that by carefully selecting the target, targeted attacks significantly outperform untargeted attacks in poisoning diffusion models and degrading the customization image quality. Extensive experiments validate the superiority of our method on two mainstream customization methods of diffusion models, compared to existing protections. To explain the surprising success of targeted attacks, we delve into the mechanism of attack-based protections and propose a hypothesis based on our observation, which enhances the comprehension of attack-based protections. To the best of our knowledge, we are the first to both reveal the vulnerability of diffusion models to targeted attacks and leverage targeted attacks to enhance protection against unauthorized diffusion customization.

405Diffusion Minimization and Sheaf Neural Networks for Recommender Systems

[openreview] [pdf]

Abstract Graph Neural Networks (GNNs) are well-known for successful applications in recommender systems. Despite recent advances in GNN development, various authors report that in certain cases GNNs suffer from so-called oversmoothing problems. Sheaf Neural Networks (SNNs) are one way to address the issue of oversmoothing. In the present work, we propose a novel approach for training SNNs together with user and item embeddings. In this approach, the parameters of the sheaf are inferred via minimization of the classical BPR loss and sheaf diffusion on graphs, subject to orthogonality and consistency constraints. The performance of the novel technique is evaluated on synthetic test cases and standard benchmarks for recommendations.

406Learning Augmentation Policies from A Model Zoo for Time Series Forecasting

[openreview] [pdf]

Abstract Time series forecasting models typically rely on a fixed-size training set and treat all data uniformly, which may not effectively capture the specific patterns present in more challenging training samples. To address this issue, we introduce AutoTSAug, a learnable data augmentation method based on reinforcement learning. Our approach begins with an empirical analysis to determine which parts of the training data should be augmented. Specifically, we identify the so-called marginal samples by considering the prediction diversity across a set of pretrained forecasting models. Next, we propose using variational masked autoencoders as the augmentation model and applying the REINFORCE algorithm to transform the marginal samples into new data. The goal of this generative model is not only to mimic the distribution of real data but also to reduce the variance of prediction errors across the model zoo. By augmenting the marginal samples with a learnable policy, AutoTSAug substantially improves forecasting performance, advancing the prior art in this field with minimal additional computational cost.
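The marginal-sample selection step reduces to a disagreement statistic over the model zoo. A hedged sketch, with the variance statistic and the selection fraction as assumptions:

```python
# Hedged sketch: flag samples where pretrained forecasters disagree most.
import numpy as np

def select_marginal(zoo_preds, top_frac=0.2):
    """zoo_preds: (M, N, H) forecasts of M zoo models for N samples over
    horizon H. Returns indices of the most 'marginal' samples."""
    diversity = zoo_preds.var(axis=0).mean(axis=-1)   # (N,) cross-model spread
    k = max(1, int(top_frac * diversity.shape[0]))
    return np.argsort(-diversity)[:k]

# These indices feed the masked-VAE augmenter trained with REINFORCE.
```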

407One Training Fits All: Generalized Data Condensation via Mixture-of-Information Bottleneck Guidance

[openreview] [pdf]

Abstract Data condensation (DC) technologies are widely used in buffer-constrained scenarios to reduce the memory demand of training samples and maintain DNN training performance. However, due to the storage constraints of deployment devices and the high energy costs of the condensation procedure, synthetic datasets generated by DC often have inferior performance in terms of training efficiency and scalability, which greatly limits their practical application on various edge devices. This dilemma arises for two reasons: i) existing state-of-the-art (SoTA) data condensation approaches update synthetic datasets by intuitively matching intermediate training outputs (e.g., gradients, features and distributions) between real and synthetic datasets, without improving the representational capability of the useful information they contain; ii) DC lacks sufficient consideration for the heterogeneity of storage constraints among various edge devices, which results in large training overheads (i.e., computation or storage). To tackle the above issues, we propose a novel method named Mixture-of-Information Bottleneck Dataset Condensation (MIBDC), which employs information bottlenecks from synthetic datasets with various Image Per Class (IPC) numbers to improve overall DC generalization and scalability. Specifically, in this paper, the following two phenomena are found: i) the quality of synthetic datasets improves with increased synthetic dataset quantity; ii) the smaller the synthetic dataset, the earlier it reaches its convergence peak. Based on these two findings, this paper proposes that i) large synthetic datasets can guide the better convergence of smaller ones and ii) the information contained in synthetic datasets with different IPC numbers can play a collaborative role in guiding dataset condensation generalization. Comprehensive experimental results on three well-known datasets show that, compared with state-of-the-art dataset condensation methods, MIBDC not only enhances the generalization performance of trained models but also achieves superior scalability.

408HDDI: A Historical Data-Based Diffusion Imputation Method for High-Accuracy Recovery in Multivariate Time Series with High Missing Rate and Long-Term Gap

[openreview] [pdf]

Abstract Multivariate time series data often face the challenge of missing values, which can impact the performance of subsequent tasks. Although some deep learning-based imputation methods perform well, they still struggle with insufficient training data due to high missing rates and long-term missing data. To address these challenges, we propose a Historical Data-based Multivariate Time Series Diffusion Imputation (HDDI) method. Unlike existing deep learning-based imputation methods, we design a historical data supplement module to match and fuse historical data to supplement the training data. Additionally, we propose a diffusion imputation module that utilizes the supplemented training data to achieve high-accuracy imputation even under high missing rates and long-term missing scenarios. We conduct extensive experiments on five public multivariate time series datasets; the results show that our HDDI outperforms baseline methods across all five datasets. In particular, when the data missing rate is 90%, HDDI improves accuracy by 25.15% compared to the best baseline method in the random missing scenario, and by 13.64% in the long-term missing scenario. The code is available at https://github.com/liuyu3880/HDDIproject.

409Scenario-Wise Rec: A Multi-Scenario Recommendation Benchmark

[openreview] [pdf]

Abstract Multi-Scenario Recommendation (MSR), which refers to building a unified model to enhance performance across all recommendation scenarios, has recently gained much attention. However, current research in MSR faces two significant challenges that hinder the field’s development: the absence of uniform procedures for multi-scenario dataset processing, which hinders fair comparisons, and the fact that most models are closed-source, which complicates comparisons with current SOTA models. Consequently, we introduce our benchmark, Scenario-Wise Rec, which comprises six public datasets and twelve benchmark models, along with a training and evaluation pipeline. We have also validated our benchmark using an industrial advertising dataset, further enhancing its real-world reliability. We aim for this benchmark to provide researchers with valuable insights from prior works, enabling the development of novel models based on our benchmark and thereby fostering a collaborative research ecosystem in MSR. Our source code is also available.

410RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models

[openreview] [pdf]

Abstract Differentially private diffusion models (DPDMs) harness the remarkable generative capabilities of diffusion models while enforcing differential privacy (DP) for sensitive data. However, existing DPDM training approaches often suffer from significant utility loss, large memory footprint, and expensive inference cost, impeding their practical uses. To overcome such limitations, we present RAPID: Retrieval Augmented PrIvate Diffusion model, a novel approach that integrates retrieval augmented generation (RAG) into DPDM training. Specifically, RAPID leverages available public data to build a knowledge base of sample trajectories; when training the diffusion model on private data, RAPID computes the early sampling steps as queries, retrieves similar trajectories from the knowledge base as surrogates, and focuses on training the later sampling steps in a differentially private manner. Extensive evaluation using benchmark datasets and models demonstrates that, with the same privacy guarantee, RAPID significantly outperforms state-of-the-art approaches by large margins in generative quality, memory footprint, and inference cost, suggesting that retrieval-augmented DP training represents a promising direction for developing future privacy-preserving generative models (code and data are available in the submitted supplemental materials).

411Prototype-based Optimal Transport for Out-of-Distribution Detection

[openreview] [pdf]

Abstract Detecting Out-of-Distribution (OOD) inputs is crucial for improving the reliability of deep neural networks in the real-world deployment. In this paper, inspired by the inherent distribution shift between ID and OOD data, we propose a novel method that leverages optimal transport to measure the distribution discrepancy between test inputs and ID prototypes. The resulting transport costs are used to quantify the individual contribution of each test input to the overall discrepancy, serving as a desirable measure for OOD detection. To address the issue that solely relying on the transport costs to ID prototypes is inadequate for identifying OOD inputs closer to ID data, we generate virtual outliers to approximate the OOD region via linear extrapolation. By combining the transport costs to ID prototypes with the costs to virtual outliers, the detection of OOD data near ID data is emphasized, thereby enhancing the distinction between ID and OOD inputs. Experiments demonstrate the superiority of our method over state-of-the-art methods.
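
A self-contained sketch of the scoring idea under stated assumptions: entropic OT is solved with plain Sinkhorn iterations, virtual outliers are built by linearly extrapolating prototypes away from their mean, and the cost normalization is our choice rather than the paper's:

```python
import numpy as np

def sinkhorn_plan(M, reg=0.1, iters=200):
    """Entropic OT plan between uniform marginals via Sinkhorn iterations."""
    a = np.full(M.shape[0], 1.0 / M.shape[0])
    b = np.full(M.shape[1], 1.0 / M.shape[1])
    K = np.exp(-M / reg)
    u = np.ones_like(a)
    for _ in range(iters):
        v = b / (K.T @ u)
        u = a / (K @ v)
    return u[:, None] * K * v[None, :]

def ood_scores(test_feats, prototypes, alpha=2.0):
    """Per-sample transport-cost contribution to ID prototypes, contrasted
    with the cost to virtual outliers obtained by linear extrapolation."""
    mean = prototypes.mean(axis=0, keepdims=True)
    outliers = prototypes + alpha * (prototypes - mean)   # extrapolate away from ID
    def per_sample_cost(targets):
        M = ((test_feats[:, None, :] - targets[None, :, :]) ** 2).sum(-1)
        P = sinkhorn_plan(M)
        return (P * M).sum(axis=1) * M.shape[0]           # each row's cost share
    # Higher score = costlier to reach ID than the outlier region = more OOD
    return per_sample_cost(prototypes) - per_sample_cost(outliers)
```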

412Pullback Flow Matching on Data Manifolds

[openreview] [pdf]

Abstract We propose Pullback Flow Matching (PFM), a novel framework for generative modeling on data manifolds. Unlike existing methods that assume or learn restrictive closed-form manifold mappings for training Riemannian Flow Matching (RFM) models, PFM leverages pullback geometry and isometric learning to preserve the underlying manifold’s geometry while enabling efficient generation and precise interpolation in latent space. This approach not only facilitates closed-form mappings on the data manifold but also allows for designable latent spaces, using assumed metrics on both data and latent manifolds. By enhancing isometric learning through Neural ODEs and proposing a scalable training objective, we achieve a latent space more suitable for interpolation, leading to improved manifold learning and generative performance. We demonstrate PFM’s effectiveness through applications in synthetic data, protein dynamics and protein sequence data, generating novel proteins with specific properties. This method shows strong potential for drug discovery and materials science, where generating novel samples with specific properties is of great interest.

413Alignment without Over-optimization: Training-Free Solution for Diffusion Models

[openreview] [pdf]

Abstract Diffusion models excel in generative tasks, but aligning them with specific objectives while maintaining their versatility remains challenging. Existing fine-tuning methods often suffer from reward over-optimization, while approximate guidance approaches fail to effectively optimize target rewards. Addressing these limitations, we propose a training-free sampling method based on Sequential Monte Carlo (SMC) to sample from the reward-aligned target distribution. Our approach, tailored for diffusion sampling and incorporating tempering techniques, achieves comparable or superior target rewards to fine-tuning methods while preserving diversity and cross-reward generalization. We demonstrate its effectiveness in single-reward optimization, multi-objective scenarios, and online black-box optimization. This work offers a robust solution for aligning diffusion models with diverse downstream objectives without compromising their general capabilities.
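
A generic sketch of tempered SMC over diffusion particles, not the paper's exact sampler; `denoise_step`, `predict_x0`, and the reward interface are assumed placeholders:

```python
import numpy as np

def smc_sample(denoise_step, predict_x0, reward, x_T, lambdas, rng=None):
    """Training-free reward alignment via tempered SMC (a sketch). Particles
    are reweighted by a tempered reward on the current clean-image estimate
    and resampled at each denoising step.

    denoise_step(x, t) -> x at step t-1 ; predict_x0(x, t) -> clean estimate
    lambdas : per-step tempering coefficients (e.g., increasing toward t = 0)
    """
    rng = rng or np.random.default_rng()
    x, n = x_T, x_T.shape[0]
    for t, lam in zip(range(len(lambdas), 0, -1), lambdas):
        x = denoise_step(x, t)
        logw = lam * reward(predict_x0(x, t))
        w = np.exp(logw - logw.max()); w /= w.sum()
        # Systematic resampling concentrates particles on high-reward regions
        u = (np.arange(n) + rng.random()) / n
        x = x[np.minimum(np.searchsorted(np.cumsum(w), u), n - 1)]
    return x
```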

414AN INFORMATION THEORETIC EVALUATION METRIC FOR STRONG UNLEARNING

[openreview] [pdf]

Abstract Machine unlearning (MU) aims to remove the influence of specific data from trained models, addressing privacy concerns and ensuring compliance with regulations such as the “right to be forgotten.” Evaluating strong unlearning, where the unlearned model is indistinguishable from one retrained without the forgetting data, remains a significant challenge in deep neural networks (DNNs). Common black-box metrics, such as variants of membership inference attacks and accuracy comparisons, primarily assess model outputs but often fail to capture residual information in intermediate layers. To bridge this gap, we introduce the Information Difference Index (IDI), a novel white-box metric inspired by information theory. IDI quantifies retained information in intermediate features by measuring mutual information between those features and the labels to be forgotten, offering a more comprehensive assessment of unlearning efficacy. Our experiments demonstrate that IDI effectively measures the degree of unlearning across various datasets and architectures, providing a reliable tool for evaluating strong unlearning in DNNs.
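
The paper's IDI rests on mutual information between intermediate features and the forget labels; a common variational proxy estimates I(Z; Y) as H(Y) minus the held-out cross-entropy of a probe classifier. A hedged sketch of that proxy follows; the linear probe and the final differencing against a retrained model are our assumptions:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

def probe_mi(features, labels):
    """Variational proxy for I(Z; Y) in nats: H(Y) minus the held-out
    cross-entropy of a linear probe predicting forget labels from features.
    Assumes integer class labels."""
    Xtr, Xte, ytr, yte = train_test_split(features, labels, test_size=0.3,
                                          random_state=0)
    probe = LogisticRegression(max_iter=1000).fit(Xtr, ytr)
    proba = probe.predict_proba(Xte)
    col = np.searchsorted(probe.classes_, yte)       # map labels to proba columns
    ce = -np.mean(np.log(proba[np.arange(len(yte)), col] + 1e-12))
    freq = np.bincount(ytr) / len(ytr)
    h_y = -np.sum(freq[freq > 0] * np.log(freq[freq > 0]))
    return max(h_y - ce, 0.0)

# An IDI-flavoured comparison would contrast the information a feature layer
# of the unlearned model retains about the forget labels with a retrained one:
# idi = probe_mi(unlearned_feats, y_forget) - probe_mi(retrained_feats, y_forget)
```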

415A Contrastive Teacher-Student Framework for Novelty Detection under Style Shifts

[openreview] [pdf]

Abstract There have been several efforts to improve Novelty Detection (ND) performance. However, ND methods often suffer significant performance drops under minor distribution shifts caused by changes in the environment, known as style shifts. This challenge arises from the ND setup, where the absence of out-of-distribution (OOD) samples during training causes the detector to be biased toward the dominant style features in the in-distribution (ID) data. As a result, the model mistakenly learns to correlate style with core features, using this shortcut for detection. Robust ND is crucial for real-world applications like autonomous driving and medical imaging, where test samples may have different styles than the training data. Motivated by this, we propose a robust ND method that crafts an auxiliary OOD set with style features similar to the ID set but with different core features. Then, a task-based knowledge distillation strategy is utilized to distinguish core features from style features and help our model rely on core features for discriminating crafted OOD and ID sets. We verified the effectiveness of our method through extensive experimental evaluations on several datasets, including synthetic and real-world benchmarks, against nine different ND methods.

416T-Graphormer: Using Transformers for Spatiotemporal Forecasting

[openreview] [pdf]

Abstract Time series data is ubiquitous and appears in all fields of study. In multivariate time series, observations are interconnected both temporally and across components. For instance, in traffic flow analysis, traffic speeds at different intersections exhibit complex spatiotemporal correlations. This dual structure presents unique challenges for modelling. Most existing forecasting methods address this by learning the spatial and temporal dependencies separately. Here, we propose Temporal Graphormer (T-Graphormer), a transformer-based method that models spatiotemporal correlations directly. By extending the Graphormer architecture over time, each node is updated based on all other nodes within the historical context window, allowing the model to learn powerful representations. We demonstrate the efficacy of T-Graphormer by evaluating it on two real-world traffic prediction benchmarking datasets. Compared to state-of-the-art methods, our method shows a reduction in root mean squared error (RMSE) by up to 10% and mean absolute percentage error (MAPE) by up to 10%.

417Hydra-MDP++: Advancing End-to-End Driving via Hydra-Distillation with Expert-Guided Decision Analysis

[openreview] [pdf]

Abstract We introduce Hydra-MDP++, a novel end-to-end autonomous driving framework that integrates rule-based and neural planners by learning from human demonstrations and distilling knowledge from rule-based experts. We propose a teacher-student knowledge distillation framework with a multi-head student decoder that integrates feedback from rule-based expert teachers. The student model achieves state-of-the-art performance on the NAVSIM benchmark with a tiny image encoder. Moreover, to address limitations in existing evaluation metrics, we expand the teacher model to include traffic light compliance, lane-keeping ability, and extended comfort. This is intended to ensure a more robust decision synthesis in driving. Hydra-MDP++ demonstrates robust and efficient performance across diverse driving scenarios, achieving a 91.0% drive score on NAVSIM by simply scaling the image encoder. Our work contributes to developing more reliable and adaptable autonomous driving systems that combine the strengths of rule-based and neural planning approaches.

418GRADIENT-OPTIMIZED CONTRASTIVE LEARNING

[openreview] [pdf]

Abstract Contrastive learning is a crucial technique in representation learning, producing robust embeddings by distinguishing between similar and dissimilar pairs. In this paper, we introduce a novel framework, Gradient-Optimized Contrastive Learning (GOAL), which enhances network training by optimizing gradient updates during backpropagation as a bilevel optimization problem. Our approach offers three key insights that set it apart from existing methods: (1) Contrastive learning can be seen as an approximation of a one-class support vector machine (OC-SVM) using multiple neural tangent kernels (NTKs) in the network’s parameter space; (2) Hard triplet samples are vital for defining support vectors and outliers in OC-SVMs within NTK spaces, with their difficulty measured using Lagrangian multipliers; (3) Contrastive losses like InfoNCE provide efficient yet dense approximations of sparse Lagrangian multipliers by implicitly leveraging gradients. To address the computational complexity of GOAL, we propose a novel contrastive loss function, Sparse InfoNCE (SINCE), which improves the Lagrangian multiplier approximation by incorporating hard triplet sampling into InfoNCE. Our experimental results demonstrate the effectiveness and efficiency of SINCE in tasks such as image classification and point cloud completion. Demo code is attached in the supplementary file.

419G-Transformer for Conditional Average Potential Outcome Estimation over Time

[openreview] [pdf]

Abstract Estimating potential outcomes for treatments over time based on observational data is important for personalized decision-making in medicine. Yet, existing neural methods for this task either (1) do not perform proper adjustments for time-varying confounders, or (2) suffer from large estimation variance. In order to address both limitations, we introduce the G-transformer (GT). Our GT is a novel, neural end-to-end model which adjusts for time-varying confounders, and provides low-variance estimation of conditional average potential outcomes (CAPOs) over time. Specifically, our GT is the first neural model to perform regression-based iterative G-computation for CAPOs in the time-varying setting. We evaluate the effectiveness of our GT across various experiments. In sum, this work represents a significant step towards personalized decision-making from electronic health records.

420Looking Beyond the Top-1: Transformers Determine Top Tokens in Order

[openreview] [pdf]

Abstract Understanding the inner workings of Transformers is crucial for achieving more accurate and efficient predictions. In this work, we analyze the computation performed by Transformers in the layers after the top-1 prediction has become fixed, which has been previously referred to as the “saturation event”. We expand the concept of saturation events for top-k tokens, demonstrating that similar saturation events occur across language, vision, and speech models. We find that these saturation events happen in order of the corresponding tokens’ ranking, i.e., the model first decides on the top ranking token, then the second highest ranking token, and so on. This phenomenon seems intrinsic to the Transformer architecture, occurring across different architectural variants (decoder-only, encoder-only, and to a lesser extent full-Transformer), and even in untrained Transformers. We propose an underlying mechanism of task transition for this sequential saturation, where task k corresponds to predicting the k-th most probable token, and the saturation events are in fact discrete transitions between the tasks. In support of this we show that it is possible to predict the current task from hidden layer embedding. Furthermore, using an intervention method we demonstrate that we can cause the model to switch from one task to the next. Finally, leveraging our findings, we introduce a novel token-level early-exit strategy, which surpasses existing methods in balancing performance and efficiency.
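
A rough sketch of how one might detect per-rank saturation layers with a logit-lens readout; decoding early layers through the unembedding matrix and the backwards walk are our assumptions, not the paper's exact protocol:

```python
import torch

@torch.no_grad()
def saturation_layers(hidden_states, unembed, k=3):
    """For one token position, estimate the layer at which each of the final
    top-k tokens 'saturates' (stops changing) under a logit-lens readout.

    hidden_states: list of (d,) tensors, one per layer (e.g. from a HF model
                   called with output_hidden_states=True)
    unembed:       (vocab, d) output embedding matrix used to decode layers
    """
    ranked = [torch.topk(h @ unembed.T, k).indices for h in hidden_states]
    final, sat = ranked[-1], []
    for r in range(k):
        layer = len(ranked) - 1
        # Walk backwards while the rank-r prediction already matches the final one
        while layer > 0 and ranked[layer - 1][r] == final[r]:
            layer -= 1
        sat.append(layer)
    return sat  # the paper's finding predicts sat[0] <= sat[1] <= ... <= sat[k-1]
```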

421Repulsive Latent Score Distillation for Solving Inverse Problems

[openreview] [pdf]

Abstract Score Distillation Sampling (SDS) has been pivotal for leveraging pre-trained diffusion models in downstream tasks such as inverse problems, but it faces two major challenges: (i) mode collapse and (ii) latent space inversion, which become more pronounced in high-dimensional data. To address mode collapse, we introduce a novel variational framework for posterior sampling. Utilizing the Wasserstein gradient flow interpretation of SDS, we propose a multimodal variational approximation with a repulsion mechanism that promotes diversity among particles by penalizing pairwise kernel-based similarity. This repulsion acts as a simple regularizer, encouraging a more diverse set of solutions. To mitigate latent space ambiguity, we extend this framework with an augmented variational distribution that disentangles the latent and data spaces. This repulsive augmented formulation balances computational efficiency, quality, and diversity. Extensive experiments on linear and nonlinear inverse tasks with high-resolution images (512×512) using pre-trained Stable Diffusion models demonstrate the effectiveness of our approach.
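
A small sketch of the kernel-based repulsion term under stated assumptions: an RBF kernel with a median-heuristic bandwidth, and a hypothetical weighting against the SDS gradient:

```python
import math
import torch

def similarity_grad(particles, bandwidth=None):
    """Gradient (w.r.t. each particle) of the summed pairwise RBF similarity;
    subtracting it in the update pushes particles apart (the repulsion term)."""
    x = particles.flatten(1)                     # (n, d)
    sq = torch.cdist(x, x) ** 2
    if bandwidth is None:                        # median heuristic (sketch-level)
        bandwidth = sq.median() / max(math.log(x.shape[0]), 1.0)
    k = torch.exp(-sq / (2 * bandwidth))         # (n, n) kernel matrix
    diff = x.unsqueeze(1) - x.unsqueeze(0)       # (n, n, d): x_i - x_j
    grad = -(k.unsqueeze(-1) * diff).sum(1) / bandwidth
    return grad.view_as(particles)

# Hypothetical particle update combining score distillation with repulsion:
# particles = particles - lr * (sds_grad + gamma * similarity_grad(particles))
```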

422Overcoming Lookback Window Limitations: Exploring Longer Windows in Long-Term Time Series Forecasting

[openreview] [pdf]

Abstract Long-term time series forecasting (LTSF) aims to predict future trends based on historical data. While longer lookback windows theoretically provide more comprehensive insights, current Transformer-based models face the Lookback Window Limitation (LWL). On one hand, longer windows introduce redundant information, which can hinder model learning. On the other hand, Transformers tend to overfit temporal noise rather than extract meaningful temporal information when dealing with longer sequences, compounded by their quadratic complexity. In this paper, we aim to overcome LWL, enabling models to leverage more historical information for improved performance. Specifically, to mitigate information redundancy, we introduce the Information Bottleneck Filter (IBF), which applies information bottleneck theory to extract essential subsequences from the input. Additionally, to address the limitations of the Transformer architecture in handling long sequences, we propose the Hybrid-Transformer-Mamba (HTM), which combines the linear complexity and long-range modeling capabilities of Mamba with the Transformer’s strength in modeling short sequences. We integrate these two model-agnostic modules into various existing methods and conduct experiments on seven datasets. The results demonstrate that incorporating these modules effectively overcomes the lookback window limitations. Notably, by combining them with the Patch strategy, we design PIH (Patch-IBF-HTM), successfully extending the window length to 1024 (a significantly larger window than previously achieved) and achieving state-of-the-art results, highlighting the potential of exploring even longer windows.

423Does learning the right latent variables necessarily improve in-context learning?

[openreview] [pdf]

Abstract Large autoregressive models like Transformers can solve tasks through in-context learning (ICL) without learning new weights, suggesting avenues for efficiently solving new tasks. For many tasks, e.g., linear regression, the data factorizes: examples are independent given a task latent that generates the data, e.g., linear coefficients. While an optimal predictor leverages this factorization by inferring task latents, it is unclear if Transformers implicitly do so or if they instead exploit heuristics and statistical shortcuts enabled by attention layers. Both scenarios have inspired active ongoing work. In this paper, we systematically investigate the effect of explicitly inferring task latents. We minimally modify the Transformer architecture with a bottleneck designed to prevent shortcuts in favor of more structured solutions, and then compare performance against standard Transformers across various ICL tasks. Contrary to intuition and some recent works, we find little discernible difference between the two; biasing towards task-relevant latent variables does not lead to better out-of-distribution performance, in general. Curiously, we find that while the bottleneck effectively learns to extract latent task variables from context, downstream processing struggles to utilize them for robust prediction. Our study highlights the intrinsic limitations of Transformers in achieving structured ICL solutions that generalize, and shows that while inferring the right latents aids interpretability, it is not sufficient to alleviate this problem.

424Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

[openreview] [pdf]

Abstract ControlNets are widely used for adding spatial control to text-to-image diffusion models. However, when it comes to controllable video generation, ControlNets cannot be directly integrated into new backbones due to feature space mismatches, and training ControlNets for new backbones can be a significant burden for many users. Furthermore, applying ControlNets independently to different frames can not effectively maintain object temporal consistency. To address these challenges, we introduce Ctrl-Adapter, an efficient and versatile framework that adds diverse controls to any image/video diffusion models through the adaptation of pretrained ControlNets. Ctrl-Adapter offers strong and diverse capabilities, including image and video control, sparse-frame video control, fine-grained patch-level multi-condition control, zero-shot adaptation to unseen conditions, and supports a variety of downstream tasks beyond spatial control, including video editing, video style transfer, and text-guided motion control. With six diverse U-Net/DiT-based image/video diffusion models (SDXL, PixArt-α, I2VGen-XL, SVD, Latte, Hotshot-XL), Ctrl-Adapter matches the performance of pretrained ControlNets on COCO and achieves the state-of-the-art on DAVIS 2017 with significantly lower computation (< 10 GPU hours). We provide video examples in https://ctrladapterexamples.github.io and code in the supplementary material.

[openreview] [pdf]

Abstract State-of-the-art link prediction (LP) models demonstrate impressive benchmark results. However, popular benchmark datasets often assume that training, validation, and testing samples are representative of the overall dataset distribution. In real-world situations, this assumption is often incorrect; since uncontrolled factors lead to the problem where new dataset samples come from different distributions than training samples. The vast majority of recent work focuses on dataset shift affecting node- and graph-level tasks, largely ignoring link-level tasks. To bridge this gap, we introduce a novel splitting strategy, known as LPShift, which utilizes structural properties to induce a controlled distribution shift. We verify the effect of LPShift through empirical evaluation of SOTA LP methods on 16 LPShift generated splits of Open Graph Benchmark (OGB) datasets. When benchmarked with LPShift datasets, GNN4LP methods frequently generalize worse than heuristics or basic GNNs. Furthermore, LP-specific generalization techniques do little to improve performance under LPShift. Finally, further analysis provides insight on why LP models lose much of their architectural advantages under LPShift.

[openreview] [pdf]

Abstract Enhancing the capability of large language models (LLMs) in reasoning has gained significant attention in recent years. Previous studies have demonstrated the effectiveness of various prompting strategies in aiding LLMs in reasoning (called “reasoning actions”), such as step-by-step thinking, reflecting before answering, solving with programs, and their combinations. However, these approaches often applied static, predefined reasoning actions uniformly to all questions, without considering the specific characteristics of each question or the capability of the task-solving LLM. In this paper, we propose DOTS, an approach enabling LLMs to reason Dynamically via Optimal reasoning Trajectories Search, tailored to the specific characteristics of each question and the inherent capability of the task-solving LLM. Our approach involves three key steps: i) defining atomic reasoning action modules that can be composed into various reasoning action trajectories; ii) searching for the optimal action trajectory for each training question through iterative exploration and evaluation for the specific task-solving LLM; and iii) using the collected optimal trajectories to train an LLM to plan for the reasoning trajectories of unseen questions. In particular, we propose two learning paradigms, i.e., fine-tuning an external LLM as a planner to guide the task-solving LLM, or directly fine-tuning the task-solving LLM with an internalized capability for reasoning actions planning. Our experiments across eight reasoning tasks show that our method consistently outperforms static reasoning techniques and the vanilla instruction tuning approach. Further analysis reveals that our method enables LLMs to adjust their computation based on problem complexity, allocating deeper thinking and reasoning to harder problems.

427Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning

[openreview] [pdf]

Abstract Autoregressive language models, despite their impressive capabilities, struggle with complex reasoning and long-term planning tasks. We introduce discrete diffusion models as a novel solution to these challenges. Through the lens of subgoal imbalance, we demonstrate how diffusion models effectively learn difficult subgoals that elude autoregressive approaches. We propose Multi-granularity Diffusion Modeling (MDM), which prioritizes subgoals based on difficulty during learning. On complex tasks like Countdown, Sudoku, and Boolean Satisfiability Problems, MDM significantly outperforms autoregressive models without using search techniques. For instance, MDM achieves 91.5% and 100% accuracy on Countdown and Sudoku, respectively, compared to 45.8% and 20.7% for autoregressive models. Our work highlights the potential of diffusion-based approaches in advancing AI capabilities for sophisticated language understanding and problem-solving tasks.

428Training on more Reachable Tasks for Generalisation in Reinforcement Learning

[openreview] [pdf]

Abstract In multi-task reinforcement learning, agents train on a fixed set of tasks and have to generalise to new ones. Recent work has shown that increased exploration improves this generalisation, but it remains unclear why exactly that is. In this paper, we introduce the concept of reachability in multi-task reinforcement learning and show that an initial exploration phase increases the number of reachable tasks the agent is trained on. This, and not the increased exploration, is responsible for the improved generalisation, even to unreachable tasks. Inspired by this, we propose a novel method Explore-Go that implements such an exploration phase at the beginning of each episode. Explore-Go only modifies the way experience is collected and can be used with most existing on-policy or off-policy reinforcement learning algorithms. We demonstrate the effectiveness of our method when combined with some popular algorithms and show an increase in generalisation performance across several environments.
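
A minimal sketch of such an exploration phase for a Gymnasium-style environment; the random prefix length and the reset-on-termination handling are our illustrative choices:

```python
import numpy as np

def collect_episode(env, policy, max_explore=50, rng=None):
    """Explore-Go-style data collection (sketch): a purely exploratory prefix
    of random length precedes the policy rollout. Only collection changes, so
    any on-/off-policy RL algorithm can train on the returned transitions."""
    rng = rng or np.random.default_rng()
    obs, _ = env.reset()
    # Pure-exploration phase: random actions move the agent to varied states
    for _ in range(int(rng.integers(0, max_explore + 1))):
        obs, _, terminated, truncated, _ = env.step(env.action_space.sample())
        if terminated or truncated:
            obs, _ = env.reset()
    # Normal rollout from wherever exploration ended
    transitions, done = [], False
    while not done:
        action = policy(obs)
        next_obs, reward, terminated, truncated, _ = env.step(action)
        transitions.append((obs, action, reward, next_obs))
        obs, done = next_obs, terminated or truncated
    return transitions
```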

429An Online Learning Theory of Trading-Volume Maximization

[openreview] [pdf]

Abstract We explore brokerage between traders in an online learning framework. At any round t, two traders meet to exchange an asset, provided the exchange is mutually beneficial. The broker proposes a trading price, and each trader tries to sell their asset or buy the asset from the other party, depending on whether the price is higher or lower than their private valuations. A trade happens if one trader is willing to sell and the other is willing to buy at the proposed price. Previous work provided guidance to a broker aiming at enhancing traders’ total earnings by maximizing the gain from trade, defined as the sum of the traders’ net utilities after each interaction. This classical notion of reward can be highly unfair to traders with small profit margins, and far from the real-life utility of the broker. For these reasons, we investigate how the broker should behave to maximize the trading volume, i.e., the total number of trades. We model the traders’ valuations as an i.i.d. process with an unknown distribution. If the traders’ valuations are revealed after each interaction (full feedback), and the traders’ valuations cumulative distribution function (cdf) is continuous, we provide an algorithm achieving logarithmic regret and show its optimality up to constants. If only their willingness to sell or buy at the proposed price is revealed after each interaction (2-bit feedback), we provide an algorithm achieving poly-logarithmic regret when the traders’ valuations cdf is Lipschitz and show its near-optimality. We complement our results by analyzing the implications of dropping the regularity assumptions on the unknown traders’ valuations cdf. If we drop the continuous cdf assumption, the regret rate degrades to Θ(√T) in the full-feedback case, where T is the time horizon. If we drop the Lipschitz cdf assumption, learning becomes impossible in the 2-bit feedback case.
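
For intuition on the full-feedback setting: with i.i.d. valuations and a continuous cdf F, a trade at price p occurs with probability 2F(p)(1−F(p)), which is maximized at the median. A toy simulation of the resulting propose-the-empirical-median broker; the Beta valuation distribution is an arbitrary choice:

```python
import numpy as np

rng = np.random.default_rng(0)
vals, trades = [], 0
for t in range(1, 10_001):
    # Propose the empirical median: the per-round trade probability
    # 2F(p)(1-F(p)) peaks at the median of the valuation distribution.
    price = np.median(vals) if vals else 0.5
    v_a, v_b = rng.beta(2, 5, size=2)            # hypothetical i.i.d. valuations
    if min(v_a, v_b) <= price <= max(v_a, v_b):  # one sells, the other buys
        trades += 1
    vals += [v_a, v_b]                           # full feedback: both revealed
print(f"volume after {t} rounds: {trades}")
```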

431Accelerate High-Quality Diffusion Models with Inner Loop Feedback

[openreview] [pdf]

Abstract We propose Inner Loop Feedback (ILF), a novel approach to accelerate diffusion models’ inference. ILF trains a lightweight module to predict future features in the denoising process by leveraging the outputs from a chosen diffusion backbone block at a given time step. This approach exploits two key intuitions: (1) the outputs of a given block at adjacent time steps are similar, and (2) performing partial computations for a step imposes a lower burden on the model than skipping the step entirely. Our method is highly flexible, since we find that the feedback module itself can simply be a block from the diffusion backbone, with all settings copied. Its influence on the diffusion forward pass can be tempered with a learnable scaling factor initialized at zero. We train this module using distillation losses; however, unlike some prior work where a full diffusion backbone serves as the student, our model freezes the backbone, training only the feedback module. While many efforts to optimize diffusion models focus on achieving acceptable image quality in extremely few steps (1-4 steps), our emphasis is on matching best-case results (typically achieved in 20 steps) while significantly reducing runtime. ILF achieves this balance effectively, demonstrating strong performance for both class-to-image generation with diffusion transformers (DiT) and text-to-image generation with the DiT-based PixArt-alpha and PixArt-sigma. The quality of ILF’s 1.7x-1.8x speedups is confirmed by FID, CLIP score, CLIP Image Quality Assessment, ImageReward, and qualitative comparisons.
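
A sketch of the "copied block with a zero-initialized scale" idea; the exact wiring into the denoising loop (which block is tapped and how its prediction substitutes future computation) is not specified in the abstract and is our simplification:

```python
import copy
import torch
import torch.nn as nn

class InnerLoopFeedback(nn.Module):
    """Sketch of ILF's feedback module: a clone of a chosen backbone block with
    all settings copied, gated by a scale initialized at zero so training
    starts from the unmodified diffusion forward pass."""
    def __init__(self, backbone_block):
        super().__init__()
        self.block = copy.deepcopy(backbone_block)  # trainable copy; backbone frozen
        self.scale = nn.Parameter(torch.zeros(1))   # zero-init: no influence at start

    def forward(self, block_output):
        # Predict the block's features at the next timestep from the current ones
        return block_output + self.scale * self.block(block_output)
```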

432Entropy-Based Uncertainty Modeling for Trajectory Prediction in Autonomous Driving

[openreview] [pdf]

Abstract In autonomous driving, accurate motion prediction is essential for safe and efficient motion planning. To ensure safety, planners must rely on reliable uncertainties in the future behavior of surrounding agents, yet this aspect has received limited attention. This paper addresses the problem of uncertainty modeling in trajectory prediction. We adopt a holistic approach that focuses on uncertainty quantification, decomposition, and the influence of model composition. Our method is based on a theoretically grounded, information-theoretic approach to measuring uncertainty, allowing us to decompose total uncertainty into its aleatoric and epistemic components. We conduct extensive experiments on the nuScenes dataset to assess how different model architectures and configurations affect uncertainty quantification and model robustness. Our analysis thoroughly explores the uncertainty quantification capabilities of several state-of-the-art prediction models, examining the relationship between uncertainty and prediction error in both in- and out-of-distribution scenarios, as well as robustness in out-of-distribution settings.
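
The standard information-theoretic decomposition the abstract alludes to, for an ensemble of predictive distributions: total predictive entropy splits into an aleatoric term (mean member entropy) and an epistemic term (their difference, a mutual information). A minimal sketch:

```python
import numpy as np

def decompose_uncertainty(member_probs):
    """Entropy-based uncertainty split for (M, K) member probabilities over
    K candidate trajectories/modes from M ensemble members or MC samples.

    total     = entropy of the averaged predictive distribution
    aleatoric = average entropy of the members (irreducible data noise)
    epistemic = total - aleatoric (mutual information; model uncertainty)
    """
    mean = member_probs.mean(axis=0)
    total = -np.sum(mean * np.log(mean + 1e-12))
    aleatoric = -np.mean(np.sum(member_probs * np.log(member_probs + 1e-12),
                                axis=1))
    return total, aleatoric, total - aleatoric

# e.g. 5 ensemble members over 6 candidate trajectories
probs = np.random.dirichlet(np.ones(6), size=5)
print(decompose_uncertainty(probs))
```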

433Ensembling Diffusion Models via Adaptive Feature Aggregation

[openreview] [pdf]

Abstract The success of the text-guided diffusion model has inspired the development and release of numerous powerful diffusion models within the open-source community. These models are typically fine-tuned on various expert datasets, showcasing diverse denoising capabilities. Leveraging multiple high-quality models to produce stronger generation ability is valuable, but has not been extensively studied. Existing methods primarily adopt parameter merging strategies to produce a new static model. However, they overlook the fact that the divergent denoising capabilities of the models may dynamically change across different states, such as when experiencing different prompts, initial noises, denoising steps, and spatial locations. In this paper, we propose a novel ensembling method, Adaptive Feature Aggregation (AFA), which dynamically adjusts the contributions of multiple models at the feature level according to various states (i.e., prompts, initial noises, denoising steps, and spatial locations), thereby keeping the advantages of multiple diffusion models, while suppressing their disadvantages. Specifically, we design a lightweight Spatial-Aware Block-Wise (SABW) feature aggregator that adaptively aggregates the block-wise intermediate features from multiple U-Net denoisers into a unified one. The core idea lies in dynamically producing an individual attention map for each model’s features by comprehensively considering various states. It is worth noting that only SABW is trainable, with about 50 million parameters, while the other models are frozen. Both the quantitative and qualitative experiments demonstrate the effectiveness of our proposed method.
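
A rough sketch of a spatially aware block-wise aggregator: a tiny trainable head scores each frozen denoiser's block features per spatial location, here conditioned only on a timestep embedding, which simplifies the paper's full state conditioning:

```python
import torch
import torch.nn as nn

class BlockwiseAggregator(nn.Module):
    """Sketch of SABW-style aggregation across M frozen denoisers: produce one
    spatial attention map per model and blend the block features accordingly."""
    def __init__(self, channels, t_dim=128):
        super().__init__()
        self.t_proj = nn.Linear(t_dim, channels)
        self.score = nn.Conv2d(channels, 1, kernel_size=1)

    def forward(self, feats, t_emb):
        # feats: (M, B, C, H, W) block outputs from M U-Nets; t_emb: (B, t_dim)
        cond = self.t_proj(t_emb)[:, :, None, None]          # (B, C, 1, 1)
        logits = torch.stack([self.score(feats[m] + cond)
                              for m in range(feats.shape[0])])
        attn = torch.softmax(logits, dim=0)                  # (M, B, 1, H, W)
        return (attn * feats).sum(dim=0)                     # unified feature map
```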

434Prevalence of Negative Transfer in Continual Reinforcement Learning: Analyses and a Simple Baseline

[openreview] [pdf]

Abstract We argue that the negative transfer problem, which occurs when a new task to learn arrives, is an important issue that must not be overlooked when developing effective Continual Reinforcement Learning (CRL) algorithms. Through comprehensive experimental validation, we demonstrate that this issue frequently exists in CRL and cannot be effectively addressed by several recent works on either mitigating the plasticity loss of RL agents or enhancing positive transfer in the CRL scenario. To that end, we develop Reset & Distill (R&D), a simple yet highly effective baseline method, to overcome the negative transfer problem in CRL. R&D combines a strategy of resetting the agent’s online actor and critic networks to learn a new task with an offline learning step that distills knowledge from the online actor and the previous expert’s action probabilities. We carried out extensive experiments on a long sequence of Meta-World tasks and show that our simple baseline method consistently outperforms recent approaches, achieving significantly higher success rates across a range of tasks. Our findings highlight the importance of considering negative transfer in CRL and emphasize the need for robust strategies like R&D to mitigate its detrimental effects.

435Inverse Flow and Consistency Models

[openreview] [pdf]

Abstract Inverse generation problems, such as denoising without ground truth observations, are a critical challenge in many scientific inquiries and real-world applications. While recent advances in generative models like diffusion models, conditional flow matching, and consistency models have achieved impressive results by casting generation as denoising problems, they cannot be directly used for inverse generation without access to clean data. Here we introduce Inverse Flow (IF), a novel framework that enables using these generative models for inverse generation problems, including denoising without ground truth. Inverse Flow can be flexibly applied to nearly any continuous noise distribution and allows complex dependencies. We propose two algorithms for learning Inverse Flows, Inverse Flow Matching (IFM) and Inverse Consistency Model (ICM). Notably, to derive the computationally efficient, simulation-free inverse consistency model objective, we generalized consistency training to any forward diffusion process or conditional flow, which has applications beyond denoising. We demonstrate the effectiveness of IF on synthetic and real datasets, outperforming prior approaches while enabling noise distributions that previous methods cannot support. Finally, we showcase applications of our techniques to fluorescence microscopy and single-cell genomics data, highlighting IF’s utility in scientific problems. This work opens up the use of powerful generative models for denoising.

436Understanding the Stability-based Generalization of Personalized Federated Learning

[openreview] [pdf]

Abstract Despite great achievements in algorithm design for Personalized Federated Learning (PFL), research on the theoretical analysis of generalization is still in its early stages. Some recent theoretical results have investigated the generalization performance of personalized models under the problem setting and hypotheses in the convex condition, which do not consider the actual iteration performance during non-convex training. To further understand the testing performance from the theoretical perspective, we propose the first algorithm-matter generalization analysis with uniform stability for the typical PFL method Partial Model Personalization on smooth and non-convex objectives. In an attempt to distinguish the shared and personalized errors, we decouple the shared aggregation and the local fine-tuning progress and illustrate the interaction mechanism between the shared and personalized variables. The algorithm-matter generalization bounds analyze the impact of hyperparameters such as learning steps and stepsizes, as well as the communication modes in both Centralized and Decentralized PFL (C-PFL and D-PFL), and conclude that C-PFL generalizes better than D-PFL. Combined with the convergence errors, we then obtain the excess risk analysis and establish a better early stopping point for the optimal population risk of PFL. Promising experiments on the CIFAR dataset also corroborate our theoretical results.

437Dual-Model Defense: Safeguarding Diffusion Models from Membership Inference Attacks through Disjoint Data Splitting

[openreview] [pdf]

Abstract Diffusion models have demonstrated remarkable capabilities in image synthesis, but their recently proven vulnerability to Membership Inference Attacks (MIAs) poses a critical privacy concern. This paper introduces two novel and efficient approaches (DualMD and DistillMD) to protect diffusion models against MIAs while maintaining high utility. Both methods are based on training two separate diffusion models on disjoint subsets of the original dataset. DualMD then employs a private inference pipeline that utilizes both models. This strategy significantly reduces the risk of black-box MIAs by limiting the information any single model contains about individual training samples. The dual models can also generate “soft targets” to train a private student model in DistillMD, enhancing privacy guarantees against all types of MIAs. Extensive evaluations of DualMD and DistillMD against state-of-the-art MIAs across various datasets in white-box and black-box settings demonstrate their effectiveness in substantially reducing MIA success rates while preserving competitive image generation performance. Notably, our experiments reveal that DistillMD not only defends against MIAs but also mitigates model memorization, indicating that both vulnerabilities stem from overfitting and can be addressed simultaneously with our unified approach.

438Constrained Exploitability Descent: Finding Mixed-Strategy Nash Equilibrium by Offline Reinforcement Learning

[openreview] [pdf]

Abstract This paper presents Constrained Exploitability Descent (CED), a novel model-free offline reinforcement learning algorithm for solving adversarial Markov games. CED is a game-theoretic approach combined with policy constraint methods from offline RL. While policy constraints can perturb the optimal pure-strategy solutions in single-agent scenarios, we find this side effect can be mitigated when it comes to solving adversarial games, where the optimal policy can be a mixed-strategy Nash equilibrium. We theoretically prove that, under the uniform coverage assumption on the dataset, CED converges to a stationary point in deterministic two-player zero-sum Markov games. The min-player policy at the stationary point satisfies the necessary condition for making up an exact mixed-strategy Nash equilibrium, even when the offline dataset is fixed and finite. Compared to the model-based method of Exploitability Descent that optimizes the max-player policy, our convergence result no longer relies on the generalized gradient. Experiments in matrix games, a tree-form game, and an infinite-horizon soccer game verify that a single run of CED leads to an optimal min-player policy when the practical offline data guarantees uniform coverage. Besides, CED achieves significantly lower NashConv compared to an existing pessimism-based method and can gradually improve the behavior policy even under non-uniform coverage.

439Attention Is All You Need For Mixture-of-Depths Routing

[openreview] [pdf]

Abstract Advancements in deep learning are driven by training models with increasingly larger numbers of parameters, which in turn heightens the computational demands. To address this issue, Mixture-of-Depths (MoD) models have been proposed to dynamically assign computations only to the most relevant parts of the inputs, thereby enabling the deployment of large-parameter models with high efficiency during inference and training. These MoD models utilize a routing mechanism to determine which tokens should be processed by a layer, or skipped. However, conventional MoD models employ additional network layers specifically for the routing which are difficult to train, and add complexity and deployment overhead to the model. In this paper, we introduce a novel attention-based routing mechanism, A-MoD, that leverages the existing attention map of the preceding layer for routing decisions within the current layer. Compared to standard routing, A-MoD allows for more efficient training as it introduces no additional trainable parameters and can be easily adapted from pretrained transformer models. Furthermore, it can increase the performance of the MoD model. For instance, we observe up to 2% higher accuracy on ImageNet and up to 2× faster transfer learning, for the first time showing the benefits of MoD on various computer vision tasks.
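
A minimal sketch of attention-derived routing: score each token by the attention it received in the preceding layer and keep the top fraction for processing; the averaging scheme is our assumption:

```python
import torch

def a_mod_route(attn_prev, capacity=0.5):
    """Attention-based MoD routing sketch: score each token by the average
    attention it received in the preceding layer and keep the top fraction.

    attn_prev: (B, heads, T, T) attention weights from the previous layer.
    Returns (B, k) indices of tokens the current layer should process.
    """
    # Mean over heads, then over query positions -> attention *received* per token
    importance = attn_prev.mean(dim=1).mean(dim=1)   # (B, T)
    k = max(1, int(capacity * importance.shape[-1]))
    return importance.topk(k, dim=-1).indices

# Tokens outside the returned indices skip the layer via the residual connection.
```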

440Improving Autonomous AI Agents with Reflective Tree Search and Self-Learning

[openreview] [pdf]

Abstract Autonomous agents have demonstrated significant potential in automating complex multistep decision-making tasks. However, even state-of-the-art vision-language models (VLMs), such as GPT-4o, still fall short of human-level performance, particularly in intricate web environments and long-horizon planning tasks. To address these limitations, we introduce Reflective Monte Carlo Tree Search (R-MCTS), a novel test-time algorithm designed to enhance the ability of AI agents, e.g., powered by GPT-4o, to explore decision space on the fly. R-MCTS extends traditional MCTS by 1) incorporating contrastive reflection, allowing agents to learn from past interactions and dynamically improve their search efficiency; and 2) using multi-agent debate to provide reliable state evaluation. Moreover, we improve the agent’s performance by fine-tuning GPT-4o through self-learning, using R-MCTS generated tree traversals without any human-provided labels. On the challenging VisualWebArena benchmark, our GPT-4o-based R-MCTS agent achieves a 6% to 30% relative improvement across various tasks compared to the previous state-of-the-art. Additionally, we show that the knowledge gained from test-time search can be effectively transferred back to GPT-4o via fine-tuning. The fine-tuned GPT-4o matches 97% of R-MCTS’s performance while reducing compute usage by a factor of four at test time. Furthermore, qualitative results reveal that the fine-tuned GPT-4o model demonstrates the ability to explore the environment, evaluate a state, and backtrack to viable ones when it detects that the current state cannot lead to success. Moreover, our work demonstrates the compute scaling properties in both training - data collection with R-MCTS - and testing time. These results suggest a promising research direction to enhance VLMs’ reasoning and planning capabilities for agentic applications via test-time search and self-learning.

441RouteLLM: Learning to Route LLMs from Preference Data

[openreview] [pdf]

Abstract Large language models (LLMs) excel at a wide range of tasks, but choosing the right model often involves balancing performance and cost. Powerful models offer better results but are expensive, while smaller models are more cost-effective but less capable. To address this trade-off, we introduce a training framework for learning efficient router models that dynamically select between a stronger and weaker LLM during inference. Our framework leverages human preference data and employs data augmentation techniques to enhance performance. Evaluations on public benchmarks show that our approach can reduce costs by over 2 times without sacrificing response quality. Moreover, our routers exhibit strong generalization capabilities, maintaining performance even when routing between LLMs not included in training. This highlights the potential of our framework to deliver cost-effective, high-performance LLM solutions.
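
A toy sketch of the routing idea: fit a preference-based classifier predicting whether the weak model's answer suffices, then pick a model by thresholding that probability. The synthetic data and logistic probe stand in for the paper's learned routers:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical preference data: prompt embeddings and whether the weak model's
# answer was preferred (1) over the strong model's (0)
rng = np.random.default_rng(0)
X = rng.standard_normal((2000, 64))
y = (X[:, 0] + 0.3 * rng.standard_normal(2000) > 0).astype(int)

router = LogisticRegression(max_iter=1000).fit(X, y)

def route(prompt_emb, threshold=0.6):
    """Send to the cheap model only when the router is confident it suffices;
    raising the threshold trades cost savings for response quality."""
    p_weak_ok = router.predict_proba(prompt_emb[None])[0, 1]
    return "weak" if p_weak_ok >= threshold else "strong"

print(route(X[0]))
```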

442B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

[openreview] [pdf]

Abstract In the absence of extensive human-annotated data for complex reasoning tasks, self-improvement -- where models are trained on their own outputs -- has emerged as a primary method for enhancing performance. Recently, the approach to self-improvement has shifted toward a more dynamic, online fashion through iterative training processes. However, the critical factors underlying the mechanism of these self-improving methods remain poorly understood, such as under what conditions self-improvement is effective, and what are the bottlenecks in the current iterations. In this work, we identify and propose methods to monitor two pivotal factors in this iterative process: (1) the model’s ability to explore and generate high-quality responses among multiple candidates (exploration); and (2) the reliability of external rewards in selecting the best responses from the generated outputs (exploitation). These factors are inherently moving targets throughout the self-improvement cycles, yet their dynamics are rarely discussed in prior research -- It remains unclear what impedes continual model enhancement after only a few iterations. Using mathematical reasoning as a case study, we begin with a quantitative analysis to track the dynamics of exploration and exploitation, discovering that a model’s exploratory capabilities rapidly deteriorate over iterations, and the effectiveness of exploiting external rewards diminishes as well due to shifts in distribution from the original policy. Motivated by these findings, we introduce B-STaR, a Self-Taught Reasoning framework that autonomously adjusts configurations across iterations to Balance exploration and exploitation, thereby optimizing the self-teaching effectiveness based on the current policy model and available rewards. Our experiments in mathematical reasoning demonstrate that B-STaR not only enhances the model’s exploratory capabilities throughout training but also achieves a more effective balance between exploration and exploitation, leading to superior performance. Crucially, this work deconstructs the opaque nature of self-training algorithms, elucidating the interpretable dynamics throughout the process and highlighting current limitations for future research to address.

443Counterfactual Realizability

[openreview] [pdf]

Abstract It is commonly believed that, in a real-world environment, samples can only be drawn from observational and interventional distributions, corresponding to Layers 1 and 2 of the Pearl Causal Hierarchy. Layer 3, representing counterfactual distributions, is believed to be inaccessible almost by definition. However, Bareinboim, Forney, and Pearl (2015) introduced a procedure that allows an agent to sample directly from a counterfactual distribution, leaving open the question of what other counterfactual quantities can be estimated directly via physical experimentation. We resolve this by introducing a formal definition of realizability, the ability to draw samples from a distribution, and then developing a complete algorithm to determine whether an arbitrary counterfactual distribution is realizable given fundamental physical constraints, such as the inability to go back in time and subject the same unit to a different experimental condition. We illustrate the implications of this new framework for counterfactual data collection using motivating examples from causal fairness and causal reinforcement learning. While the baseline approach in these motivating settings typically follows an interventional or observational strategy, we show that a counterfactual strategy provably dominates both.

444How to Evaluate Reward Models for RLHF

[openreview] [pdf]

Abstract Reward models are critical to the LLM fine-tuning pipeline, serving as the proxy reference signal during Reinforcement Learning from Human Feedback (RLHF). As a result, the RLHF-ed model’s success strongly depends on the reward model’s ability to reproduce human preferences with high fidelity. However, this exact dependence is unknown, making it difficult to know which reward model is best. Undergoing a full RLHF training pipeline to directly probe downstream LLM performance, while the gold standard, is completely impractical given the resource-intensive nature of RLHF. To address this, we study downstream RLHF outcomes to create a predictive reward model evaluation. We ground our evaluations with our large-scale human preference and verifiable correctness preference datasets, compiling 12 metrics across 12 domains. To investigate which reward model metrics are most correlated to RLHF outcomes, we launch a full end-to-end RLHF experiment on a large-scale crowdsourced human preference platform to view real reward model downstream performance as ground truth. Ultimately, we compile our data and findings into Preference Proxy Evaluations (PPE), the first reward model benchmark explicitly linked to post-RLHF real-world human preference performance which we will open-source for public use and further development.

445Structured Diffusion Models with Mixture of Gaussians as Prior Distribution

[openreview] [pdf]

Abstract We propose a class of structured diffusion models, in which the prior distribution is chosen as a mixture of Gaussians, rather than a standard Gaussian distribution. The specific mixed Gaussian distribution, as prior, can be chosen to incorporate certain structured information of the data. We develop a simple-to-implement training procedure that smoothly accommodates the use of mixed Gaussian as prior. Theory is provided to quantify the benefits of our proposed models, compared to the classical diffusion models. Numerical experiments with synthetic, image and operational data are conducted to show comparative advantages of our model. Our method is shown to be robust to mis-specifications and in particular suits situations where training resources are limited or faster training in real time is desired.
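
A minimal sketch of the prior change: draw the terminal noise from a mixture of Gaussians chosen to reflect known structure in the data, then run the learned reverse process from it; the two-component example is purely illustrative:

```python
import numpy as np

def sample_mog_prior(n, means, covs, weights, rng=None):
    """Draw terminal noise from a mixture-of-Gaussians prior instead of N(0, I).
    means: (K, d), covs: (K, d, d), weights: (K,) mixture proportions."""
    rng = rng or np.random.default_rng()
    comps = rng.choice(len(weights), size=n, p=weights)
    return np.stack([rng.multivariate_normal(means[k], covs[k]) for k in comps])

# E.g. a two-component prior encoding a known bimodal structure in the data
means = np.array([[-2.0, 0.0], [2.0, 0.0]])
covs = np.stack([np.eye(2), np.eye(2)])
x_T = sample_mog_prior(512, means, covs, weights=np.array([0.5, 0.5]))
# x_T would then seed the learned reverse process in place of standard Gaussian noise.
```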

446DP-SGD for non-decomposable objective functions

[openreview] [pdf]

Abstract Unsupervised pre-training is a common step in developing computer vision models and large language models. In this setting, the absence of labels requires the use of similarity-based loss functions, such as the contrastive loss, that favor minimizing the distance between similar inputs and maximizing the distance between distinct inputs. As privacy concerns mount, training these models using differential privacy has become more important. However, due to how inputs are generated for these losses, one of their undesirable properties is that their L2 sensitivity grows with the batch size. This property is particularly disadvantageous for differentially private training methods, such as DP-SGD. To overcome this issue, we develop a new DP-SGD variant for similarity based loss functions --- in particular, the commonly-used contrastive loss --- that manipulates gradients of the objective function in a novel way to obtain a sensitivity of the summed gradient that is O(1) for batch size n. We test our DP-SGD variant on some CIFAR-10 pre-training and CIFAR-100 finetuning tasks and show that, in both tasks, our method’s performance comes close to that of a non-private model and generally outperforms DP-SGD applied directly to the contrastive loss.
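
For context, a generic per-example clip-and-noise DP-SGD step is sketched below; the paper's actual contribution, a gradient manipulation that keeps the summed-gradient sensitivity O(1) for the non-decomposable contrastive loss, is not reproduced here:

```python
import torch

def dp_sgd_step(params, per_sample_grads, clip_norm=1.0, noise_mult=1.0, lr=0.1):
    """Generic DP-SGD step: per-example clipping plus Gaussian noise.

    per_sample_grads: list of tensors shaped (n, *param_shape), one per param.
    """
    n = per_sample_grads[0].shape[0]
    # Per-example total gradient norm across all parameters
    norms = torch.sqrt(sum(g.flatten(1).pow(2).sum(1) for g in per_sample_grads))
    scale = (clip_norm / (norms + 1e-6)).clamp(max=1.0)
    for p, g in zip(params, per_sample_grads):
        clipped = (g * scale.view(-1, *([1] * (g.dim() - 1)))).sum(0)
        noisy = clipped + noise_mult * clip_norm * torch.randn_like(clipped)
        p.data -= lr * noisy / n
```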

447SATE: A Two-Stage Approach for Performance Prediction in Subpopulation Shift Scenarios

[openreview] [pdf]

Abstract Subpopulation shift refers to the difference in the distribution of subgroups between training and test datasets. When an underrepresented group becomes predominant during testing, it can lead to significant performance degradation, making performance prediction prior to deployment particularly important. Existing performance prediction methods often fail to address this type of shift effectively due to their usage of unreliable model confidence and mis-specified distributional distances. In this paper, we propose a novel performance prediction method specifically designed to tackle subpopulation shifts, called Subpopulation-Aware Two-stage Estimator (SATE). Our approach first estimates the subgroup proportions in the test set by linearly expressing the test embedding with training subgroup embeddings. Then, it predicts the accuracy for each subgroup using the accuracy on augmented training set, aggregating them into an overall performance estimate. We provide theoretical proof of our method’s unbiasedness and consistency, and demonstrate that it outperforms numerous baselines across various datasets, including vision, medical, and language tasks, offering a reliable tool for performance prediction in scenarios involving subpopulation shifts.
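
A hedged sketch of the two stages, assuming subgroup mean embeddings and per-subgroup accuracy estimates are available; nonnegative least squares stands in for the paper's exact estimator:

```python
import numpy as np
from scipy.optimize import nnls

def estimate_performance(test_emb, group_embs, group_accs):
    """SATE-style two-stage sketch: (1) express the mean test embedding as a
    nonnegative combination of training subgroup mean embeddings to estimate
    test subgroup proportions; (2) aggregate per-subgroup accuracy estimates."""
    A = np.stack(group_embs, axis=1)          # (d, G) subgroup mean embeddings
    b = test_emb.mean(axis=0)                 # (d,) mean test embedding
    w, _ = nnls(A, b)                         # nonnegative least squares
    w = w / w.sum()                           # normalize to proportions
    return float(w @ np.asarray(group_accs))  # predicted overall accuracy
```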

448Does Spatial Cognition Emerge in Frontier Models?

[openreview] [pdf]

Abstract Not yet. We present SPACE, a benchmark that systematically evaluates spatial cognition in frontier models. Our benchmark builds on decades of research in cognitive science. It evaluates large-scale mapping abilities that are brought to bear when an organism traverses physical environments, smaller-scale reasoning about object shapes and layouts, and cognitive infrastructure such as spatial attention and memory. For many tasks, we instantiate parallel presentations via text and images, allowing us to benchmark both large language models and large multimodal models. Results suggest that contemporary frontier models fall short of the spatial intelligence of animals, performing near chance level on a number of classic tests of animal cognition.

449Distribution-Dependent Rates for Multi-Distribution Learning

[openreview] [pdf]

Abstract To address the needs of modeling uncertainty in sensitive machine learning applications, the setup of distributionally robust optimization (DRO) seeks good performance uniformly across a variety of tasks. The recent multi-distribution learning (MDL) framework (Awasthi et al., 2023) tackles this objective in a dynamic interaction with the environment, where the learner has sampling access to each target distribution. Drawing inspiration from the field of pure-exploration multi-armed bandits, we provide distribution-dependent guarantees in the MDL regime that scale with suboptimality gaps and result in superior dependence on the sample size when compared to the existing distribution-independent analyses. We investigate two non-adaptive strategies, uniform and non-uniform exploration, and present non-asymptotic regret bounds using novel tools from empirical process theory. Furthermore, we devise an adaptive optimistic algorithm, LCB-DR, that showcases enhanced dependence on the gaps, mirroring the contrast between uniform and optimistic allocation in the multi-armed bandit literature.

450Local Patterns Generalize Better for Novel Anomalies

[openreview] [pdf]

Abstract Video anomaly detection (VAD) aims at identifying novel actions or events which are unseen during training. Existing mainstream VAD techniques typically focus on the global patterns of events but struggle to generalize to novel samples. In this paper, we propose a framework that identifies the local patterns which generalize to novel samples and models the dynamics of local patterns. The capability of extracting spatial local patterns is achieved through a two-stage process involving image-text alignment and cross-modality attention. Generalizable representations are built by focusing on text-informative features that filter out unnecessary visual data variances. To enhance spatial local patterns with temporal clues, we introduce a State Machine Module (SMM) that combines tokens from different moments to improve sentence generation within cross-modality attention. Furthermore, temporal motion estimation complements spatial local patterns to detect anomalies characterized by novel spatial distributions or distinctive dynamics. Extensive experiments on popular benchmark datasets demonstrate the achievement of state-of-the-art performance. Code is available at https://anonymous.4open.science/r/Local-Patterns-Generalize-Better-1E30/.

451State & Image Guidance: Teaching Old Text-to-Video Diffusion Models New Tricks

[openreview] [pdf]

Abstract Current text-to-video (T2V) models have made significant progress in generating high-quality video. However, these models are limited when it comes to generating dynamic video scenes where the description per frame can vary dramatically. Changing the color, shape, position and state of objects in the scene is a challenge that current video models cannot handle. In addition, the lack of a cheap image-based conditioning mechanism limits their creative application. To address these challenges and extend the applicability of T2V models, we propose two innovative approaches: State Guidance and Image Guidance. State Guidance uses advanced guidance mechanisms to control motion dynamics and scene transformation smoothness by navigating the diffusion process between a state triplet <initial state, transition state, final state>. This mechanism enables the generation of dynamic video scenes (Dynamic Scene T2V) and allows control of the speed and the expressiveness of the scene transformation by introducing temporal dynamics via a guidance weight schedule across video frames. Image Guidance enables Zero-Shot Image-to-Video generation (Zero-Shot I2V) by injecting a reference image into the noise predictions of the initial diffusion steps. Furthermore, the combination of State Guidance and Image Guidance allows for zero-shot transitions between two input reference frames of a video (Zero-Shot II2V). Finally, we introduce the novel Dynamic Scene Benchmark to evaluate the ability of the models to generate dynamic video scenes. Extensive experiments show that State Guidance and Image Guidance successfully address the aforementioned challenges and significantly improve the generation capabilities of existing T2V architectures.

452UNIQ: Offline Inverse Q-learning for Avoiding Undesirable Demonstrations

[openreview] [pdf]

Abstract We address the problem of offline learning a policy that avoids undesirable demonstrations. Unlike conventional offline imitation learning approaches that aim to imitate expert or near-optimal demonstrations, our setting involves avoiding undesirable behavior (specified using undesirable demonstrations). To tackle this problem, unlike standard imitation learning where the aim is to minimize the distance between learning policy and expert demonstrations, we formulate the learning task as maximizing a statistical distance, in the space of state-action stationary distributions, between the learning policy and the undesirable policy. This significantly different approach results in a novel training objective that necessitates a new algorithm to address it. Our algorithm, UNIQ, tackles these challenges by building on the inverse Q-learning framework, framing the learning problem as a cooperative (non-adversarial) task. We then demonstrate how to efficiently leverage unlabeled data for practical training. Our method is evaluated on standard benchmark environments, where it consistently outperforms state-of-the-art baselines.

453The Directionality of Optimization Trajectories in Neural Networks

[openreview] [pdf]

Abstract The regularity or implicit bias in neural network optimization has been typically studied via the parameter norms or the landscape curvature, often overlooking the trajectory leading to these parameters. However, properties of the trajectory --- particularly its directionality --- capture critical aspects of how gradient descent navigates the landscape to converge to a solution. In this work, we introduce the notion of a Trajectory Map and derive natural complexity measures that highlight the directional characteristics of optimization trajectories. Our comprehensive analysis across vision and language modeling tasks reveals that (a) the trajectory’s directionality at the macro-level saturates by the initial phase of training, wherein weight decay and momentum play a crucial but understated role; and (b) in subsequent training, trajectory directionality manifests in micro-level behaviors, such as oscillations, for which we also provide a theoretical analysis. This implies that neural optimization trajectories have, overall, a more linear form than a zig-zag one, as evidenced by high directional similarity, especially towards the end. To further hone this point, we show that when the trajectory direction gathers such an inertia, optimization proceeds largely unaltered even if the network is severely decapacitated (by freezing >99% of the parameters) --- thereby demonstrating the potential for significant computational and resource savings without compromising performance.
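
One concrete way to quantify such directionality, in the spirit of the paper's Trajectory Map (the authors' exact complexity measures may differ), is the mean cosine similarity between successive parameter updates:

```python
import numpy as np

def directional_similarity(checkpoints):
    """Mean cosine similarity between successive parameter updates along a
    training trajectory. checkpoints: list of 1-D arrays of flattened
    parameters, in training order."""
    updates = [b - a for a, b in zip(checkpoints, checkpoints[1:])]
    sims = [u @ v / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-12)
            for u, v in zip(updates, updates[1:])]
    return float(np.mean(sims))  # near 1.0 => nearly linear path, near 0 => zig-zag
```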

454TWO STAGES DOMAIN INVARIANT REPRESENTATION LEARNERS SOLVE THE LARGE CO-VARIATE SHIFT IN UNSUPERVISED DOMAIN ADAPTATION WITH TWO DIMENSIONAL DATA DOMAINS

[openreview] [pdf]

Abstract Recent developments in unsupervised domain adaptation (UDA) enable unsupervised machine learning (ML) prediction for target data, which will accelerate real-world applications of ML models such as image recognition in self-driving. Researchers have reported that UDA techniques do not work well under large co-variate shift problems, where e.g. the supervised source data consists of handwritten digits in monotone color while the unsupervised target data consists of colored digits from street views. Thus there is a need for a method that resolves the co-variate shift and transfers source labelling rules under these dynamics. We perform two-stage domain invariant representation learning to bridge the gap between source and target with semantic intermediate data (unsupervised). The proposed method learns domain invariant features simultaneously between source and intermediate and between intermediate and target. This finally achieves good domain invariant representation between source and target, plus task discriminability owing to source labels. This induction on the gradient descent search greatly eases learning convergence in terms of classification performance for target data, even under large co-variate shift. We also derive a theorem for measuring the gap between trained models and unsupervised target labelling rules, which is necessary for optimizing the free parameters. Finally we demonstrate that the proposed method is superior to previous UDA methods on 4 representative ML classification datasets comprising 38 UDA tasks. Our experiments will serve as a basis for challenging UDA problems with large co-variate shift.

455Dual-Branch HNSW Approach with Skip Bridges and LID-Driven Optimization

[openreview] [pdf]

Abstract The Hierarchical Navigable Small World (HNSW) algorithm is widely used for approximate nearest neighbor (ANN) search, leveraging the principles of navigable small-world graphs. However, it faces some limitations. The first is the local optima problem, which arises from the algorithm’s greedy search strategy, selecting neighbors based solely on proximity at each step. This often leads to cluster disconnections. The second limitation is that HNSW frequently fails to achieve logarithmic complexity, particularly in high-dimensional datasets, due to the exhaustive traversal through each layer. To address these limitations, we propose a novel algorithm that mitigates local optima and cluster disconnections while improving inference speed. The first component is a dual-branch HNSW structure with LID-based insertion mechanisms, enabling traversal from multiple directions. This improves outlier node capture, enhances cluster connectivity, and reduces the risk of local minima. The second component introduces a bridge-building technique that adds shortcuts between layers, enabling direct jumps and speeding up inference. Experiments on various benchmarks and datasets showed that our algorithm outperforms the original HNSW in both accuracy and speed. We evaluated six datasets across Computer Vision (CV), deep learning (DL), and Natural Language Processing (NLP), showing improvements of 2.5% in NLP, 15% in DL, and up to 35% in CV tasks. Inference speed is also improved by 12% across all datasets. Ablation studies revealed that LID-based insertion had the greatest impact on performance, followed by the dual-branch structure and bridge-building components.

456Toward Exploratory Inverse Constraint Inference with Generative Diffusion Verifiers

[openreview] [pdf]

Abstract An important prerequisite for safe control is aligning the policy with the underlying constraints in the environment. In many real-world applications, due to the difficulty of manually specifying these constraints, existing works have proposed recovering constraints from expert demonstrations by solving the Inverse Constraint Learning (ICL) problem. However, ICL is inherently ill-posed, as multiple constraints can equivalently explain the experts’ preferences, making the optimal solutions not uniquely identifiable. In this work, instead of focusing solely on a single constraint, we propose the novel approach of Exploratory ICL (ExICL). The goal of ExICL is to recover a diverse set of feasible constraints, thereby providing practitioners the flexibility to select the most appropriate constraint based on the needs of practical deployment. To achieve this goal, we design a generative diffusion verifier, which guides the trajectory generation process using the probabilistic representation of an optimal constrained policy. By comparing these decisions with those made by expert agents, we can efficiently verify a candidate constraint. Driven by the verification feedback, ExICL implements an exploratory constraint update mechanism that strategically facilitates the diversity within the collection of feasible constraints. Our empirical results demonstrate that ExICL can seamlessly and reliably generalize across different tasks and environments.

457DRIVE: Distributional Model-Based Reinforcement Learning via Variational Inference

[openreview] [pdf]

Abstract Distributional reinforcement learning (RL) provides a natural framework for estimating the distribution of returns rather than a single expected value. However, the control aspect of distributional RL has not been as thoroughly explored as the evaluation part, typically relying on the greedy selection rule with respect to either the expected value, akin to standard approaches, or risk-sensitive measures derived from the return distribution. On the other hand, casting RL as a probabilistic inference problem allows for flexible control solutions utilizing a toolbox of approximate inference techniques; however, its connection to distributional RL remains underexplored. In this paper, we bridge this gap by proposing a variational approach for efficient policy search. Our method leverages the log-likelihood of optimality as a learning proxy, decoupling it from traditional value functions. This learning proxy incorporates aleatoric uncertainty of the return distribution, enabling risk-aware decision-making. We provide a theoretical analysis of our framework, detailing the conditions for convergence. Empirical results on vision-based tasks in DMControl Suite demonstrate the effectiveness of our approach compared to various algorithms, as well as its ability to balance exploration and exploitation at different training stages.

458Learn from the Past: Dynamic Data Pruning with Historically Weighted Bernoulli Sampling

[openreview] [pdf]

Abstract Dynamic data pruning, which is also known as data importance sampling, has been proposed to improve training efficiency. For the case of sampling with replacement, the optimal sampling distribution to minimize the variance is to sample proportionally to the gradient norm, which can be approximated by the gradient norm of the logits from an extra forward pass. However, this could result in repeated samples, which can be an undesirable property. Noticing that most dynamic data pruning methods that avoid repeated samples can be seen as weighted Bernoulli sampling, in this work we study the optimal distribution to reduce its variance. Furthermore, to avoid an extra forward pass, we study the use of historic statistics. We propose the use of an exponential moving average and probability smoothing to improve performance.
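
A hedged sketch of the selection loop described here, with EMA smoothing of historic scores and probability smoothing; the simple normalization below is a stand-in for the paper's derived optimal inclusion probabilities:

```python
import numpy as np

def bernoulli_select(scores, ema, budget, beta=0.9, eps=0.05):
    """Weighted Bernoulli sampling for the next epoch: update the EMA of
    per-sample importance scores, convert them to inclusion probabilities with
    smoothing, and draw an independent keep/drop decision per sample."""
    ema[:] = beta * ema + (1 - beta) * np.asarray(scores)
    p = ema / ema.sum() * budget                     # target expected subset size
    p = (1 - eps) * p + eps * budget / len(ema)      # smoothing: no sample starves
    p = np.clip(p, 0.0, 1.0)
    keep = np.random.rand(len(p)) < p
    return np.nonzero(keep)[0], p                    # indices + probs for debiasing
```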

459A Simple Baseline for Predicting Future Events with Auto-Regressive Tabular Transformers

[openreview] [pdf]

Abstract Many real-world applications of tabular data involve using historic events to predict properties of new ones, for example whether a credit card transaction is fraudulent or what rating a customer will assign a product on a retail platform. Existing approaches to event prediction include costly, brittle, and application-dependent techniques such as time-aware positional embeddings, learned row and field encodings, and oversampling methods for addressing class imbalance. Moreover, these approaches often assume specific use-cases, for example that we know the labels of all historic events or that we only predict a pre-specified label and not the data’s features themselves. In this work, we propose a simple but flexible baseline using standard autoregressive LLM-style transformers with elementary positional embeddings and a causal language modeling objective. Our baseline outperforms existing approaches across popular datasets and can be employed for various use-cases. We demonstrate that the same model can predict labels, impute missing values, or model event sequences.
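
A possible serialization for such a baseline, shown only to illustrate the input format; the separators, field order, and end-of-event marker are assumptions:

```python
def serialize_events(rows, field_order):
    """Serialize historic tabular events as 'field = value ;' token runs so a
    standard causal LM can predict labels, impute fields, or continue sequences."""
    tokens = []
    for row in rows:
        for f in field_order:
            tokens += [f, "=", str(row[f]), ";"]
        tokens.append("<eot>")                       # end-of-event marker
    return tokens

# e.g. serialize_events([{"amount": 12.5, "label": "fraud"}], ["amount", "label"])
```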

460Learning Policy Committees for Effective Personalization in MDPs with Diverse Tasks

[openreview] [pdf]

Abstract Many dynamic decision problems, such as robotic control, involve a series of tasks, many of which are unknown at training time. Typical approaches for these problems, such as multi-task and meta reinforcement learning, do not generalize well when the tasks are diverse. We propose a general framework to address this issue. In our framework, the goal is to learn a set of policies—a policy committee—such that at least one is near-optimal for most tasks that may be encountered at execution time. While we show that even a special case of this problem is inapproximable, we present two effective algorithmic approaches for it. The first of these yields provable approximation guarantees, albeit in small-dimensional settings (the best we can do due to inapproximability), whereas the second is a general and practical gradient-based approach. In addition, we provide provable sample complexity bounds for few-shot learning settings. Our experiments in personalized and multi-task RL settings using MuJoCo and Meta-World benchmarks show that the proposed approach outperforms state-of-the-art multi-task, meta-, and personalized RL baselines on training and test tasks, as well as in few-shot learning, often by a large margin.

461The Phase Transition Phenomenon of Shuffled Regression

[openreview] [pdf]

Abstract We study the phase transition phenomenon inherent in the shuffled (permuted) regression problem, which has found numerous applications in databases, privacy, data analysis, etc. For the permuted regression task: Y = ΠXB, the goal is to recover the permutation matrix Π as well as the coefficient matrix B. It has been empirically observed in prior studies that when recovering Π, there exists a phase transition phenomenon: the error rate drops to zero rapidly once the parameters reach certain thresholds. In this study, we aim to precisely identify the locations of the phase transition points by leveraging techniques from message passing (MP). In our analysis, we first transform the permutation recovery problem into a probabilistic graphical model. Then, we leverage the analytical tools rooted in the message passing (MP) algorithm and derive an equation to track the convergence of the MP algorithm. By linking this equation to the branching random walk process, we are able to characterize the impact of the signal-to-noise ratio (snr) on the permutation recovery. Depending on whether the signal is given or not, we separately investigate the oracle case and the non-oracle case. The bottleneck in identifying the phase transition regimes lies in deriving closed-form formulas for the corresponding critical points, but only in rare scenarios can one obtain such precise expressions. To tackle this challenge, we propose the Gaussian approximation method, which allows us to obtain the closed-form formulas in almost all scenarios. In the oracle case, our method can fairly accurately predict the phase transition snr. In the non-oracle case, our proposed algorithm can predict the maximum allowed number of permuted rows and uncover its dependency on the sample number.

462Group Distributionally Robust Dataset Distillation with Risk Minimization

[openreview] [pdf]

Abstract Dataset distillation (DD) has emerged as a widely adopted technique for crafting a synthetic dataset that captures the essential information of a training dataset, facilitating the training of accurate neural models. Its applications span various domains, including transfer learning, federated learning, and neural architecture search. The most popular methods for constructing the synthetic data rely on matching the convergence properties of training the model with the synthetic dataset and the training dataset. However, targeting the training dataset must be thought of as auxiliary in the same sense that the training set is an approximate substitute for the population distribution, and the latter is the data of interest. Yet despite its popularity, an aspect that remains unexplored is the relationship of DD to its generalization, particularly across uncommon subgroups. That is, how can we ensure that a model trained on the synthetic dataset performs well when faced with samples from regions with low population density? Here, the representativeness and coverage of the dataset become salient over the guaranteed training error at inference. Drawing inspiration from distributionally robust optimization, we introduce an algorithm that combines clustering with the minimization of a risk measure on the loss to conduct DD. We provide a theoretical rationale for our approach and demonstrate its effective generalization and robustness across subgroups through numerical experiments.

463Learning the Partially Dynamic Travelling Salesman Problem

[openreview] [pdf]

Abstract Learning to solve the Travelling Salesman Problem (TSP) using Deep Reinforcement Learning (Deep RL) and Graph Neural Networks (GNNs) has shown promising results for small instances of the problem. We demonstrate that these methods can be extended to solve instances of a partially dynamic variant of the TSP. Solving this partially dynamic variant more effectively exploits the strengths of reinforcement learning and also presents challenges for more established methods of solving the TSP. We show the policies trained using Deep RL outperform modified versions of TSP solvers and heuristics for different distributions of dynamic vertices, including on larger instances than the policies were trained on. This shows the promise of Deep RL for solving this type of dynamic routing problem which is predicted to become of great importance as logistical services become more flexible and responsive to customer demand. Furthermore, our method is a general purpose approach to Deep RL where the problem consists of selecting items from a dynamically-evolving and arbitrarily-sized set.

464Value Residual Learning For Alleviating Attention Concentration In Transformers

[openreview] [pdf]

Abstract Transformers can capture long-range dependencies using self-attention, allowing tokens to attend to all others directly. However, stacking multiple attention layers leads to attention concentration. One natural way to address this issue is to use cross-layer attention, allowing information from earlier layers to be directly accessible to later layers. However, this approach is computationally expensive. To address this problem, we propose Transformer with residual value (ResFormer) which approximates cross-layer attention through adding a residual connection from the values of the first layer to all subsequent layers. Based on this method, one variant is the Transformer with single layer value (SVFormer), where all layers share the same value embedding from the first layer, reducing the KV cache by nearly 50%. Comprehensive empirical evidence demonstrates that ResFormer mitigates the attention concentration problem in deeper layers and enhances representation across most layers, outperforming the vanilla Transformer, DenseFormer, and NeuTRENO in training error as well as downstream tasks. SVFormer trains significantly faster than the vanilla Transformer and performs better than other methods like GQA and CLA, with performance influenced by sequence length and cumulative learning rate.
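
The value-residual idea reduces to a one-line change inside attention. A single-head sketch follows, with projections simplified and masking and multi-head logic elided (the paper uses standard multi-head blocks):

```python
import torch
import torch.nn.functional as F

class ValueResidualAttention(torch.nn.Module):
    """Single-head attention with a residual connection from the first layer's
    values: every later layer attends over V_l + V_1."""
    def __init__(self, dim):
        super().__init__()
        self.q = torch.nn.Linear(dim, dim)
        self.k = torch.nn.Linear(dim, dim)
        self.v = torch.nn.Linear(dim, dim)

    def forward(self, x, v1=None):
        q, k, v = self.q(x), self.k(x), self.v(x)
        if v1 is None:
            v1 = v                                   # first layer: record V_1
        else:
            v = v + v1                               # later layers: value residual
        attn = F.softmax(q @ k.transpose(-2, -1) / q.shape[-1] ** 0.5, dim=-1)
        return attn @ v, v1                          # thread V_1 through the stack

# usage: x, v1 = torch.randn(2, 16, 64), None
# for layer in [ValueResidualAttention(64) for _ in range(4)]:
#     x, v1 = layer(x, v1)
```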

465SELF-EVOLVED REWARD LEARNING FOR LLMS

[openreview] [pdf]

Abstract Reinforcement Learning from Human Feedback (RLHF) is a crucial technique for aligning language models with human preferences and is a key factor in the success of modern conversational models like GPT-4, ChatGPT, and Llama 2. A significant challenge in employing RLHF lies in training a reliable reward model (RM), which relies on high-quality labels. Typically, these labels are provided by human experts or a stronger AI, both of which can be costly and introduce bias that may affect the language model’s responses. As models improve, human input may become less effective in enhancing their performance. This paper explores the potential of using the RM itself to generate additional training data for a more robust RM. Our experiments demonstrate that reinforcement learning from self-feedback outperforms baseline approaches. We conducted extensive experiments with our approach on multiple datasets, such as HH-RLHF and UltraFeedback, and models including Mistral and Llama 3, comparing it against various baselines. Our results indicate that, even with a limited amount of human-labeled data, learning from self-feedback can robustly enhance the performance of the RM, thereby improving the capabilities of large language models.

466DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback

[openreview] [pdf]

Abstract Restless multi-armed bandits (RMAB) have been widely used to model constrained sequential decision making problems, where the state of each restless arm evolves according to a Markov chain and each state transition generates a scalar reward. However, the success of RMAB crucially relies on the availability and quality of reward signals. Unfortunately, specifying an exact reward function in practice can be challenging and even infeasible. In this paper, we introduce Pref-RMAB, a new RMAB model in the presence of preference signals, where the decision maker only observes pairwise preference feedback rather than scalar reward from the activated arms at each decision epoch. Preference feedback, however, arguably contains less information than the scalar reward, which makes Pref-RMAB seemingly more difficult. To address this challenge, we present a direct online preference learning (DOPL) algorithm for Pref-RMAB to efficiently explore the unknown environments, adaptively collect preference data in an online manner, and directly leverage the preference feedback for decision-making. We prove that DOPL yields a sublinear regret. To the best of our knowledge, this is the first algorithm to ensure Õ(√(T ln T)) regret for RMAB with preference feedback. Experimental results further demonstrate the effectiveness of DOPL.

467Off-Policy Maximum Entropy RL with Visitation Measures

[openreview] [pdf]

Abstract We introduce a new maximum entropy reinforcement learning framework based on the distribution of states and actions visited by a policy. More precisely, an intrinsic reward function is added to the reward function of the Markov decision process that shall be controlled. For each state and action, this intrinsic reward is the relative entropy of the discounted distribution of states and actions (or features from these states and actions) during the next time steps. We prove that this distribution is the fixed point of a contractive operator. Furthermore, the problem of maximizing the expected discounted sum of these intrinsic rewards is proven to be an approximation of the minimization of an upper bound on the suboptimality gap of the state-action value function of the policy. We finally describe how existing algorithms can integrate these intrinsic rewards to enhance exploration and introduce a practical algorithm for learning this fixed point off-policy, using state-action transitions, relying on N-step bootstrapping of the operator. Empirically, this maximum entropy reinforcement learning framework provides exploration policies with good coverage of the state-action space, and high-performing control policies, which both can be computed off-policy.

468How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework

[openreview] [pdf]

Abstract Discrete diffusion models have gained increasing attention for their ability to model complex distributions with tractable sampling and inference. However, the error analysis for discrete diffusion models remains less well-understood. In this work, we propose a comprehensive framework for the error analysis of discrete diffusion models based on Lévy-type stochastic integrals. By generalizing the Poisson random measure to that with a time-independent and state-dependent intensity, we rigorously establish a stochastic integral formulation of discrete diffusion models and provide the corresponding change of measure theorems that are intriguingly analogous to Itô integrals and Girsanov’s theorem for their continuous counterparts. Our framework unifies and strengthens the current theoretical results on discrete diffusion models and obtains the first error bound for the τ-leaping scheme in KL divergence. With error sources clearly identified, our analysis gives new insight into the mathematical properties of discrete diffusion models and offers guidance for the design of efficient and accurate algorithms for real-world discrete diffusion model applications.

469Only-IF: Revealing the Decisive Effect of Instruction Diversity on Generalization

[openreview] [pdf]

Abstract Understanding and accurately following instructions is critical for large language models (LLMs) to be effective across diverse tasks. In this work, we conduct a rigorous investigation into the factors that enable generalization to unseen instructions. Through controlled experiments, inspired by the Turing-complete Markov algorithm, we demonstrate that such generalization only emerges when training data is diversified enough across semantic domains. Our findings also reveal that merely diversifying within limited domains fails to ensure robust generalization. In contrast, cross-domain data diversification, even under constrained data budgets, significantly enhances a model’s adaptability. We further extend our analysis to real-world scenarios, including fine-tuning of specialist and generalist models. Our research provides important insights for dataset collation, particularly when optimizing model performance by expanding training data for both specialist and generalist scenarios. We show that careful consideration of data diversification is key: training specialist models with data extending beyond their core domain leads to significant performance improvements, while generalist models benefit from diverse data mixtures that enhance their overall instruction-following capabilities across a wide range of applications. Our results highlight the critical role of strategic diversification and offer clear guidelines for improving data quality.

470Learning Interpretable and Influential Directions with Signal Vectors and Uncertainty Region Alignment

[openreview] [pdf]

Abstract Latent space directions have played a key role in understanding, debugging, and fixing deep learning models. Concepts are often encoded in distinct feature space directions, and evaluating impact of these directions on the model’s predictions, highlights their importance in the decision-making process. Additionally, recent studies have shown that penalizing directions associated with spurious artifacts during training can force models to unlearn features irrelevant to their prediction task. Identifying these directions, therefore, provides numerous benefits, including a deeper understanding of the model’s strategy, fostering trust, and enabling model correction and improvement. We introduce a novel unsupervised approach utilizing signal vectors and uncertainty region alignment to discover latent space directions that meet two key debugging criteria: significant influence on model predictions and high level of interpretability. To our knowledge, this method is the first of its kind to uncover such directions, leveraging the inherent structure of the feature space and the knowledge encoded in the deep network. We validate our approach using both synthetic and real-world benchmarks, demonstrating that the discovered directions effectively fulfill the critical debugging criteria.

471Unlocking Guidance for Discrete State-Space Diffusion and Flow Models

[openreview] [pdf]

Abstract Generative models on discrete state-spaces have a wide range of potential applications, particularly in the domain of natural sciences. In continuous state-spaces, controllable and flexible generation of samples with desired properties has been realized using guidance on diffusion and flow models. However, these guidance approaches are not readily amenable to discrete state-space models. Consequently, we introduce a general and principled method for applying guidance on such models. Our method depends on leveraging continuous-time Markov processes on discrete state-spaces, which unlocks computational tractability for sampling from a desired guided distribution. We demonstrate the utility of our approach, Discrete Guidance, on a range of applications including guided generation of small-molecules, DNA sequences and protein sequences.

472Learning and Steering Game Dynamics Towards Desirable Outcomes

[openreview] [pdf]

Abstract Game dynamics, which describe how agents’ strategies evolve over time based on past interactions, can exhibit a variety of undesirable behaviours, including convergence to suboptimal equilibria, cycling, and chaos. While central planners can employ incentives to mitigate such behaviors and steer game dynamics towards desirable outcomes, the effectiveness of such interventions critically relies on accurately predicting agents’ responses to these incentives---a task made particularly challenging when the underlying dynamics are unknown and observations are limited. To address this challenge, this work introduces the Side Information Assisted Regression with Model Predictive Control (SIAR-MPC) framework. We extend the recently introduced SIAR method to incorporate the effect of control, enabling it to utilize side-information constraints inherent to game theoretic applications to model agent responses to incentives from scarce data. MPC then leverages this model to implement adaptive incentive adjustments. Our experiments demonstrate the efficiency of SIAR-MPC in guiding systems towards socially optimal equilibria, stabilizing chaotic and cycling behaviors. Comparative analyses in data-scarce settings show SIAR-MPC’s superior performance compared to pairing MPC with state-of-the-art alternatives like Sparse Identification of Nonlinear Dynamics (SINDy) and Physics Informed Neural Networks (PINNs).

473Nesterov acceleration in benignly non-convex landscapes

[openreview] [pdf]

Abstract While momentum-based optimization algorithms are commonly used in the notoriously non-convex optimization problems of deep learning, their analysis has historically been restricted to the convex and strongly convex setting. In this article, we partially close this gap between theory and practice and demonstrate that virtually identical guarantees can be obtained in optimization problems with a 'benign' non-convexity. We show that these weaker geometric assumptions are well justified in overparametrized deep learning, at least locally. Variations of this result are obtained for a continuous time model of Nesterov’s accelerated gradient descent algorithm (NAG), the classical discrete time version of NAG, and versions of NAG with stochastic gradient estimates with purely additive noise and with noise that exhibits both additive and multiplicative scaling.

474Toward Efficient Multi-Agent Exploration With Trajectory Entropy Maximization

[openreview] [pdf]

Abstract Recent works have increasingly focused on learning decentralized policies for agents as a solution to the scalability challenges in Multi-Agent Reinforcement Learning (MARL), where agents typically share the parameters of a policy network to make action decisions. However, this parameter sharing can impede efficient exploration, as it may lead to similar behaviors among agents. Different from previous mutual information-based methods that promote multi-agent diversity, we introduce a novel multi-agent exploration method called Trajectory Entropy Exploration (TEE). Our method employs a particle-based entropy estimator to maximize the entropy of different agents’ trajectories in a contrastive trajectory representation space, resulting in diverse trajectories and efficient exploration. This entropy estimator avoids challenging density modeling and scales effectively in high-dimensional multi-agent settings. We integrate our method with MARL algorithms by deploying an intrinsic reward for each agent to encourage entropy maximization. To validate the effectiveness of our method, we test our method in challenging multi-agent tasks from several MARL benchmarks. The results demonstrate that our method consistently outperforms existing state-of-the-art methods.

475Probing the Latent Hierarchical Structure of Data via Diffusion Models

[openreview] [pdf]

Abstract High-dimensional data must be highly structured to be learnable. Although the compositional and hierarchical nature of data is often put forward to explain learnability, quantitative measurements establishing these properties are scarce. Likewise, accessing the latent variables underlying such a data structure remains a challenge. Forward-backward experiments in diffusion-based models, where a datum is noised and then denoised, are a promising tool to achieve these goals. We predict in simple hierarchical models that, in this process, changes in data occur by correlated chunks, with a length scale that diverges at a noise level where a phase transition is known to take place. Remarkably, we confirm this prediction in both text and image datasets using state-of-the-art diffusion models. Our results suggest that forward-backward experiments are informative on the nature of latent variables, and that the effect of changing deeper ones is revealed near the transition.

476TimeDiT: General-purpose Diffusion Transformers for Time Series Foundation Model

[openreview] [pdf]

Abstract With recent advances in building foundation models for text and video data, there is a surge of interest in foundation modeling for time series. Many families of models have been developed utilizing a temporal autoregressive Transformer architecture, whose effectiveness has been proven in Large Language Models (LLMs). However, real-world time series exhibit unique challenges, such as variable channel sizes across domains, missing values, and varying signal sampling intervals due to the multi-resolution nature of real-world data. Additionally, the unidirectional nature of temporally autoregressive decoding typically learns a deterministic mapping relationship and limits the incorporation of domain knowledge, such as physical laws. To address these challenges, we introduce the Time Diffusion Transformer (TimeDiT), a general foundation model for time series that jointly leverages the transformer inductive bias to capture temporal dependencies and the diffusion processes to generate high-quality candidate samples. The proposed mask unit for task-agnostic pretraining and task-specific sampling enables direct processing of multivariate inputs even with missing values or multi-resolution. Furthermore, we introduce a theoretically justified finetuning-free model editing strategy that allows the flexible integration of external knowledge during the sampling process. Extensive experiments conducted on a variety of tasks, such as forecasting, imputation, and anomaly detection highlight TimeDiT’s adaptability as a foundation model, addressing diverse time series challenges and advancing analysis in various fields.

477InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation

[openreview] [pdf]

Abstract Data analytics is essential for extracting valuable insights from data that can assist organizations in making effective decisions. We introduce InsightBench, a benchmark dataset with three key features. First, it consists of 100 datasets representing diverse business use cases such as finance and incident management, each accompanied by a carefully curated set of insights planted in the datasets. Second, unlike existing benchmarks focusing on answering single queries, InsightBench evaluates agents based on their ability to perform end-to-end data analytics, including formulating questions, interpreting answers, and generating a summary of insights and actionable steps. Third, we conducted comprehensive quality assurance to ensure that each dataset in the benchmark had clear goals and included relevant and meaningful questions and analysis. Furthermore, we implement a two-way evaluation mechanism using LLaMA-3 as an effective, open-source evaluator to assess agents’ ability to extract insights. We also propose AgentPoirot, our baseline data analysis agent capable of performing end-to-end data analytics. Our evaluation on InsightBench shows that AgentPoirot outperforms existing approaches (such as Pandas Agent) that focus on resolving single queries. We also compare the performance of open- and closed-source LLMs and various evaluation strategies. Overall, this benchmark serves as a testbed to motivate further development in comprehensive automated data analytics.

478Noise Prompt Learning: Learning the Winning Tickets for Diffusion Sampling

[openreview] [pdf]

Abstract Text-to-image diffusion model is a popular paradigm that synthesizes personalized images by providing a text prompt and a random Gaussian noise. While people observe that some noises are winning tickets that can achieve better text-image alignment and higher human preference than others, we still lack a machine learning framework to obtain those winning noises. To learn winning noises for diffusion sampling, we mainly make three contributions in this paper. First, we identify a new concept termed the noise prompt, which aims at turning a random Gaussian noise into a winning noise ticket by adding a small desirable perturbation derived from the text prompt. Following the concept, we formulate the noise prompt learning framework that systematically learns “prompted” winning noise tickets associated with a text prompt for diffusion models. Second, we design a noise prompt data collection pipeline and collect a large-scale noise prompt dataset (NPD) that contains 100k pairs of random noises and winning noises with the associated text prompts. With the prepared NPD as the training dataset, we train a small noise prompt network (NPNet) that can directly learn to transform a random noise ticket into a winning noise ticket. The learned winning noise perturbation can be considered as a kind of prompt for noise, as it is rich in semantic information and tailored to the given text prompt. Third, our extensive experiments demonstrate the impressive effectiveness and generalization of NPNet on improving the quality of synthesized images across various diffusion models, including SDXL, DreamShaper-xl-v2-turbo, and Hunyuan-DiT. Moreover, NPNet is a small and efficient controller that acts as a plug-and-play module with very limited additional inference and computational costs, as it just provides a winning noise instead of a random noise without accessing the original pipeline.
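
A minimal sketch of what such a noise prompt network could look like; the architecture, sizes, and perturbation scale below are assumptions, not the paper's design:

```python
import torch

class NPNet(torch.nn.Module):
    """Map an initial Gaussian noise plus a text embedding to a small
    perturbation, producing a 'winning' noise for the same sampler."""
    def __init__(self, noise_dim, text_dim, hidden=512):
        super().__init__()
        self.net = torch.nn.Sequential(
            torch.nn.Linear(noise_dim + text_dim, hidden),
            torch.nn.SiLU(),
            torch.nn.Linear(hidden, noise_dim),
        )

    def forward(self, noise, text_emb, scale=0.1):   # perturbation scale: assumed
        flat = noise.flatten(1)                      # (B, noise_dim)
        delta = self.net(torch.cat([flat, text_emb], dim=-1))
        return noise + scale * delta.view_as(noise)
```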

479WARP: On the Benefits of Weight Averaged Rewarded Policies

[openreview] [pdf]

Abstract Reinforcement learning from human feedback (RLHF) aligns large language models by encouraging their generations to have high rewards, using a reward model trained on human preferences. To prevent forgetting of pre-trained knowledge, RLHF usually incorporates a KL regularization; this forces the policy to remain close to its initialization, though it hinders the reward optimization. To address the trade-off between KL and reward, in this paper we introduce a novel alignment strategy named Weight Averaged Rewarded Policies (WARP), merging policies in the weight space at three distinct stages. First, it uses the exponential moving average of the policy as a dynamic anchor in the KL regularization. Second, it applies spherical interpolation to merge independently fine-tuned policies into a new enhanced one. Third, it linearly interpolates between this merged model and the initialization, to recover features from pre-training. This procedure is then applied iteratively, with each iteration’s final model used as an advanced initialization for the next, progressively refining the KL-reward Pareto front, achieving superior rewards at fixed KL. Experiments with Gemma policies validate that WARP improves their quality and alignment, outperforming open-source models.
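
The three weight-space stages can be sketched directly on state dicts. Here slerp is applied per-tensor to full weights, a simplification (WARP applies it to the fine-tuning updates relative to the shared initialization), and all coefficients are assumptions:

```python
import torch

def ema_update(anchor, policy, mu=0.01):
    """Stage 1: exponential moving average of the policy, used as the dynamic
    anchor in the KL regularization."""
    for a, p in zip(anchor.values(), policy.values()):
        a.mul_(1 - mu).add_(p, alpha=mu)

def slerp(sd_a, sd_b, t=0.5):
    """Stage 2: spherical interpolation of two fine-tuned policies, per tensor."""
    out = {}
    for key in sd_a:
        a, b = sd_a[key].flatten().float(), sd_b[key].flatten().float()
        omega = torch.acos((a @ b / (a.norm() * b.norm() + 1e-12)).clamp(-1, 1))
        so = torch.sin(omega) + 1e-12
        out[key] = ((torch.sin((1 - t) * omega) / so) * a
                    + (torch.sin(t * omega) / so) * b).view_as(sd_a[key])
    return out

def toward_init(sd_init, sd_merged, eta=0.3):
    """Stage 3: linear interpolation between the merged model and the
    initialization, recovering pre-trained features."""
    return {k: (1 - eta) * sd_init[k] + eta * sd_merged[k] for k in sd_init}
```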

480Unifying Back-Propagation and Forward-Forward Algorithms through Model Predictive Control

[openreview] [pdf]

Abstract We introduce a Model Predictive Control (MPC) framework for training deep neural networks, systematically unifying the Back-Propagation (BP) and Forward-Forward (FF) algorithms. At the same time, it gives rise to a range of intermediate training algorithms with varying look-forward horizons, leading to a performance-efficiency trade-off. We perform a precise analysis of this trade-off on a deep linear network, where the qualitative conclusions carry over to general networks. Based on our analysis, we propose a principled method to choose the optimization horizon based on given objectives and model specifications. Numerical results on various models and tasks demonstrate the versatility of our method.

481Minifinetuning: Low-Data Generation Domain Adaptation through Corrective Self-Distillation

[openreview] [pdf]

Abstract Finetuning language models for a new domain inevitably leads to the deterioration of their general performance. This becomes more pronounced the more limited the finetuning data resource. We introduce minifinetuning (MFT), a method for language model domain adaptation that considerably reduces the effects of overfitting-induced degeneralization in low-data settings and does so in the absence of any pre-training data for replay. MFT demonstrates 2-10x more favourable specialization-to-degeneralization ratios than standard finetuning across a wide range of models and domains and exhibits an intrinsic robustness to overfitting when data in the new domain is scarce and down to as little as 500 samples. Employing corrective self-distillation that is individualized on the sample level, MFT outperforms parameter-efficient finetuning methods, demonstrates replay-like forgetting mitigation properties, and is composable with either for a combined effect.
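
A hedged sketch of what a per-sample corrective self-distillation loss could look like: the frozen pre-finetuning model's distribution is nudged toward the ground-truth token and used as a soft target. The fixed correction strength delta is an assumption; MFT individualizes the correction at the sample level:

```python
import torch
import torch.nn.functional as F

def mft_loss(student_logits, teacher_logits, targets, delta=0.2):
    """Cross-entropy of the student against a corrected soft target: the frozen
    pre-finetuning (teacher) distribution moved toward the ground-truth token."""
    with torch.no_grad():
        t = F.softmax(teacher_logits, dim=-1)         # teacher's own distribution
        onehot = F.one_hot(targets, t.shape[-1]).float()
        corrected = (1 - delta) * t + delta * onehot  # convex corrective target
    logp = F.log_softmax(student_logits, dim=-1)
    return -(corrected * logp).sum(-1).mean()
```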

482Investigating Memorization in Video Diffusion Models

[openreview] [pdf]

Abstract Diffusion models, widely used for image and video generation, face a significant limitation: the risk of memorizing and reproducing training data during inference, potentially generating unauthorized copyrighted content. While prior research has focused on image diffusion models (IDMs), video diffusion models (VDMs) remain underexplored. To address this, we introduce new metrics specifically designed to separately assess content and motion memorization in VDMs. By applying these metrics, we systematically analyze memorization in various pretrained VDMs, including text-conditional and unconditional models on various datasets, revealing that memorization is widespread across both video and image datasets. Finally, we propose effective detection strategies for both content and motion memorization, offering a foundational approach for improving privacy in VDMs.

483THE ROBUSTNESS OF DIFFERENTIABLE CAUSAL DISCOVERY IN MISSPECIFIED SCENARIOS

[openreview] [pdf]

Abstract Causal discovery aims to learn causal relationships between variables from targeted data, making it a fundamental task in machine learning. However, causal discovery algorithms often rely on unverifiable causal assumptions, which are usually difficult to satisfy in real-world data, thereby limiting the broad application of causal discovery in practical scenarios. Inspired by these considerations, this work extensively benchmarks the empirical performance of various mainstream causal discovery algorithms, which assume i.i.d. data, under eight model assumption violations. Our experimental results show that differentiable causal discovery methods exhibit counter-intuitive robustness under the metrics of Structural Hamming Distance and Structural Intervention Distance of the inferred graphs in challenging scenarios, except for scale variation. We also provide the theoretical explanations for the performance of differentiable causal discovery methods. Finally, our work aims to comprehensively benchmark the performance of recent differentiable causal discovery methods under model assumption violations, and provide the standard for reasonable evaluation of causal discovery, as well as to further promote its application in real-world scenarios.

484ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization

[openreview] [pdf]

Abstract Reward shaping is a critical component in reinforcement learning (RL), particularly for complex tasks where sparse rewards can hinder learning. While shaping rewards have been introduced to provide additional guidance, selecting effective shaping functions remains challenging and computationally expensive. This paper introduces Online Reward Selection and Policy Optimization (ORSO), a novel approach that frames shaping reward selection as an online model selection problem. ORSO employs principled exploration strategies to automatically identify promising shaping reward functions without human intervention, balancing exploration and exploitation with provable regret guarantees. We demonstrate ORSO’s effectiveness across various continuous control tasks using the Isaac Gym simulator. Compared to traditional methods that fully evaluate each shaping reward function, ORSO significantly improves sample efficiency, reduces computational time, and consistently identifies high-quality reward functions that produce policies comparable to those generated by domain experts through hand-engineered rewards.

485The Hidden Cost of Waiting for Accurate Predictions

[openreview] [pdf]

Abstract Algorithmic predictions are increasingly informing societal resource allocations by identifying individuals for targeting. Policymakers often build these systems with the assumption that by gathering more observations on individuals, they can improve predictive accuracy and, consequently, allocation efficiency. An overlooked yet consequential aspect of prediction-driven allocations is that of timing. The planner has to trade off relying on earlier and potentially noisier predictions to intervene before individuals experience undesirable outcomes against waiting to gather more observations to make more precise allocations. We examine this tension using a simple mathematical model, where the planner collects observations on individuals to improve predictions over time. We analyze both the ranking induced by these predictions and optimal resource allocation. We show that though individual prediction accuracy may improve over time, counter-intuitively, the average ranking loss can worsen. As a result, the planner’s ability to improve social welfare can decline. We identify inequality as a driving factor behind this phenomenon. Our findings provide a nuanced perspective and challenge the conventional wisdom that it is preferable to wait for more accurate predictions to ensure the most efficient allocations.

486Stable batched bandit: Optimal regret with free inference

[openreview] [pdf]

Abstract In this paper, we discuss statistical inference when using a sequential strategy to collect data. While inferential tasks become challenging with sequentially collected data, we argue that this problem can be alleviated when the sequential algorithm satisfies certain stability properties; we call such algorithms stable bandit algorithms. Focusing on batched bandit problems, we first demonstrate that popular algorithms including the greedy-UCB algorithm and ε-greedy ETC algorithms are not stable, complicating downstream inferential tasks. Our main result shows that a form of elimination algorithm is stable in the batched bandit setup, and we characterize the asymptotic distribution of the sample means. This result allows us to construct asymptotically exact confidence intervals for arm-means which are sharper than existing concentration-based bounds. As a byproduct of our main results, we propose an Explore and Commit (ETC) strategy, which is stable --- thus allowing easy statistical inference --- and also attains optimal regret up to a factor of 4. Our work connects two historically conflicting paradigms in sequential learning environments: regret minimization and statistical inference. Ultimately, we demonstrate that it is possible to minimize regret without sacrificing the ease of performing statistical inference, bridging the gap between these two important aspects of sequential decision-making.
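
A minimal sketch of an explore-and-commit strategy with the post-hoc inference it enables; the uniform, non-adaptive exploration phase is what keeps the sample means amenable to standard confidence intervals. The constants (unit variance, 95% level) are assumptions:

```python
import numpy as np

def explore_then_commit(pull, n_arms, n_explore, horizon):
    """Uniformly explore every arm n_explore times (a fixed, non-adaptive
    allocation), commit to the empirical best arm for the rest of the horizon,
    and return CLT-based confidence intervals for the arm means."""
    means = np.array([np.mean([pull(a) for _ in range(n_explore)])
                      for a in range(n_arms)])
    best = int(np.argmax(means))
    for _ in range(horizon - n_arms * n_explore):
        pull(best)                                   # commit phase
    half = 1.96 / np.sqrt(n_explore)                 # 95% CI, unit variance assumed
    return best, [(m - half, m + half) for m in means]

# e.g. explore_then_commit(lambda a: np.random.normal(a * 0.1), 3, 100, 1000)
```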

487Secure Diffusion Model Unlocked: Efficient Inference via Score Distillation

[openreview] [pdf]

Abstract As services based on diffusion models expand across various domains, preserving the privacy of client data becomes more critical. Fully homomorphic encryption and secure multi-party computation have been employed for privacy-preserving inference, but these methods are computationally expensive and primarily work for linear computations, making them challenging to apply to large diffusion models. While homomorphic encryption has been recently applied to diffusion models, it falls short of fully safeguarding privacy, as inputs used in the ε prediction are not encrypted. In this paper, we propose a novel framework for private inference for both inputs and outputs. To ensure robust approximations, we introduce several techniques for handling non-linear operations. Additionally, to reduce latency, we curtail the number of denoising steps while minimizing performance degradation of conditional generation through score distillation from the unconditional generation of the original model with full denoising steps. Experimental results show that our model produces high-quality images comparable to the original, and the proposed score distillation significantly enhances performance, compensating for fewer steps and approximation errors.

488A Simple Approach to Unifying Diffusion-based Conditional Generation

[openreview] [pdf]

Abstract Recent progress in image generation has sparked research into controlling these models through condition signals, with various methods addressing specific challenges in conditional generation. Instead of proposing another specialized technique, we introduce a simple, unified framework to handle diverse conditional generation tasks involving a specific image-condition correlation. By learning a joint distribution over a correlated image pair (e.g. image and depth) with a diffusion model, our approach enables versatile capabilities via different inference-time sampling schemes, including controllable image generation (e.g. depth to image), estimation (e.g. image to depth), signal guidance, joint generation (image & depth), and coarse control. Previous attempts at unification often introduce complexity through multi-stage training, architectural modification, or increased parameter counts. In contrast, our simplified formulation requires a single, computationally efficient training stage, maintains the standard model input, and adds minimal learned parameters (15% of the base model). Moreover, our model supports additional capabilities like non-spatially aligned and coarse conditioning. Extensive results show that our single model can produce comparable results with specialized methods and better results than prior unified methods. We also demonstrate that multiple models can be effectively combined for multi-signal conditional generation.

489Retrieval Augmented Time Series Forecasting

[openreview] [pdf]

Abstract Time series forecasting uses historical data to predict future trends, leveraging the relationships between past observations and available features. In this paper, we propose RAFT, a retrieval-augmented time series forecasting method to provide sufficient inductive biases and complement the model’s learning capacity. When forecasting the subsequent time frames, we directly retrieve historical data candidates from the training dataset with patterns most similar to the input, and utilize the future values of these candidates alongside the inputs to obtain predictions. This simple approach augments the model’s capacity by externally providing information about past patterns via retrieval modules. Our empirical evaluations on eight benchmark datasets show that RAFT consistently outperforms contemporary baselines, with an average win ratio of 86% for multivariate forecasting and 80% for univariate forecasting tasks.
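
A minimal sketch may make the retrieve-then-blend idea concrete: find the training windows whose pattern is closest to the current input and use their continuations alongside the model's own prediction. The function name, the Euclidean distance, and the blending weight below are illustrative assumptions, not details from the paper.

```python
import numpy as np

def raft_style_forecast(history, train_series, window, horizon,
                        k=3, alpha=0.5, model_pred=None):
    """Sketch of retrieval-augmented forecasting in the spirit of RAFT:
    retrieve the k most similar training windows and average their
    continuations, optionally blending with a base model's prediction."""
    query = history[-window:]
    candidates = []
    for t in range(len(train_series) - window - horizon):
        past = train_series[t:t + window]
        future = train_series[t + window:t + window + horizon]
        candidates.append((np.linalg.norm(past - query), future))
    candidates.sort(key=lambda c: c[0])                  # most similar first
    retrieved = np.mean([f for _, f in candidates[:k]], axis=0)
    if model_pred is None:
        return retrieved
    return alpha * model_pred + (1 - alpha) * retrieved  # simple blend

# usage on a synthetic sine series
series = np.sin(np.linspace(0, 20 * np.pi, 2000))
print(raft_style_forecast(series[:1500], series[:1400], window=48, horizon=12))
```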

490Subject Information Extraction for Novelty Detection with Domain Shifts

[openreview] [pdf]

Abstract Unsupervised novelty detection (UND), aimed at identifying novel samples, is essential in fields like medical diagnosis, cybersecurity, and industrial quality control. Most existing UND methods assume that the training data and testing normal data originate from the same domain and only consider the distribution variation between training data and testing data. However, in real scenarios, it is common for normal testing and training data to originate from different domains, a challenge known as domain shift. The discrepancies between training and testing data often lead to incorrect classification of normal data as novel by existing methods. A typical situation is that testing normal data and training data describe the same subject, yet they differ in the background conditions. To address this problem, we introduce a novel method that separates subject information from the background variation encapsulating the domain information to enhance detection performance under domain shifts. The proposed method minimizes the mutual information between the representations of the subject and background while modelling the background variation using a deep Gaussian mixture model; novelty detection is then conducted solely on the subject representations and hence is not affected by the variation of domains. Extensive experiments demonstrate that our model generalizes effectively to unseen domains and significantly outperforms baseline methods, especially under substantial domain shifts between training and testing data.

491Beyond Imitation: Learning Key Reasoning Steps from Dual Chain-of-Thoughts in Reasoning Distillation

[openreview] [pdf]

Abstract As Large Language Models (LLMs) scale up and gain powerful Chain-of-Thoughts (CoTs) reasoning abilities, practical resource constraints drive efforts to distill these capabilities into more compact Smaller Language Models (SLMs). We find that CoTs consist mainly of simple reasoning forms, with a small proportion (~4.7%) of key reasoning steps that truly impact conclusions. However, previous distillation methods typically involve supervised fine-tuning student SLMs only on correct CoTs data produced by teacher LLMs, resulting in students struggling to learn the key reasoning steps, instead imitating the teacher’s reasoning forms and making errors or omissions on these steps. To address these issues, drawing an analogy to human learning, where analyzing mistakes according to correct solutions often reveals the crucial steps leading to successes or failures, we propose mistakE-Driven key reasonIng step distillaTion (EDIT), a novel method that helps SLMs learn key reasoning steps rather than relying on simple fine-tuning alone. Firstly, to expose these crucial steps in CoTs, we design specific prompts to generate dual CoTs data with similar reasoning paths but divergent conclusions. Then, we apply the minimum edit distance algorithm on the dual CoTs data to locate these key steps and optimize the likelihood of these steps. Extensive experiments validate the effectiveness of EDIT across both in-domain and out-of-domain benchmark reasoning datasets. Further analysis shows that EDIT can generate high-quality CoTs with more correct key reasoning steps. Notably, we also explore how different mistake patterns affect performance and find that EDIT benefits more from logical errors than from knowledge or mathematical calculation errors in dual CoTs. Code can be found at https://anonymous.4open.science/r/eb77sh-F564
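
The mechanism of locating key steps from dual CoTs can be illustrated with a small alignment sketch. The snippet below uses difflib's sequence alignment as a stand-in for the paper's minimum edit distance algorithm; the step granularity and all names are illustrative assumptions.

```python
import difflib

def key_step_spans(correct_steps, mistaken_steps):
    """Sketch of locating key reasoning steps, loosely following EDIT's idea:
    steps where a correct CoT and a mistaken CoT (similar path, divergent
    conclusion) disagree are treated as the key steps whose likelihood the
    distillation objective should upweight."""
    sm = difflib.SequenceMatcher(a=mistaken_steps, b=correct_steps)
    key = []
    for op, i1, i2, j1, j2 in sm.get_opcodes():
        if op in ("replace", "insert"):     # present only in the correct CoT
            key.extend(correct_steps[j1:j2])
    return key

correct = ["let x=3", "so 2x=6", "add 4: 2x+4=10", "answer 10"]
wrong   = ["let x=3", "so 2x=6", "add 4: 2x+4=12", "answer 12"]
print(key_step_spans(correct, wrong))  # -> the two divergent steps
```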

492Learning Time-shared Hidden Heterogeneity for Counterfactual Outcome Forecast

[openreview] [pdf]

Abstract Forecasting counterfactual outcomes in the longitudinal setting can be critical for many time-related applications. To solve this problem, previous works apply different sequence models, including long short-term memory (LSTM) networks and transformers, to model the relationship between the observed histories, treatments and outcomes, and apply various approaches to remove treatment selection bias. However, these methods neglect the hidden heterogeneity of outcome generation among samples induced by hidden factors, which can hinder counterfactual outcome forecasting. To alleviate this problem, we capture the hidden heterogeneity by recovering the hidden factors and incorporate it into the outcome prediction process. Specifically, we propose a Time-shared Heterogeneity Learning from Time Series (THLTS) method which infers the shared part of hidden factors characterizing the heterogeneity across time steps with the architecture of variational autoencoders (VAEs). This method can be a flexible component and combined with arbitrary counterfactual outcome forecast methods. Experimental results on (semi-)synthetic datasets demonstrate that, combined with our method, mainstream models can improve their performance.

493Understanding Distribution Alignment Through Category Separability In An Infant-Inspired Domain Adaptation Task

[openreview] [pdf]

Abstract We introduce a novel distribution shift considering the tradeoff between object instances and viewpoints occurring in human and embodied visual experience; we study this problem through the lens of domain adaptation. We show that the performance of a well-known domain adaptation method, Joint Adaptation Network (JAN), deteriorates in the absence of ImageNet pretraining. We hypothesize that the separability of source and target category clusters in the feature space plays a crucial role in the effectiveness of JAN. To this end, we propose three metrics to measure category separability in the feature space and show that separability in the pretrained network is strongly correlated with downstream JAN accuracy. Further, we propose two novel loss functions that increase target separability by aligning the distribution of within-domain pairwise distances between the source and target clusters. Our experiments show that the application of these loss functions improves downstream performance on the test set.

494DisCoNet: Rethinking Adversarial Networks for Discriminator-Driven Distribution Modeling

[openreview] [pdf]

Abstract Out-of-distribution (OOD) detection holds significant importance across various applications. While semantic and domain-shift OOD problems are well-documented, this work focuses on the nuances of covariate shifts, which entail subtle perturbations or variations in the data distribution. These disturbances have proven to negatively impact machine learning performance. We have found that existing OOD detection methods often struggle to effectively distinguish covariate shifts from in-distribution instances, emphasizing the need for specialized solutions. Therefore, we propose DisCoNet, an Adversarial Variational Autoencoder (VAE) that rethinks the Generative Adversarial Networks paradigm. Instead of prioritizing the generator as the network’s core, we focus on the discriminator, using the generator as a supporting training tool. DisCoNet uses the VAE’s suboptimal outputs as negative samples to train the discriminator, thereby improving its ability to delineate the boundary between in-distribution samples and covariate shifts. By tightening this in-distribution boundary, DisCoNet achieves state-of-the-art results in public OOD detection benchmarks. The proposed model not only excels in detecting covariate shifts, achieving 98.9% AUROC on ImageNet-1K(-C), but also outperforms all prior methods on public semantic OOD benchmarks. With a model size of 25MB, it is highly effective on Far-OOD (OpenImage-O (99.4%) and iNaturalist (100.0%)) and Near-OOD (SSB-hard (99.9%) and NINCO (99.7%)) detection. The code will be made publicly available.

495Unveiling the Secret of AdaLN-Zero in Diffusion Transformer

[openreview] [pdf]

Abstract Diffusion transformer (DiT), a rapidly emerging architecture for image generation, has gained much attention. However, despite ongoing efforts to improve its performance, the understanding of DiT remains superficial. In this work, we delve into and investigate a critical conditioning mechanism within DiT, adaLN-Zero, which achieves superior performance compared to adaLN. Our work studies three potential elements driving this performance, including an SE-like structure, zero-initialization, and a “gradual” update order, among which zero-initialization proves to be the most influential. Building on this insight, we heuristically leverage Gaussian distributions to initialize each condition modulation, termed adaLN-Gaussian, leading to more stable and effective training. Extensive experiments following DiT on ImageNet1K demonstrate the effectiveness and generalization of adaLN-Gaussian, e.g., a notable improvement of 2.16% in FID score over adaLN-Zero.
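
For readers unfamiliar with the conditioning mechanism under discussion, here is a minimal PyTorch sketch of a DiT-style modulation branch with the two initializations the abstract compares. The layer layout and the Gaussian standard deviation are assumptions; the paper's exact adaLN-Gaussian parameterization may differ.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AdaLNModulation(nn.Module):
    """Sketch of a DiT-style conditioning branch producing scale/shift/gate.
    adaLN-Zero initializes the projection to zeros; adaLN-Gaussian (per the
    abstract) draws initial weights from a Gaussian instead. The std below
    is an illustrative guess, not the paper's value."""
    def __init__(self, cond_dim, hidden_dim, init="gaussian", std=1e-2):
        super().__init__()
        self.proj = nn.Linear(cond_dim, 3 * hidden_dim)  # scale, shift, gate
        if init == "zero":
            nn.init.zeros_(self.proj.weight)             # adaLN-Zero
        else:
            nn.init.normal_(self.proj.weight, mean=0.0, std=std)  # adaLN-Gaussian
        nn.init.zeros_(self.proj.bias)

    def forward(self, cond):
        scale, shift, gate = self.proj(F.silu(cond)).chunk(3, dim=-1)
        return scale, shift, gate

mod = AdaLNModulation(cond_dim=256, hidden_dim=256)
s, b, g = mod(torch.randn(4, 256))   # per-sample modulation parameters
```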

496Safety Alignment Shouldn’t Be Complicated

[openreview] [pdf]

Abstract As large language models (LLMs) are increasingly integrated into various applications, ensuring they generate safe and aligned responses is a pressing need. Previous research on alignment has largely focused on general instruction-following but has often overlooked the unique properties and challenges of safety alignment, such as the brittleness of safety mechanisms. To bridge the gap, we propose the Superficial Safety Alignment Hypothesis (SSAH), which posits that safety alignment should teach an otherwise unsafe model to choose the correct reasoning direction - interpreted as a specialized binary classification task - and incorporate a refusal mechanism with multiple reserved fallback options. Furthermore, through SSAH, we hypothesize that safety guardrails in LLMs can be established by just a small number of essential components. To verify this, we conduct an ablation study and successfully identify four types of attribute-critical components in safety-aligned LLMs: Exclusive Safety Unit (ESU), Exclusive Utility Unit (EUU), Complex Unit (CU), and Redundant Unit (RU). Our findings show that freezing certain safety-critical components (7.5%) during fine-tuning allows the model to retain its safety attributes while adapting to new tasks. Additionally, we show that leveraging redundant units (20%) in the pre-trained model as an “alignment budget” can effectively minimize the alignment tax while achieving the alignment goal. Taken together, this paper concludes that the atomic functional unit for safety in LLMs is at the neuron level and underscores that safety alignment should not be complicated. We believe this work contributes to the foundation of efficient and scalable safety alignment for future LLMs.

497Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner

[openreview] [pdf]

Abstract We present an approach called Dialogue Action Tokens (DAT) that adapts language model agents to plan goal-directed dialogues. The core idea is to treat each utterance as an action, thereby converting dialogues into games where existing approaches such as reinforcement learning can be applied. Specifically, we freeze a pretrained language model and train a small planner model that predicts a continuous action vector, used for controlled generation in each round. This design avoids the problem of language degradation under reward optimization. When evaluated on the Sotopia platform for social simulations, the DAT-steered LLaMA model surpasses GPT-4’s performance. We also apply DAT to steer an attacker language model in a novel multi-turn red-teaming setting, revealing a potential new attack surface.

498Never Forget the Basics: In-distribution Knowledge Retention for Continual Test-time Adaptation in Human Motion Prediction

[openreview] [pdf]

Abstract This paper presents a novel approach to addressing the underexplored challenge of human pose prediction in dynamic target domains that simultaneously contain in-distribution (ID) and out-of-distribution (OOD) data. Existing test-time adaptation (TTA) techniques predominantly focus on OOD data, neglecting the fact that ID data, which closely resembles the training distribution, is often encountered during real-world deployment, leading to significant degradation in ID performance. To address this, we introduce In-Distribution Knowledge Retention (IDKR), a continual TTA framework designed to preserve critical knowledge about ID data while adapting to unseen OOD sequences. Our method introduces an ID-informative subgraph learning strategy that leverages the structural characteristics of human skeletal data to compute a structural graph Fisher Information Matrix (SG-FIM). Unlike prior work, IDKR simultaneously considers both node and edge features in the skeletal graph, with edge features, representing the invariant bone lengths between parent-child joint pairs, being essential for maintaining structural consistency across poses. These edge features are key to extracting reliable SG-FIM parameters, enabling the model to retain parameters critical for ID performance while selectively updating those needed for OOD adaptation. Extensive experiments on multiple benchmark datasets demonstrate that IDKR consistently outperforms state-of-the-art methods, particularly in scenarios involving mixed ID and OOD data, setting a new standard for robust human pose prediction in dynamic environments.

499Flexible Active Learning of PDE Trajectories

[openreview] [pdf]

Abstract Accurately solving partial differential equations (PDEs) is critical for understanding complex scientific and engineering phenomena, yet traditional numerical solvers are computationally expensive. Surrogate models offer a more efficient alternative, but their development is hindered by the cost of generating sufficient ground-truth data from numerical simulations. In this paper, we present a novel framework for active learning (AL) in PDE surrogate modeling that reduces the data acquisition cost and improves model accuracy. Unlike the existing AL methods for PDEs that always acquire entire PDE trajectories, our approach strategically queries only a subset of the time steps from a numerical solver along a trajectory, while employing a surrogate model to approximate values for the remaining steps. This dramatically reduces the cost of data acquisition, which is proportional to the number of time steps simulated by the numerical solver, and thus allows the active learning algorithm to try out a more diverse set of trajectories given the same computational budget. To accommodate this novel framework, we develop an acquisition function that estimates the utility of a set of time steps by approximating its resulting variance reduction. We demonstrate the effectiveness of our method on several benchmark PDEs, including the Heat equation, Korteweg–De Vries equation, Kuramoto–Sivashinsky equation, and the incompressible Navier-Stokes equation. Extensive experiments validate that our approach outperforms existing methods, offering a cost-efficient solution to surrogate modeling for PDEs.

500Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory

[openreview] [pdf]

Abstract In order to train agents that can quickly adapt to new objectives or reward functions, efficient unsupervised representation learning in sequential decision-making environments can be important. Frameworks such as the Exogenous Block Markov Decision Process (Ex-BMDP) have been proposed to formalize this representation-learning problem (Efroni et al., 2022b). In the Ex-BMDP framework, the agent’s high-dimensional observations of the environment have two latent factors: a controllable factor, which evolves deterministically within a small state space according to the agent’s actions, and an exogenous factor, which represents time-correlated noise, and can be highly complex. The goal of the representation learning problem is to learn an encoder that maps from observations into the controllable latent space, as well as the dynamics of this space. Efroni et al. (2022b) has shown that this is possible with a sample complexity that depends only on the size of the controllable latent space, and not on the size of the noise factor. However, this prior work has focused on the episodic setting, where the controllable latent state resets to a specific start state after a finite horizon. By contrast, if the agent can only interact with the environment in a single continuous trajectory, prior works have not established sample-complexity bounds. We propose STEEL, the first provably sample-efficient algorithm for learning the controllable dynamics of an Ex-BMDP from a single trajectory, in the function approximation setting. STEEL has a sample complexity that depends only on the sizes of the controllable latent space and the encoder function class, and (at worst linearly) on the mixing time of the exogenous noise factor. We prove that STEEL is correct and sample-efficient, and demonstrate STEEL on two toy problems.

501Diffusion Process with Implicit Latents via Energy Models

[openreview] [pdf]

Abstract We present a generative model based on an ordered sequence of latent variables for intermediate distributions between a given source and a desired target distribution. We construct the probabilistic transitions among the latent variables using energy models that are in the form of classifiers. In our work, the intermediate transitional distributions are implicitly defined by the energy models during training, where the statistical properties of the data distribution are naturally taken into account. This is in contrast to denoising diffusion probabilistic models (DDPMs), where they are explicitly defined by the predefined scheduling of a sequential noise degradation process. Over the course of training, our model is designed to optimally determine the intermediate distributions by Langevin dynamics driven by the energy model. Energy-based models (EBMs), by comparison, typically require an additional generator since the intermediate distributions are not explicitly defined in their training procedure. We demonstrate the effectiveness and efficiency of the proposed algorithm in the context of image generation, achieving high-fidelity results with fewer inference steps on a variety of datasets.
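
Since the abstract hinges on Langevin dynamics driven by an energy model, a bare-bones update step may help. The sketch below shows generic energy-driven Langevin sampling; the paper's classifier-based energies and step schedule are replaced here by an arbitrary differentiable energy and fixed constants.

```python
import torch

def langevin_step(x, energy_fn, step_size=0.01, noise_scale=0.005):
    """One Langevin update driven by a learned energy (generic sketch):
    move samples downhill in energy with injected Gaussian noise."""
    x = x.detach().requires_grad_(True)
    energy = energy_fn(x).sum()
    grad, = torch.autograd.grad(energy, x)
    with torch.no_grad():
        x = x - step_size * grad + noise_scale * torch.randn_like(x)
    return x

# usage: a quadratic toy energy pulls samples toward the origin
x = torch.randn(16, 2)
for _ in range(100):
    x = langevin_step(x, lambda z: 0.5 * (z ** 2).sum(dim=-1))
print(x.norm(dim=-1).mean())   # shrinks toward 0 as sampling proceeds
```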

502SEMDICE: Off-policy State Entropy Maximization via Stationary Distribution Correction Estimation

[openreview] [pdf]

Abstract In the unsupervised pre-training for reinforcement learning, the agent aims to learn a prior policy for downstream tasks without relying on task-specific reward functions. We focus on state entropy maximization (SEM), where the goal is to learn a policy that maximizes the entropy of the state’s stationary distribution. In this paper, we introduce SEMDICE, a principled off-policy algorithm that computes a single, stationary Markov state-entropy-maximizing policy from an arbitrary off-policy dataset by optimizing directly within the space of stationary distributions. Experimental results demonstrate that SEMDICE outperforms baseline algorithms in maximizing state entropy while achieving the best adaptation efficiency for downstream tasks among SEM-based unsupervised RL pre-training methods.

503The Crucial Role of Samplers in Online Direct Preference Optimization

[openreview] [pdf]

Abstract Direct Preference Optimization (DPO) has emerged as a stable, scalable, and efficient solution for language model alignment. Despite its empirical success, the optimization properties, particularly the impact of samplers on its convergence rates, remain underexplored. In this paper, we provide a rigorous analysis of DPO’s convergence rates with different sampling strategies under the exact gradient setting, revealing a surprising separation: uniform sampling achieves linear convergence, while our proposed online sampler achieves quadratic convergence. We further adapt the sampler to practical settings by incorporating posterior distributions and logit mixing, demonstrating significant improvements over previous approaches. On the Safe-RLHF dataset, our method exhibits a 4.5% improvement over vanilla DPO and a 3.0% improvement over on-policy DPO; on Iterative-Prompt, our approach outperforms vanilla DPO, on-policy DPO, and Hybrid GSHF by over 4.2%. Our results not only offer insights into the theoretical standing of DPO but also pave the way for potential algorithm designs in the future.

504From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training

[openreview] [pdf]

Abstract We study the problem of training neural stochastic differential equations, or diffusion models, to sample from a Boltzmann distribution without access to target samples. Existing methods for training such models enforce time-reversal of the generative and noising processes, using either differentiable simulation or off-policy reinforcement learning (RL). We prove equivalences between families of objectives in the limit of infinitesimal discretization steps, linking entropic RL methods (GFlowNets) with continuous-time objects (partial differential equations and path space measures). We further show that an appropriate choice of coarse time discretization during training allows greatly improved sample efficiency and the use of time-local objectives, achieving competitive performance on standard sampling benchmarks with reduced computational cost.

505SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety

[openreview] [pdf]

Abstract As large language models (LLMs) continue to advance and find applications across a growing number of fields, ensuring the safety of LLMs has become increasingly critical. To address safety concerns, recent studies have proposed integrating safety constraints into reinforcement learning from human feedback (RLHF). However, these approaches tend to be complex and often unstable, as they encompass complicated procedures in RLHF along with additional procedures required by the safety constraints. Inspired by direct preference optimization (DPO), we introduce a new algorithm called SafeDPO, which is designed to implicitly optimize the safety alignment objective within a single stage of policy learning. The resulting algorithm can be implemented by introducing only one additional hyperparameter, which aims to further enhance safety, along with minor modifications to the DPO implementation. Consequently, SafeDPO successfully eliminates the necessity of fitting a reward and a cost model, as well as sampling from the language model during fine-tuning, while still enhancing the safety of LLMs. Finally, we demonstrate that SafeDPO achieves competitive performance compared to the current state-of-the-art safety alignment algorithm, both in terms of aligning with human preferences and improving safety.

506Towards Understanding the Universality of Transformers for Next-Token Prediction

[openreview] [pdf]

Abstract Causal Transformers are trained to predict the next token for a given context. While it is widely accepted that self-attention is crucial for encoding the causal structure of sequences, the precise underlying mechanism behind this in-context autoregressive learning ability remains unclear. In this paper, we take a step towards understanding this phenomenon by studying the approximation ability of Transformers for next-token prediction. Specifically, we explore the capacity of causal Transformers to predict the next token $x_{t+1}$ given an autoregressive sequence $(x_1, \dots, x_t)$ as a prompt, where $x_{t+1} = f(x_t)$, and $f$ is a context-dependent function that varies with each sequence. On the theoretical side, we focus on specific instances, namely when $f$ is linear or when $(x_t)$ is periodic. We explicitly construct a Transformer (with linear, exponential, or softmax attention) that learns the mapping $f$ in-context through a causal kernel descent method. The causal kernel descent method we propose provably estimates $x_{t+1}$ based solely on past and current observations $(x_1, \dots, x_t)$, with connections to the Kaczmarz algorithm in Hilbert spaces. We present experimental results that validate our theoretical findings and suggest their applicability to more general mappings $f$.

507Exploration in the Face of Strategic Responses: Provable Learning of Online Stackelberg Games

[openreview] [pdf]

Abstract We study online leader-follower games where the leader interacts with a myopic follower using a quantal response policy. The leader’s objective is to design an algorithm without prior knowledge of her reward function or the state transition dynamics. Crucially, the leader also lacks insight into the follower’s reward function and realized rewards, posing a significant challenge. To address this, the leader must learn the follower’s quantal response mapping solely through strategic interactions --- announcing policies and observing responses. We introduce a unified algorithm, Planning after Estimation, which updates the leader’s policies in a two-step approach. In particular, we first jointly estimate the leader’s value function and the follower’s response mapping by maximizing a sum of the Bellman error of the value function, the likelihood of the quantal response model, and a regularization term that encourages exploration. The leader’s policy is then updated through a greedy planning step based on these estimates. Our algorithm achieves a $\sqrt{T}$-regret in the context of general function approximation. Moreover, this algorithm avoids intractable optimistic planning and thus enhances implementation simplicity.

508Cross-Domain Off-Policy Evaluation and Learning for Contextual Bandits

[openreview] [pdf]

Abstract Off-Policy Evaluation and Learning (OPE/L) in contextual bandits is rapidly gaining popularity in real systems because new policies can be evaluated and learned securely using only historical logged data. However, existing methods in OPE/L cannot handle many challenging but prevalent scenarios such as few-shot data, deterministic logging policies, and new actions. In many applications, such as personalized medicine, content recommendations, education, and advertising, we need to evaluate and learn new policies in the presence of these challenges. Existing methods cannot evaluate and optimize effectively in these situations due to the notorious variance issue or limited exploration in the logged data. To enable OPE/L even under these unsolved challenges, we propose a new problem setup of Cross-Domain OPE/L, where we have access not only to the logged data from the target domain in which the new policy will be implemented but also to logged datasets collected from other domains. This novel formulation is widely applicable because we can often use historical data not only from the target hospital, country, device, or user segment but also from other hospitals, countries, devices, or segments. We develop a new estimator and policy gradient method to solve OPE/L by leveraging both target and source datasets, resulting in substantially enhanced OPE/L in the previously unsolved situations in our empirical evaluations.

509Graph Concept Bottleneck Models

[openreview] [pdf]

Abstract Concept Bottleneck Models (CBMs) provide explicit interpretations for deep neural networks through concepts and allow intervention with concepts to adjust final predictions. Existing CBMs assume concepts are conditionally independent given labels and isolated from each other, ignoring the hidden relationships among concepts. However, the set of concepts in CBMs often has an intrinsic structure where concepts are generally correlated: changing one concept will inherently impact its related concepts. To mitigate this limitation, we propose Graph CBMs: a new variant of CBM that facilitates concept relationships by constructing latent concept graphs, which can be combined with CBMs to enhance model performance while retaining their interpretability. Empirical results on real-world image classification tasks demonstrate Graph CBMs are (1) superior in image classification tasks while providing more concept structure information for interpretability; (2) able to utilize concept graphs for more effective interventions; and (3) robust across different training and architecture settings.

510Series-to-Series Diffusion Bridge Model

[openreview] [pdf]

Abstract Diffusion models have risen to prominence in time series forecasting, showcasing their robust capability to model complex data distributions. However, their effectiveness in deterministic predictions is often constrained by instability arising from their inherent stochasticity. In this paper, we revisit time series diffusion models and present a comprehensive framework that encompasses most existing diffusion-based methods. Building on this theoretical foundation, we propose a novel diffusion-based time series forecasting model, the Series-to-Series Diffusion Bridge Model ($\mathrm{S^2DBM}$), which leverages the Brownian Bridge process to reduce randomness in reverse estimations and improves accuracy by incorporating informative priors and conditions derived from historical time series data. Experimental results demonstrate that $\mathrm{S^2DBM}$ delivers superior performance in point-to-point forecasting and competes effectively with other diffusion-based models in probabilistic forecasting.
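
The variance-reduction argument rests on a basic property of the Brownian bridge: it is pinned at both endpoints, so its variance vanishes exactly where the prior and the target series live. A short NumPy sketch of such a pinned path follows; the names and the sigma value are illustrative, not taken from the paper.

```python
import numpy as np

def brownian_bridge_path(x0, x1, n_steps, sigma=0.1, seed=0):
    """Sample a Brownian bridge between a history-derived prior x0 and a
    target series x1. The standard deviation sigma * sqrt(t * (1 - t))
    vanishes at t = 0 and t = 1, which is what pins the process and
    reduces randomness in reverse estimates."""
    rng = np.random.default_rng(seed)
    path = []
    for t in np.linspace(0.0, 1.0, n_steps):
        mean = (1 - t) * x0 + t * x1
        std = sigma * np.sqrt(t * (1 - t))
        path.append(mean + std * rng.standard_normal(x0.shape))
    return np.stack(path)

path = brownian_bridge_path(np.zeros(24), np.ones(24), n_steps=50)
print(path[0].round(3), path[-1].round(3))  # exactly pinned at both ends
```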

511Flow Tree: A dynamic model for navigation paths and strategies

[openreview] [pdf]

Abstract Navigation is a dynamic process that involves learning how to represent the environment, along with positions in and trajectories through it. Spatial navigation skills vary significantly among individual humans. But what exactly differentiates a good navigator from a bad one, or an easy-to-navigate path from a hard one, is not well understood. Several studies have analysed exploration and navigation behaviour using static quantitative measures, like counts of positions visited or distance travelled. These static measures, however, are inherently limited in their ability to describe dynamic behaviors, providing a coarse quantification of the navigation process. To fill this gap, we introduce the Flow Tree, a novel data structure, which quantifies the dynamics of a group of trajectories through time. This is a discrete adaptation of the Reeb graph, a mathematical structure from topology, computed from multiple trajectories (from different people or the same person over time). Each divergence in trajectory is captured as a node, encoding the variability of the collection of trajectories. A Flow Tree encodes how difficult it will be to navigate a certain path for a group of humans. We apply the Flow Tree to a behavioural dataset of 100 humans exploring and then navigating a small, closed-form maze in virtual reality. In this paper we (1) describe what a Flow Tree is and how to calculate it, (2) show that Flow Trees can be used to predict path difficulty more effectively than static metrics, and (3) demonstrate that a trajectory through the Flow Tree is predictive of that individual’s success. We (4) introduce a hypothesis testing framework over Flow Trees to quantitatively differentiate the strategies of the best navigators from those of the worst. Thus, we show that Flow Trees are a powerful tool to analyse dynamic trajectory data. The code will be made publicly available at [anon-github-link].
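
As a rough intuition for the data structure, one can approximate a Flow Tree by a prefix tree over discretized trajectories, where any node with multiple children marks a divergence point. The sketch below deliberately simplifies away the Reeb-graph machinery the paper builds on; all names are mine.

```python
def build_flow_tree(trajectories):
    """Build a prefix tree over discretized trajectories; node counts
    record how many trajectories pass through each position."""
    tree = {"pos": None, "count": 0, "children": {}}
    for traj in trajectories:
        node = tree
        node["count"] += 1
        for pos in traj:
            node = node["children"].setdefault(
                pos, {"pos": pos, "count": 0, "children": {}})
            node["count"] += 1
    return tree

def divergence_nodes(node, path=()):
    """Yield the points where the group of trajectories splits,
    with the traffic going down each branch."""
    if len(node["children"]) > 1:
        yield path, {p: c["count"] for p, c in node["children"].items()}
    for pos, child in node["children"].items():
        yield from divergence_nodes(child, path + (pos,))

trajs = [("A", "B", "C"), ("A", "B", "D"), ("A", "B", "C")]
print(list(divergence_nodes(build_flow_tree(trajs))))
# -> [(('A', 'B'), {'C': 2, 'D': 1})]: the group diverges after A, B
```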

512Oracle efficient truncated statistics

[openreview] [pdf]

Abstract We study the problem of learning from truncated samples: instead of observing samples from some underlying population $p^\ast$, we observe only the examples that fall in some survival set $S \subset \mathbb{R}^d$ whose probability mass (measured with respect to $p^\ast$) is at least $\alpha$. Assuming membership oracle access to the truncation set $S$, prior works obtained algorithms for the case where $p^\ast$ is Gaussian or more generally an exponential family with strongly convex likelihood --- albeit with a super-polynomial dependency on the (inverse) survival mass $1/\alpha$ both in terms of runtime and in number of oracle calls to the set $S$. In this work we design a new learning method with runtime and query complexity polynomial in $1/\alpha$. Our result significantly improves over the prior works by focusing on efficiently solving the underlying optimization problem using a general purpose optimization algorithm with minimal assumptions.

513Model Collapse in the Chain of Diffusion Finetuning: A Novel Perspective from Quantitative Trait Modeling

[openreview] [pdf]

Abstract The success of generative models has reached a unique threshold where their outputs are indistinguishable from real data, leading to the inevitable contamination of future data collection pipelines with synthetic data. While their potential to generate infinite samples initially offers promise for reducing data collection costs and addressing challenges in data-scarce fields, the severe degradation in performance has been observed when iterative loops of training and generation occur---known as “model collapse.” This paper explores a practical scenario in which a pretrained text-to-image diffusion model is finetuned using synthetic images generated from a previous iteration, a process we refer to as the “Chain of Diffusion.” We first demonstrate the significant degradation in image qualities caused by this iterative process and identify the key factor driving this decline through rigorous empirical investigations. Drawing on an analogy between the Chain of Diffusion and biological evolution, we then introduce a novel theoretical analysis based on quantitative trait modeling. Our theoretical analysis aligns with empirical observations of the generated images in the Chain of Diffusion. Finally, we propose Reusable Diffusion Finetuning (ReDiFine), a simple yet effective strategy inspired by genetic mutations. ReDiFine mitigates model collapse without requiring any hyperparameter tuning, making it a plug-and-play solution for reusable image generation.

514Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning

[openreview] [pdf]

Abstract Recent studies have demonstrated that Transformers can perform in-context reinforcement learning (RL) by imitating a source RL algorithm. This enables them to adapt to new tasks in a sample-efficient manner without parameter updates. However, since the Transformers are trained to mimic the source algorithm, they also reproduce its suboptimal behaviors. Model-based planning offers a promising solution to this limitation by allowing the agents to simulate potential outcomes before taking action, providing an additional mechanism to deviate from the source algorithm’s behavior. Rather than learning a separate dynamics model, we propose Distillation for In-Context Planning (DICP), an in-context model-based RL framework where the Transformer simultaneously learns environment dynamics and improves policy in-context. With experiments across a diverse set of discrete and continuous environments such as Darkroom variants and Meta-World, we show that this method achieves state-of-the-art performance, requiring significantly fewer environmental interactions than the baselines including both in-context model-free counterparts and existing meta-RL methods.

515Exploring Complex Trade-offs in Information Bottleneck through Multi-Objective Optimization

[openreview] [pdf]

Abstract Information Bottleneck (IB) theory provides a principled approach to analyze and optimize how neural networks extract and learn latent representations from data, aiming to enhance network performance and generalization. The IB framework has been applied and validated across various domains in deep learning. However, most studies employing IB require tuning of Lagrange multipliers to balance compression and prediction during optimization. Finding the optimal Lagrange multiplier β to achieve the best balance between compression and prediction is challenging, relying heavily on empirical tuning and potentially failing to capture the complex trade-offs present within the IB paradigm. In this paper, we redefine the IB problem as a multi-objective optimization problem with respect to compression and prediction objectives. We employ a gradient-based multi-objective optimization algorithm that adaptively determines the weights for this optimization challenge. Our method is demonstrated to automatically find Pareto-optimal solutions, achieving a balance between compression and prediction, and exploring more complex Pareto frontiers than linear weighting. We compare our approach with the Variational Information Bottleneck and its variants across different datasets. Empirical results confirm that our method achieves a more stable and optimal trade-off compared to Information Bottleneck approaches with manually-tuned multipliers. The code is available at https://anonymous.4open.science/r/ASDGASDG.
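
One standard gradient-based multi-objective step of the kind the abstract invokes is the two-task min-norm (MGDA-style) combination, which has a closed form. The sketch below is a generic illustration of adaptive weighting between a compression gradient and a prediction gradient, not the paper's specific algorithm; all names are mine.

```python
import numpy as np

def min_norm_weight(g_comp, g_pred):
    """Closed-form min-norm weight for two objectives: minimize
    ||a * g_comp + (1 - a) * g_pred||^2 over a in [0, 1]. The resulting
    direction is a descent direction for both objectives when one exists."""
    diff = g_comp - g_pred
    denom = float(diff @ diff)
    if denom == 0.0:
        return 0.5                       # gradients coincide; any weight works
    alpha = float((g_pred - g_comp) @ g_pred) / denom
    return min(max(alpha, 0.0), 1.0)     # clip to the simplex

g_c = np.array([1.0, 0.0])               # toy compression gradient
g_p = np.array([0.0, 2.0])               # toy prediction gradient
a = min_norm_weight(g_c, g_p)
step = a * g_c + (1 - a) * g_p           # combined descent direction
print(a, step)                           # -> 0.8 [0.8 0.4]
```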

516Mostly Exploration-free Algorithms for Multi-Objective Linear Bandits

[openreview] [pdf]

Abstract We address the challenge of solving multi-objective bandit problems, which are increasingly relevant in real-world applications where multiple possibly conflicting objectives must be optimized simultaneously. Existing multi-objective algorithms often rely on complex, computationally intensive methods, making them impractical for real-world use. In this paper, we propose a novel perspective by showing that objective diversity can naturally induce free exploration, allowing for simpler, near-greedy algorithms to achieve state-of-the-art regret bounds. We introduce simple and efficient algorithms for multi-objective linear bandits, which do not require constructing empirical Pareto fronts and achieve a regret bound of $\tilde{\mathcal{O}}(\sqrt{dT})$ under sufficient objective diversity and suitable regularity. We also introduce the concept of objective fairness, ensuring equal treatment of all objectives, and show that our algorithms satisfy this criterion. Numerical experiments validate our theoretical findings, demonstrating that objective diversity can enhance algorithm performance while simplifying the solution process.
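
To make the "objective diversity induces free exploration" idea concrete, here is a toy near-greedy loop: maintain one ridge estimate per objective and act greedily for a randomly drawn objective each round, so the objectives themselves supply the exploration greediness alone would lack. The loop structure and all constants are my illustration, not the paper's algorithm.

```python
import numpy as np

def near_greedy_mo_linear_bandit(arms, theta_true, T, lam=1.0, noise=0.1, seed=0):
    """Toy near-greedy multi-objective linear bandit: shared design matrix,
    one ridge estimate per objective, greedy action for a rotating objective."""
    rng = np.random.default_rng(seed)
    d, m = arms.shape[1], theta_true.shape[0]       # dimension, num objectives
    A = lam * np.eye(d)                             # shared regularized Gram matrix
    b = np.zeros((m, d))                            # per-objective responses
    for _ in range(T):
        theta_hat = np.linalg.solve(A, b.T).T       # (m, d) ridge estimates
        j = rng.integers(m)                         # objective fairness: random pick
        x = arms[np.argmax(arms @ theta_hat[j])]    # greedy for objective j
        y = theta_true @ x + noise * rng.standard_normal(m)
        A += np.outer(x, x)
        b += y[:, None] * x[None, :]
    return theta_hat

arms = np.random.default_rng(1).standard_normal((20, 5))
theta = np.random.default_rng(2).standard_normal((3, 5))
print(near_greedy_mo_linear_bandit(arms, theta, T=500).round(2))
```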

517Learning Generalizable and Well-Shaped Reward Functions from Too Few Demonstrations

[openreview] [pdf]

Abstract Inverse reinforcement learning (IRL) is an important problem that aims to learn a reward function and policy directly from demonstrations, which can often be easier to provide than a well-shaped reward function. However, many real-world tasks include natural variations (i.e., a cleaning robot in a house with different furniture configurations), making it costly to provide demonstrations of every possible scenario. We tackle the problem of few-shot IRL with multi-task data, where the goal is for an agent to learn from a few demonstrations that are not sufficient to fully specify the task, by utilizing an offline multi-task demonstration dataset. Prior work relies on meta-learning or imitation learning, which additionally requires reward labels or a multi-task training environment, or cannot improve with online interactions. We propose Multitask Discriminator Proximity-guided IRL (MPIRL), an IRL method that learns a generalizable and well-shaped reward function by learning a multi-task generative adversarial discriminator with an auxiliary proximity-to-expert reward. We demonstrate the effectiveness of our method on multiple navigation and manipulation tasks.

518Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior

[openreview] [pdf]

Abstract Trajectory inference seeks to recover the temporal dynamics of a population from snapshots of its (uncoupled) temporal marginals, i.e. where observed particles are not tracked over time. Lavenant et al. (2023) addressed this challenging problem under a stochastic differential equation (SDE) model with a gradient-driven drift in the observed space, introducing a minimum entropy estimator relative to the Wiener measure. Chizat et al. (2022) then provided a practical grid-free mean-field Langevin (MFL) algorithm using Schrödinger bridges. Motivated by the overwhelming success of observable state space models in the traditional paired trajectory inference problem (e.g. target tracking), we extend the above framework to a class of latent SDEs in the form of observable state space models. In this setting, we use partial observations to infer trajectories in the latent space under a specified dynamics model (e.g. the constant velocity/acceleration models from target tracking). We introduce PO-MFL to solve this latent trajectory inference problem and provide theoretical guarantees by extending the results of Lavenant et al. (2023) to the partially observed setting. We leverage the MFL framework of Chizat et al. (2022), yielding an algorithm based on entropic OT between dynamics-adjusted adjacent time marginals. Experiments validate the robustness of our method and the exponential convergence of the MFL dynamics, and demonstrate significant outperformance over the latent-free method of Chizat et al. (2022) in key scenarios.

[openreview] [pdf]

Abstract Maximum Inner Product Search (MIPS) is essential for machine learning and information retrieval, particularly in applications that operate on high-dimensional data, such as recommender systems and retrieval-augmented generation (RAG), using inner product or cosine similarity. While numerous techniques have been developed for efficient MIPS, their performance often suffers due to a limited understanding of the geometric properties of Inner Product (IP) space. Many approaches reduce MIPS to Nearest Neighbor Search (NNS) through nonlinear transformations, which rely on strong assumptions and can hinder performance. To address these limitations, we propose a novel approach that directly leverages the geometry of IP space. We focus on a class of special vectors called dominators and introduce the Monotonic Relative Dominator Graph (MRDG), an IP-space-native, sparse, and strongly-connected graph designed for efficient MIPS, with solid theoretical foundations. To ensure scalability, we further introduce the Approximate Relative Dominator Graph (ARDG), which retains MRDG’s benefits while significantly reducing indexing complexity. Extensive experiments on 8 public datasets demonstrate that ARDG achieves a 30% average speedup in search at high precision and reduces index size by 2x compared to state-of-the-art graph-based methods.

520IO-LVM: Inverse optimization latent variable models with applications to inferring and explaining paths

[openreview] [pdf]

Abstract Learning representations from solutions of constrained optimization problems (COPs) with unknown cost functions is challenging, as models like (Variational) Autoencoders struggle to capture constraints to decode structured outputs. We propose an inverse optimization latent variable model (IO-LVM) that constructs a latent space of COP costs based on observed decisions, enabling the inference of feasible and meaningful solutions by reconstructing them with a COP solver. To achieve this, we leverage estimated gradients of a Fenchel-Young loss through a non-differentiable deterministic solver while shaping the embedding space. In contrast to established Inverse Optimization or Inverse Reinforcement Learning methods, which typically identify a single or context-conditioned cost function, we exploit the learned representation to capture underlying COP cost structures and identify solutions likely originating from different agents, each using distinct or slightly different cost functions when making decisions. Using both synthetic and actual ship routing data, we validate our approach through experiments on path planning problems using the Dijkstra algorithm, demonstrating the interpretability of the latent space and its effectiveness in path inference and path distribution reconstruction.

521Evolving Multi-Scale Normalization for Time Series Forecasting under Distribution Shifts

[openreview] [pdf]

Abstract Complex distribution shifts are the main obstacle to achieving accurate long-term time series forecasting. Several efforts have been conducted to capture the distribution characteristics and propose adaptive normalization techniques to alleviate the influence of distribution shifts. However, these methods neglect intricate distribution dynamics that are observed from various scales and the evolving functions of both distribution dynamics and normalized mapping relationships. To this end, we propose a novel model-agnostic Evolving Multi-Scale Normalization (EvoMSN) framework to tackle the distribution shift problem. Flexible normalization and denormalization are proposed based on the multi-scale statistics prediction module and adaptive ensembling. An evolving optimization strategy is designed to update the forecasting model and statistics prediction module collaboratively to track the shifting distributions. We evaluate the effectiveness of EvoMSN in improving the performance of five mainstream forecasting methods on benchmark datasets and also show its superiority compared to existing advanced normalization and online learning approaches.

522Federated Maximum Likelihood Inverse Reinforcement Learning with Convergence Guarantee

[openreview] [pdf]

Abstract Inverse Reinforcement Learning (IRL) aims to recover the latent reward function and corresponding optimal policy from observed demonstrations. Existing IRL research predominantly focuses on a centralized learning approach, not suitable for real-world problems with distributed data and privacy restrictions. To this end, this paper proposes a novel algorithm for federated maximum-likelihood IRL (F-ML-IRL) and provides a rigorous analysis of its convergence and time-complexity. The proposed F-ML-IRL leverages a dual-aggregation to update the shared global model and performs bi-level local updates -- an upper-level learning task to optimize the parameterized reward function by maximizing the discounted likelihood of observing expert trajectories under the current policy and a lower-level learning task to find the optimal policy concerning the entropy-regularized discounted cumulative reward under the current reward function. We analyze the convergence and time-complexity of the proposed F-ML-IRL algorithm and show that the global model in F-ML-IRL converges to a stationary point for both the reward and policy parameters within finite time, i.e., the log-distance between the recovered policy and the optimal policy, as well as the gradient of the likelihood objective, converge to zero. Finally, evaluating our F-ML-IRL algorithm on high-dimensional robotic control tasks in MuJoCo, we show that it ensures convergence of the recovered reward in decentralized learning and even outperforms centralized baselines due to its ability to utilize distributed data.

523Is Large-scale Pretraining the Secret to Good Domain Generalization?

[openreview] [pdf]

Abstract Multi-Source Domain Generalization (DG) is the task of training on multiple source domains and achieving high classification performance on unseen target domains. Recent methods combine robust features from web-scale pretrained backbones with new features learned from source data, and this has dramatically improved benchmark results. However, it remains unclear if DG finetuning methods are becoming better over time, or if improved benchmark performance is simply an artifact of stronger pre-training. Prior studies have shown that perceptual similarity to pre-training data correlates with zero-shot performance, but we find the effect limited in the DG setting. Instead, we posit that having perceptually similar data in pretraining is not enough; and that it is how well these data were learned that determines performance. This leads us to introduce the Alignment Hypothesis, which states that the final DG performance will be high if and only if alignment of image and class label text embeddings is high. Our experiments confirm the Alignment Hypothesis is true, and we use it as an analysis tool of existing DG methods evaluated on DomainBed datasets by splitting evaluation data into In-pretraining (IP) and Out-of-pretraining (OOP). We show that all evaluated DG methods struggle on DomainBed-OOP, while recent methods excel on DomainBed-IP. Put together, our findings highlight the need for DG methods which can generalize beyond pretraining alignment.

524FunBO: Discovering Acquisition Functions for Bayesian Optimization with FunSearch

[openreview] [pdf]

Abstract The sample efficiency of Bayesian optimization algorithms depends on carefully crafted acquisition functions (AFs) guiding the sequential collection of function evaluations. The best-performing AF can vary significantly across optimization problems, often requiring ad-hoc and problem-specific choices. This work tackles the challenge of designing novel AFs that perform well across a variety of experimental settings. Based on FunSearch, a recent work using Large Language Models (LLMs) for discovery in mathematical sciences, we propose FunBO, an LLM-based method that can be used to learn new AFs written in computer code by leveraging access to a limited number of evaluations for a set of objective functions. We provide the analytic expression of all discovered AFs and evaluate them on various global optimization benchmarks and hyperparameter optimization tasks. We show how FunBO identifies AFs that generalize well in and out of the training distribution of functions, thus outperforming established general-purpose AFs and achieving competitive performance against AFs that are customized to specific function types and are learned via transfer-learning algorithms.

525Optimizing Knowledge Distillation in Transformers: Enabling Power of Multi-Head Attention without Alignment Barriers

[openreview] [pdf]

Abstract Knowledge distillation has been proven effective for compressing transformer architectures by transferring knowledge from teacher to student models. Logits-based methods of knowledge distillation cannot fully capture the intermediate representations and features within the teacher model, which may result in the student model not fully learning all the knowledge from the teacher model. Thus, previous work focuses on transferring knowledge through intermediate features or attention maps. However, leveraging multi-head attention maps in transformers for knowledge distillation presents challenges due to head misalignment and suboptimal feature alignment, often requiring projectors to align features or special modifications to the model architecture. To address the above limitations, we propose the Squeezing-Heads Distillation (SHD) method. This method reduces the number of attention maps to any desired number through linear approximation, without requiring additional projectors or parameters. This facilitates better alignment and knowledge transfer between models with different numbers of heads, enhancing both flexibility and efficiency. Experimental results demonstrate significant improvements in both language and vision generative models, validating the effectiveness of our method.
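
The head-squeezing idea can be sketched in a few lines: combine H teacher attention maps into H' maps by a linear operation so they align with the student's heads. The uniform grouping below is a placeholder for the paper's linear approximation, which the abstract does not fully specify; treat it as illustrative only.

```python
import torch

def squeeze_heads(teacher_attn, num_student_heads):
    """Reduce H teacher attention maps to H' by linear combination (sketch).
    teacher_attn: (batch, H, seq, seq) attention probabilities.
    Here the mixing weights are a fixed uniform grouping; SHD's actual
    linear approximation is more general."""
    B, H, S, _ = teacher_attn.shape
    Hp = num_student_heads
    assert H % Hp == 0, "illustrative version assumes divisible head counts"
    group = H // Hp
    # average each group of teacher heads into one squeezed map
    return teacher_attn.view(B, Hp, group, S, S).mean(dim=2)

attn = torch.softmax(torch.randn(2, 12, 16, 16), dim=-1)
print(squeeze_heads(attn, 4).shape)   # torch.Size([2, 4, 16, 16])
```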

526Machine Unlearning For Alleviating Negative Transfer In Partial-Set Source-Free Unsupervised Domain Adaptation

[openreview] [pdf]

Abstract Source-free Unsupervised Domain Adaptation (SFUDA) aims to adjust a source model trained on a labeled source domain to a related but unlabeled target domain without accessing the source data. Many SFUDA methods are studied in closed-set scenarios where the target domain and source domain categories are perfectly aligned. However, a more practical scenario is a partial-set scenario where the source label space subsumes the target one. In this paper, we prove that reducing the differences between the source and target domains in the partial-set scenario helps to achieve domain adaptation. And we propose a simple yet effective SFUDA framework called the Machine Unlearning Framework to alleviate the negative transfer problem in the partial-set scenario, thereby allowing the model to focus on the target domain categories. Specifically, we first generate noise samples for each category that only exists in the source domain and generate pseudo-labeled samples from the target domain. Then, in the forgetting stage, we use these samples to train the model, making it behave as if it had never seen the classes that exist only in the source domain. Finally, in the adaptation stage, we use only the pseudo-labeled samples to conduct self-supervised training on the model, making it more adaptable to the target domain. Our method is easy to implement and pluggable, suitable for various pre-trained models. Experimental results show that our method can well alleviate the negative transfer problem and improve model performance under various target domain category settings.

527Propensity-driven Uncertainty Learning for Sample Exploration in Source-Free Active Domain Adaptation

[openreview] [pdf]

Abstract Source-free active domain adaptation (SFADA) addresses the challenge of adapting a pre-trained model to new domains without access to source data while minimizing the need for target domain annotations. This scenario is particularly relevant in real-world applications where data privacy, storage limitations, or labeling costs are significant concerns. Key challenges in SFADA include selecting the most informative samples from the target domain for labeling, effectively leveraging both labeled and unlabeled target data, and adapting the model without relying on source domain information. Additionally, existing methods often struggle with noisy or outlier samples and may require impractical progressive labeling during training. To effectively select more informative samples without frequently requesting human annotations, we propose the Propensity-driven Uncertainty Learning (ProULearn) framework. ProULearn utilizes a novel homogeneity propensity estimation mechanism combined with correlation index calculation to evaluate feature-level relationships. This approach enables the identification of representative and challenging samples while avoiding noisy outliers. Additionally, we develop a central correlation loss to refine pseudo-labels and create compact class distributions during adaptation. In this way, ProULearn effectively bridges the domain gap and maximizes adaptation performance. The principles of informative sample selection underlying ProULearn have broad implications beyond SFADA, offering benefits across various deep learning tasks where identifying key data points or features is crucial. Extensive experiments on four benchmark datasets demonstrate that ProULearn consistently outperforms state-of-the-art methods in domain adaptation scenarios.

528Influential Language Data Selection via Gradient Trajectory Pursuit

[openreview] [pdf]

Abstract Curating a desirable dataset for training has been the core of building highly capable large language models (Touvron et al., 2023; Achiam et al., 2023; Team et al., 2024). Gradient influence scores (Pruthi et al., 2020; Xia et al., 2024) have been shown to be correlated with model performance and are commonly used as the criterion for data selection. However, existing methods are built upon either individual sample rankings or an inefficient matching process, leading to suboptimal performance or scaling issues. In this paper, we propose Gradient Trajectory Pursuit (GTP), an algorithm that performs pursuit of gradient trajectories via jointly selecting data points under an L0-norm regularized objective. The proposed algorithm highlights: (1) joint selection instead of independent top-k selection, which automatically de-duplicates samples; (2) higher efficiency with compressive sampling processes, which can be further sped up using a distributed framework. In the experiments, we demonstrate the algorithm in both in-domain and target-domain selection benchmarks and show that it outperforms top-k selection and competitive algorithms consistently; for example, our algorithm selects as little as 0.5% of the data to achieve full performance on the targeted instruction tuning tasks.
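
Joint selection under an L0-norm budget is closely related to matching pursuit, which makes the de-duplication property easy to see: once a sample is selected, only the residual it fails to explain drives later picks. The sketch below runs a plain orthogonal-matching-pursuit loop over per-sample gradient features; it is a surrogate for GTP's actual objective and sampling machinery, with all names invented.

```python
import numpy as np

def gradient_trajectory_pursuit(G, target, k):
    """Greedily select k rows of G (one per candidate sample) whose span
    best reconstructs a target gradient direction (OMP-style sketch)."""
    selected, residual = [], target.copy()
    for _ in range(k):
        scores = G @ residual                  # correlation with the residual
        scores[selected] = -np.inf             # joint selection de-duplicates
        selected.append(int(np.argmax(scores)))
        A = G[selected].T                      # refit on the chosen set
        coef, *_ = np.linalg.lstsq(A, target, rcond=None)
        residual = target - A @ coef           # what remains unexplained
    return selected

rng = np.random.default_rng(0)
G = rng.standard_normal((100, 32))             # per-sample gradient features
target = G[3] + 0.5 * G[40]                    # planted combination
print(gradient_trajectory_pursuit(G, target, k=2))  # likely [3, 40]
```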

529Counterfactual Learning under Rank Preservation

[openreview] [pdf]

Abstract Counterfactual inference aims to estimate the counterfactual outcome given knowledge of an observed treatment and the factual outcome, with broad applications in fields such as epidemiology, econometrics, and management science. In this paper, we propose a principled approach for identifying and estimating the counterfactual outcome. Specifically, we introduce a simple and intuitive rank preservation assumption to identify the counterfactual outcome without relying on a known structural causal model. Building on this, we propose a novel ideal loss for theoretically unbiased learning of the counterfactual outcome and further develop a kernel-based estimator for its empirical estimation. Our theoretical analysis shows that the proposed ideal loss is convex, and the proposed estimator is unbiased. Extensive semi-synthetic and real-world experiments are conducted to demonstrate the effectiveness of the proposed method.

530FAST: Federated Average with Snapshot Unleashes Arbitrary Client Participation

[openreview] [pdf]

Abstract Federated Learning (FL) provides a flexible distributed platform where numerous clients with high degrees of heterogeneity in data and system can collaborate to learn a model jointly. Previous research has shown that Federated Learning is effective in handling diverse data, but often assumes idealized conditions. Specifically, client participation is often simplified in these studies, while real-world factors make it difficult to predict or design individual client participation. This complexity often diverges from the ideal client participation assumption, rendering an unknown pattern of client participation, referred to as arbitrary client participation. Hence, it is an important open problem to explore the impact of client participation and find a lightweight mechanism to enable arbitrary client participation in FL. In this paper, we first empirically investigate the influence of client participation on FL, revealing that FL algorithms are significantly impacted by arbitrary client participation. Afterward, to alleviate this influence, we propose a lightweight solution, Federated Average with Snapshot (FAST), to unleash almost arbitrary client participation for FL. It can seamlessly integrate with other classic FL algorithms. Specifically, FAST requires the clients to take a snapshot once in a while and facilitates arbitrary client participation for the majority of the training process. We show the convergence rates of FAST in non-convex and strongly-convex cases, which match the rates under ideal client participation. Furthermore, we empirically introduce an adaptive strategy for dynamically configuring the snapshot frequency, tailored to accommodate diverse FL systems. Our extensive numerical results demonstrate that our FAST algorithm attains significant improvements under the conditions of arbitrary client participation and highly heterogeneous data.
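The snapshot mechanism is simple enough to sketch. Below is a hypothetical simulation loop, assuming each client object exposes a `local_train(model)` method returning an updated model copy: most rounds accept whatever clients show up, and every `snapshot_every` rounds all clients contribute a snapshot update.

```python
import random
import torch

def fedavg_state(models):
    """Average the state dicts of the participating client models."""
    keys = models[0].state_dict().keys()
    return {k: torch.stack([m.state_dict()[k].float() for m in models]).mean(0)
            for k in keys}

def fast_training(server, clients, rounds, snapshot_every=10):
    """FAST-style sketch: arbitrary participation most rounds, periodic
    all-client snapshot rounds as the hypothetical anchoring mechanism."""
    for t in range(rounds):
        if t % snapshot_every == 0:
            active = clients                                  # snapshot round
        else:
            active = random.sample(clients, random.randint(1, len(clients)))
        updates = [c.local_train(server) for c in active]     # hypothetical client API
        server.load_state_dict(fedavg_state(updates))
    return server
```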

531VideoPanda: Video Panoramic Diffusion With Multi-view Attention

[openreview] [pdf]

Abstract High resolution panoramic video content is paramount for immersive experiences in Virtual Reality, but is non-trivial to collect as it requires specialized equipment and intricate camera setups. In this work, we introduce VideoPanda, a novel approach for synthesizing 360° videos conditioned on text or single-view video data. VideoPanda leverages multi-view attention layers to augment a video diffusion model, enabling it to generate consistent multi-view videos that can be combined into immersive panoramic content. VideoPanda is trained jointly using two conditions: text-only and single-view video, and supports autoregressive generation of long videos. To overcome the computational burden of multi-view video generation, we randomly subsample the duration and camera views used during training and show that the model is able to gracefully generalize to generating more frames during inference. Extensive evaluations on both real-world and synthetic video datasets demonstrate that VideoPanda generates more realistic and coherent 360° panoramas across all input conditions compared to existing methods. Visit the project website at https://mvpanovideo.github.io/VideoPanda/ for results.

532Many-Objective Multi-Solution Transport

[openreview] [pdf]

Abstract Optimizing the performance of many objectives (instantiated by tasks or clients) jointly with a few Pareto stationary solutions (models) is critical in machine learning. However, previous multi-objective optimization methods often focus on a few objectives and cannot scale to many objectives that outnumber the solutions, leading to either subpar performance or ignored objectives. We introduce "Many-objective multi-solution Transport (MosT)", a framework that finds multiple diverse solutions in the Pareto front of many objectives. Our insight is to seek multiple solutions, each performing as a domain expert and focusing on a specific subset of objectives while collectively covering all of them. MosT formulates the problem as a bi-level optimization of weighted objectives for each solution, where the weights are defined by an optimal transport between objectives and solutions. Our algorithm ensures convergence to Pareto stationary solutions for complementary subsets of objectives. On a range of applications in federated learning, multi-task learning, and mixture-of-prompt learning for LLMs, MosT distinctly outperforms strong baselines, delivering high-quality, diverse solutions that profile the entire Pareto frontier, thus ensuring balanced trade-offs across many objectives.

533Taming Continuous Spurious Shift in Domain Adaptation

[openreview] [pdf]

Abstract Recent advances in domain adaptation have shown promise in transferring knowledge across domains characterized by a continuous value or vector, such as varying patient ages, where "age" serves as a continuous index. However, these approaches often fail when spurious features shift continuously along with the domain index. This paper introduces the first method designed to withstand the continuous shifting of spurious features during domain adaptation. Our method enhances domain adaptation performance by aligning causally transportable encodings across continuously indexed domains. Theoretical analysis demonstrates that our approach more effectively ensures causal transportability across different domains. Empirical results, from both semi-synthetic and real-world medical datasets, indicate that our method outperforms state-of-the-art domain adaptation methods.

534Understanding Constraint Inference in Safety-Critical Inverse Reinforcement Learning

[openreview] [pdf]

Abstract In practical applications, the underlying constraint knowledge is often unknown and difficult to specify. To address this issue, recent advances in Inverse Constrained Reinforcement Learning (ICRL) have focused on inferring these constraints from expert demonstrations. However, the ICRL approach typically characterizes constraint learning as a tri-level optimization problem, which is inherently complex due to its interdependent variables and multiple layers of optimization. Considering these challenges, a critical question arises: Can we implicitly embed constraint signals into reward functions and effectively solve this problem using a classic reward inference algorithm? The resulting method, known as Inverse Reward Correction (IRC), merits investigation. In this work, we conduct a theoretical analysis comparing the sample complexities of both solvers. Our findings confirm that the IRC solver achieves lower sample complexity than its ICRL counterpart. Nevertheless, this reduction in complexity comes at the expense of generalizability. Specifically, in the target environment, the reward correction terms may fail to guarantee the safety of the resulting policy, whereas this issue can be effectively mitigated by transferring the constraints via the ICRL solver. Advancing our inquiry, we investigate conditions under which the ICRL solver ensures ε-optimality when transferring to new environments. Empirical results across various environments validate our theoretical findings, underscoring the nuanced trade-offs between complexity reduction and generalizability in safety-critical applications.

535Boosting LLM Translation Skills without General Ability Loss via Rationale Distillation

[openreview] [pdf]

Abstract Large Language Models (LLMs) have achieved impressive results across numerous NLP tasks but still encounter difficulties in machine translation. Traditional methods to improve translation have typically involved fine-tuning LLMs using parallel corpora. However, vanilla fine-tuning often leads to catastrophic forgetting of the instruction-following capabilities and alignment with human preferences, compromising their broad general abilities and introducing potential security risks. These abilities, which are developed using proprietary and unavailable training data, make existing continual instruction tuning methods ineffective. To overcome this issue, we propose a novel approach called RaDis (Rationale Distillation). RaDis harnesses the strong generative capabilities of LLMs to create rationales for training data, which are then "replayed" to prevent forgetting. These rationales encapsulate general knowledge and safety principles and act as self-distillation targets to regulate the training process. By jointly training on both reference translations and self-generated rationales, the model can learn new translation skills while preserving its overall general abilities. Extensive experiments demonstrate that our method enhances machine translation performance while maintaining the broader capabilities of LLMs across other tasks. This work presents a pathway for creating more versatile LLMs that excel in specialized tasks without compromising generality and safety.
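The replay idea can be sketched in a few lines. The helper below is hypothetical (the prompt wording, the `generate_text` call, and the output format are all assumptions): the pre-finetuning model writes a rationale for each parallel pair, and the training target becomes the reference translation concatenated with that self-generated rationale, so the ordinary language-modeling loss doubles as self-distillation.

```python
def build_radis_example(base_model, source, reference):
    """Hypothetical RaDis-style data builder: attach a self-generated
    rationale to the reference translation as the training target."""
    prompt = (f"Translate into English: {source}\n"
              f"Translation: {reference}\n"
              f"Briefly explain why this translation is appropriate:")
    rationale = base_model.generate_text(prompt)    # hypothetical generation API
    return {
        "input": f"Translate into English: {source}",
        "output": f"{reference}\n\nRationale: {rationale}",  # replayed jointly
    }
```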

536Early Period of Training Impacts Adaptation for Out-of-Distribution Generalization: An Empirical Study

[openreview] [pdf]

Abstract Prior research shows that differences in the early period of neural network training significantly impact performance on in-distribution (ID) data. Yet, the implications of early learning dynamics for out-of-distribution (OOD) generalization remain poorly understood, primarily due to the complexities and limitations of existing analytical techniques. In this work, we investigate the relationship between learning dynamics, OOD generalization under covariate shift, and the early period of neural network training. We utilize the trace of Fisher Information and sharpness, focusing on gradual unfreezing (i.e., progressively unfreezing parameters during training) as our methodology for investigation. Through a series of empirical experiments, we show that 1) changing the number of trainable parameters during the early period of training via gradual unfreezing can significantly improve OOD results; 2) the trace of Fisher Information and sharpness can be used as indicators for the removal of interventions during the early period of training for better OOD generalization. Our experiments on both image and text data show that the early period of training is a general phenomenon that can provide Pareto improvements in ID and OOD performance with minimal complexity. Our work represents a first step towards understanding how early learning dynamics affect neural network OOD generalization and suggests a new avenue to improve and study this problem.
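Both ingredients, gradual unfreezing and the Fisher-trace indicator, are easy to prototype. The sketch below is a plausible PyTorch rendition under simple assumptions: layers unfreeze back-to-front at a fixed interval, and the Fisher trace is approximated by the squared gradient norm of the loss.

```python
import torch

def apply_gradual_unfreezing(layers, step, unfreeze_every=100):
    """Keep only the last n_open layers trainable, opening one more
    (back to front) every `unfreeze_every` steps."""
    n_open = min(len(layers), 1 + step // unfreeze_every)
    for i, layer in enumerate(layers):
        trainable = i >= len(layers) - n_open
        for p in layer.parameters():
            p.requires_grad_(trainable)

def fisher_trace(model, loss):
    """Crude empirical Fisher trace: sum of squared gradients of the loss,
    usable as an indicator for when to stop the unfreezing intervention."""
    params = [p for p in model.parameters() if p.requires_grad]
    grads = torch.autograd.grad(loss, params, retain_graph=True)
    return sum(g.pow(2).sum() for g in grads)
```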

537Prompt Diffusion Robustifies Any-Modality Prompt Learning

[openreview] [pdf]

Abstract Foundation models enable prompt-based classifiers for zero-shot and few-shot learning. Nonetheless, the conventional method of employing fixed prompts suffers from distributional shifts that negatively impact generalizability to unseen samples. This paper introduces prompt diffusion, which uses a diffusion model to gradually refine prompts to obtain a customized prompt for each sample. Specifically, we first optimize a collection of prompts to obtain over-fitted prompts per sample. Then, we propose a prompt diffusion model within the prompt space, enabling the training of a generative transition process from a random prompt to its overfitted prompt. As we cannot access the label of a test image during inference, our model gradually generates customized prompts solely from random prompts using our trained prompt diffusion. Our prompt diffusion is generic, flexible, and modality-agnostic, making it a simple plug-and-play module seamlessly embedded into existing prompt learning methods for textual, visual, or multi-modal prompt learning. Our diffusion model uses a fast ODE-based sampling strategy to optimize test sample prompts in just five steps, offering a good trade-off between performance improvement and computational efficiency. For all prompt learning methods tested, adding prompt diffusion yields more robust results for base-to-new generalization, cross-dataset generalization, and domain generalization in classification tasks tested over 15 diverse datasets.

538Is the Fairness Metric Truly Fair?

[openreview] [pdf]

Abstract Image classification is a fundamental task in computer vision that has been widely adopted in critical applications such as face recognition and medical imaging, drawing considerable attention to its predictive fairness. Some researchers have proposed various fairness metrics and pipelines to enhance the fairness of deep learning models. However, recent studies indicate that existing fairness evaluation specifications and metrics have inherent flaws, as they focus on low-dimensional inputs, such as numerical data, and overlook partial correlations between target and sensitive attributes, leading to some degree of mutual exclusivity. This raises the question: Is the fairness metric truly fair? Through in-depth analysis and experiments, we conclude that the fairness of deep models is closely related to attribute sampling and the interdependencies among attributes. In this work, we address this challenge by introducing a new specification based on dynamic perturbation for image classification models. Specifically, we introduce an Attribute Projection Perturbation Strategy (APPS) that moves beyond the constraints of directly statistical discrete predictions by mapping sensitive attributes that may influence task attributes onto the same dimension for evaluation. Building on this, a Projection Fairness Metric System is proposed to quantify the upper and lower bounds of fairness perturbations, examining and evaluating the impact of mapped sensitive attributes on the fairness of task predictions from different perspectives. Additionally, we conducted systematic evaluation experiments and extensive discussions, demonstrating that the proposed evaluation specification offers better objectivity and interpretability compared to existing metrics, across 24 image classification models including CNN and ViT architectures. It is hoped that this work will promote the standardization of fairness evaluation pipelines and metrics.

539Do Influence Functions Work on Large Language Models?

[openreview] [pdf]

Abstract Influence functions aim to quantify the impact of individual training data points on a model’s predictions. While extensive research has been conducted on influence functions in traditional machine learning models, their application to large language models (LLMs) has been limited. In this work, we conduct a systematic study to address a key question: do influence functions work on LLMs? Specifically, we evaluate influence functions across multiple tasks and find that they consistently perform poorly in most settings. Our further investigation reveals that their poor performance can be attributed to: (1) inevitable approximation errors when estimating the iHVP component due to the scale of LLMs, (2) uncertain convergence during fine-tuning, and, more fundamentally, (3) the definition itself, as changes in model parameters do not necessarily correlate with changes in LLM behavior. Our study thus suggests the need for alternative approaches for identifying influential samples. To support future work, our code is made available at https://github.com/anonymous.

540Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization

[openreview] [pdf]

Abstract Direct Preference Optimization (DPO), and its numerous variants, are increasingly used for aligning language models. Although they are designed to teach a model to generate preferred responses more frequently relative to dispreferred responses, prior work has observed that the likelihood of preferred responses often decreases during training. The current work sheds light on the causes and implications of this counter-intuitive phenomenon, which we term likelihood displacement. We demonstrate that likelihood displacement can be catastrophic, shifting probability mass from preferred responses to semantically opposite ones. As a simple example, training a model to prefer No over Never can sharply increase the probability of Yes. Moreover, when aligning the model to refuse unsafe prompts, we show that such displacement can unintentionally lead to unalignment, by shifting probability mass from preferred refusal responses to harmful responses (e.g., reducing the refusal rate of Llama-3-8B-Instruct from 74.4% to 33.4%). We theoretically characterize that likelihood displacement is driven by preferences that induce similar embeddings, as measured by a centered hidden embedding similarity (CHES) score. Empirically, the CHES score enables identifying which training samples contribute most to likelihood displacement in a given dataset. Filtering out these samples effectively mitigated unintentional unalignment in our experiments. More broadly, our results highlight the importance of curating data with sufficiently distinct preferences, for which we believe the CHES score may prove valuable.
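The abstract gives enough to sketch a CHES-style diagnostic, though the paper's exact definition may differ; the version below is an assumption: summed last-layer token embeddings of the two responses, with the preferred response's squared norm as the centering term. Pairs scoring high would be the candidates to filter out.

```python
import torch

def ches_score(h_pref, h_dispref):
    """Hedged CHES-style similarity. h_pref: (T+, d) and h_dispref: (T-, d)
    hidden states of the preferred / dispreferred response tokens. Larger
    values suggest embeddings similar enough to drive likelihood displacement."""
    e_pos = h_pref.sum(dim=0)       # summed embeddings, preferred response
    e_neg = h_dispref.sum(dim=0)    # summed embeddings, dispreferred response
    return e_neg @ e_pos - e_pos @ e_pos  # centered by the preferred norm (assumption)
```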

541Towards counterfactual fairness through auxiliary variables

[openreview] [pdf]

Abstract The challenge of balancing fairness and predictive accuracy in machine learning models, especially when sensitive attributes such as race, gender, or age are considered, has motivated substantial research in recent years. Counterfactual fairness ensures that predictions remain consistent across counterfactual variations of sensitive attributes, which is a crucial concept in addressing societal biases. However, existing counterfactual fairness approaches usually overlook intrinsic information about sensitive features, limiting their ability to achieve fairness while simultaneously maintaining performance. To tackle this challenge, we introduce EXOgenous Causal reasoning (EXOC), a novel causal reasoning framework motivated by exogenous variables. It leverages auxiliary variables to uncover intrinsic properties that give rise to sensitive attributes. Our framework explicitly defines an auxiliary node and a control node that contribute to counterfactual fairness and control the information flow within the model. Our evaluation, conducted on synthetic and real-world datasets, validates EXOC’s superiority, showing that it outperforms state-of-the-art approaches in achieving counterfactual fairness without sacrificing accuracy.

542Broaden your SCOPE! Efficient Conversation Planning for LLMs with Semantic Space

[openreview] [pdf]

Abstract Large language models (LLMs) are used in chatbots or AI assistants to hold conversations with a human user. In such applications, the quality (e.g., user engagement, safety) of a conversation is important and can only be exactly known at the end of the conversation. To maximize its expected quality, conversation planning reasons about the stochastic transitions within a conversation to select the optimal LLM response at each turn. Existing simulation-based conversation planning algorithms typically select the optimal response by simulating future conversations with a large number of LLM queries at every turn. However, this process is extremely time-consuming and hence impractical for real-time conversations. This paper presents a novel approach called Semantic space COnversation Planning with improved Efficiency (SCOPE) that exploits the dense semantic representation of conversations to perform conversation planning efficiently. In particular, SCOPE models the stochastic transitions in conversation semantics and their associated rewards to plan entirely within the semantic space. This gives the advantage of allowing the optimal LLM response to be selected at every conversation turn without needing additional LLM queries for simulation. As a result, SCOPE can perform conversation planning 70 times faster than conventional simulation-based planning algorithms when applied to a wide variety of conversation starters and two reward functions seen in the real world, while achieving a higher reward within a practical planning budget.

543Hybrid Preference Optimization: Augmenting Direct Preference Optimization with Auxiliary Objectives

[openreview] [pdf]

Abstract For aligning large language models (LLMs), prior work has leveraged reinforcement learning via human feedback (RLHF) or variations of direct preference optimization (DPO). While DPO offers a simpler framework based on maximum likelihood estimation, it compromises on the ability to tune language models to easily maximize non-differentiable objectives according to the LLM designer’s preferences (e.g., using simpler language or minimizing specific kinds of harmful content). These may neither align with user preferences nor be tractably captured by binary preference data. To leverage the simplicity and performance of DPO with the generalizability of RL, we propose a hybrid approach between DPO and RLHF. With a simple augmentation to the implicit reward decomposition of DPO, we allow for tuning LLMs to maximize a set of arbitrary auxiliary rewards using offline RL. The proposed method, Hybrid Preference Optimization (HPO), shows the ability to effectively generalize to both user preferences and auxiliary designer objectives, while preserving alignment performance across a range of challenging benchmarks and model sizes.

544No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization

[openreview] [pdf]

Abstract In the bandits with knapsacks framework (BwK) the learner has m resource-consumption (i.e., packing) constraints. We focus on the generalization of BwK in which the learner has a set of general long-term constraints. The goal of the learner is to maximize their cumulative reward, while at the same time achieving small cumulative constraint violations. In this scenario, there exist simple instances where conventional methods for BwK fail to yield sublinear violations of constraints. We show that it is possible to circumvent this issue by requiring the primal and dual algorithms to be weakly adaptive. Indeed, even in the absence of any information on the Slater parameter ρ characterizing the problem, the interplay between weakly adaptive primal and dual regret minimizers yields a "self-bounding" property of dual variables. In particular, their norm remains suitably upper bounded across the entire time horizon even without explicit projection steps. By exploiting this property, we provide best-of-both-worlds guarantees for stochastic and adversarial inputs. In the first case, we show that the algorithm guarantees sublinear regret. In the latter case, we establish a tight competitive ratio of ρ/(1+ρ). In both settings, constraint violations are guaranteed to be sublinear in time. Finally, these results allow us to obtain new results for the problem of contextual bandits with linear constraints, providing the first no-α-regret guarantees for adversarial contexts.

545Reconstruct the Understanding of Grokking through Dynamical Systems

[openreview] [pdf]

Abstract Grokking, or the delayed generalization phenomenon, describes the abrupt and rapid improvement in test accuracy that occurs after a model has been overfitted for a prolonged period. This phenomenon was first identified by Power et al. in the context of operations on a prime number field. Over the past two years, a range of mathematical analyses has been conducted to investigate grokking, typically involving the use of a hidden progress measure, i.e., a function that can anticipate the occurrence of grokking. We believe that a comprehensive and rigorous mathematical modeling approach can invigorate research on this task and provide a unified perspective for understanding previous work. This paper introduces a novel approach by modeling the task as a unique dynamical system. Using mathematical derivation within this framework, we propose a robust hidden progress measure that effectively captures the grokking phenomenon across all operations on prime number fields. This approach not only provides a more complete understanding but also offers deeper insights into the underlying architecture of the model. Based on this understanding, we also propose a method to accelerate grokking without involving regularization or altering the model architecture.

546SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation

[openreview] [pdf]

Abstract The development of diffusion models has led to significant progress in image and video generation tasks, with pre-trained models like the Stable Diffusion series playing a crucial role. However, a key challenge remains in downstream task applications: how to effectively and efficiently adapt pre-trained diffusion models to new tasks. Inspired by model pruning, which lightens large pre-trained models by removing unimportant parameters, we propose a novel model fine-tuning method that makes full use of these ineffective parameters and equips the pre-trained model with new task-specific capabilities. In this work, we first investigate the importance of parameters in pre-trained diffusion models and discover that parameters with the smallest absolute values do not contribute to the generation process due to training instabilities. Based on this observation, we propose a fine-tuning method termed SaRA that re-utilizes these temporarily ineffective parameters, which amounts to optimizing a sparse weight matrix to learn task-specific knowledge. To mitigate potential overfitting, we propose a nuclear-norm-based low-rank sparse training scheme for efficient fine-tuning. Furthermore, we design a new progressive parameter adjustment strategy to make full use of the finetuned parameters. Finally, we propose a novel unstructural backpropagation strategy, which significantly reduces memory costs during fine-tuning. Our method enhances the generative capabilities of pre-trained models in downstream applications and outperforms existing fine-tuning methods in maintaining the model’s generalization ability.
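The core selection rule, training only the smallest-magnitude parameters, can be sketched directly; the nuclear-norm low-rank regularizer and the unstructural backpropagation trick are omitted here. A minimal PyTorch sketch, assuming a fixed per-matrix sparsity ratio:

```python
import torch

def sara_masks(model, ratio=0.05):
    """Mark the `ratio` fraction of smallest-|w| entries of each weight
    matrix as trainable; everything else stays frozen via gradient masking."""
    masks = {}
    with torch.no_grad():
        for name, p in model.named_parameters():
            if p.dim() < 2:                  # skip biases / norms (assumption)
                continue
            k = max(1, int(p.numel() * ratio))
            threshold = p.abs().flatten().kthvalue(k).values
            masks[name] = (p.abs() <= threshold).float()
    return masks

def mask_gradients(model, masks):
    """Call after loss.backward(): zero gradients outside the sparse mask."""
    for name, p in model.named_parameters():
        if name in masks and p.grad is not None:
            p.grad.mul_(masks[name])
```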

547Enriching Knowledge Distillation with Intra-Class Contrastive Learning

[openreview] [pdf]

Abstract Since the advent of knowledge distillation, much research has focused on how the soft labels generated by the teacher model can be utilized effectively. A study points out that the implicit knowledge within soft labels originates from the multi-view structure present in the data. Feature variations within samples of the same class allow the student model to generalize better by learning diverse representations. However, in existing distillation methods, teacher models predominantly adhere to ground-truth labels as targets, without considering the diverse representations within the same class. Therefore, we propose incorporating an intra-class contrastive loss during teacher training to enrich the intra-class information contained in soft labels. In practice, we find that the intra-class loss causes instability in training and slows convergence. To mitigate these issues, a margin loss is integrated into intra-class contrastive learning to improve training stability and convergence speed. Simultaneously, we theoretically analyze the impact of this loss on intra-class and inter-class distances, and prove that the intra-class contrastive loss can enrich intra-class diversity. Experimental results demonstrate the effectiveness of the proposed method.
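One plausible form of the margin-stabilized intra-class contrastive term (an assumption, not the paper's exact loss) penalizes same-class feature pairs only when their cosine similarity exceeds a margin, so representations diversify without being pushed apart indefinitely:

```python
import torch
import torch.nn.functional as F

def intra_class_contrastive_loss(features, labels, margin=0.5):
    """Penalize same-class pairs whose cosine similarity exceeds `margin`.
    features: (n, d) teacher embeddings; labels: (n,) class indices."""
    z = F.normalize(features, dim=1)
    sim = z @ z.t()                                  # pairwise cosine similarity
    same = labels.unsqueeze(0).eq(labels.unsqueeze(1))
    same.fill_diagonal_(False)                       # ignore self-pairs
    if not same.any():
        return z.new_zeros(())
    return F.relu(sim[same] - margin).mean()         # margin bounds the repulsion
```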

548Diffusion-Nested Auto-Regressive Synthesis of Heterogeneous Tabular Data

[openreview] [pdf]

Abstract Autoregressive models are predominant in natural language generation, while their application in tabular data remains underexplored. We posit that this can be attributed to two factors: 1) tabular data contains heterogeneous data types, while the autoregressive model is primarily designed to model discrete-valued data; 2) tabular data is column permutation-invariant, requiring a generation model to generate columns in arbitrary order. This paper proposes a Diffusion-nested Autoregressive model (TabDAR) to address these issues. To enable autoregressive methods for continuous columns, TabDAR employs a diffusion model to parameterize the conditional distribution of continuous features. To ensure arbitrary generation order, TabDAR resorts to masked transformers with bi-directional attention, which simulate various permutations of column order, hence enabling it to learn the conditional distribution of a target column given an arbitrary combination of other columns. These designs enable TabDAR to not only freely handle heterogeneous tabular data but also support convenient and flexible unconditional/conditional sampling. We conduct extensive experiments on ten datasets with distinct properties, and the proposed TabDAR outperforms previous state-of-the-art methods by 18% to 45% on eight metrics across three distinct aspects.

549Boundless Socratic Learning

[openreview] [pdf]

Abstract An agent trained within a closed system can master any desired capability, as long as the following three conditions hold: (a) it receives sufficiently informative and aligned feedback, (b) its coverage of experience/data is broad enough, and (c) it has sufficient capacity and resources. In this white paper, we justify these conditions, and consider what limitations arise from (a) and (b) in closed systems, when assuming that (c) is not a bottleneck. Considering the special case of agents with matching input and output spaces (namely, language), we argue that such pure recursive self-improvement, dubbed "Socratic learning", can boost performance vastly beyond what is present in its initial data or knowledge, and is only limited by time, as well as gradual misalignment concerns. Furthermore, we propose a constructive framework to implement it, based on the notion of language games.

550Turning Challenges into Opportunities: How Distribution Shifts Enhance Identifiability in Causal Representation Learning

[openreview] [pdf]

Abstract Causal representation learning seeks to uncover latent causal variables and their relationships from observed, unstructured data, a task complicated by identifiability challenges. While distribution shifts, viewed as natural interventions on latent causal variables, often present difficulties in traditional machine learning tasks, they also create valuable opportunities for identifiability by introducing variability in latent variables. In this paper, we study a non-parametric condition characterizing the types of distribution shifts that contribute to identifiability within the context of latent additive noise models. We also present partial identifiability results when only a portion of distribution shifts meets the condition. Furthermore, we extend our findings to latent post-nonlinear causal models. Building on our theoretical results, we propose a practical algorithm facilitating the acquisition of reliable latent causal representations. Our algorithm, guided by our underlying theory, has demonstrated outstanding performance across a diverse range of synthetic and real-world datasets. The empirical observations closely align with the theoretical findings, affirming the robustness and effectiveness of our proposed approach.

551GROD: Enhancing Generalization of Transformer with Out-of-Distribution Detection

[openreview] [pdf]

Abstract Transformer networks excel in natural language processing (NLP) and computer vision (CV) tasks. However, they face challenges in generalizing to Out-of-Distribution (OOD) datasets, that is, data whose distribution differs from that seen during training. OOD detection aims to distinguish data that deviates from the expected distribution, while maintaining optimal performance on in-distribution (ID) data. This paper introduces a novel approach based on OOD detection, termed the Generate Rounded OOD Data (GROD) algorithm, which significantly bolsters the generalization performance of transformer networks across various tasks. GROD is motivated by our new OOD detection Probably Approximately Correct (PAC) theory for transformers: the transformer has learnability in terms of OOD detection, that is, when the data is sufficient, the outliers can be well represented. By penalizing the misclassification of OOD data within the loss function and generating synthetic outliers, GROD guarantees learnability and refines the decision boundaries between inliers and outliers. This strategy demonstrates robust adaptability and general applicability across different data types. Evaluated across diverse OOD detection tasks in NLP and CV, GROD achieves SOTA regardless of data format. The code is available at https://anonymous.4open.science/r/GROD-OOD-Detection-with-transformers-B70F.

552Memory retaining finetuning via distillation

[openreview] [pdf]

Abstract Large language models (LLMs) pretrained on large corpora of internet text possess much of the world’s knowledge. Following pretraining, one often needs to conduct continued pretraining on certain capabilities such as math and coding, or “posttraining” (a.k.a., alignment) techniques to make the models follow users’ instructions and align them with human preferences. One challenge during these finetuning stages is that the model can lose the pretraining knowledge or forget certain capabilities (e.g., in-context learning ability). Moreover, although there exist strong open-weight LLMs such as Llama 3, both their pretraining and posttraining data are not open to the public, making it difficult to mix the finetuning data with the models’ own pretraining data as a solution for mitigating forgetting. We propose label annealing, a method that mitigates forgetting during finetuning without requiring access to the original pretraining data. Label annealing distills pretraining knowledge during finetuning by adding a KL divergence term to the loss function, regularizing the divergence between the finetuned model’s predictions and those of the initial pretrained model. In mathematics and code finetuning, label annealing improves the model’s performance in target domains without sacrificing other capabilities of the pretrained model. In alignment finetuning, our method introduces a smooth tradeoff between the instruction-following capability and the pretraining knowledge. We complement our empirical investigation with a mathematical model of overparameterized linear regression that provides geometric intuition for why label annealing helps.
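The loss itself is one line of algebra: task cross-entropy plus a KL term tying the finetuned model's predictions to the frozen pretrained model's. A minimal sketch, with the temperature and weighting as assumed hyperparameters:

```python
import torch.nn.functional as F

def label_annealing_loss(student_logits, frozen_logits, labels, lam=0.1, tau=1.0):
    """Cross-entropy on the task labels plus a KL regularizer toward the
    frozen pretrained model's predictions (frozen_logits carries no grad)."""
    ce = F.cross_entropy(student_logits, labels)
    kl = F.kl_div(F.log_softmax(student_logits / tau, dim=-1),
                  F.log_softmax(frozen_logits / tau, dim=-1),
                  log_target=True, reduction="batchmean") * tau ** 2
    return ce + lam * kl
```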

553Multi-aspect Knowledge Distillation with Large Language Model

[openreview] [pdf]

Abstract Recent advancements in deep learning have significantly improved performance on computer vision tasks. Previous image classification methods primarily modify model architectures or add features, and they optimize models using cross-entropy loss on class logits. Since they focus on classifying images based only on class labels, these methods may struggle to learn various aspects of classes (e.g., natural positions and shape changes). In contrast, humans classify images by naturally referring to multiple aspects such as context, shape, color, and other features. Inspired by this, rethinking the previous approach from a novel view, we propose a multi-aspect knowledge distillation method using Multimodal Large Language Models (MLLMs). Our approach involves: 1) querying a Large Language Model with multi-aspect questions relevant to the knowledge we want to transfer to the model, 2) extracting the corresponding logits from the MLLM, and 3) expanding the model’s output dimensions to distill these multi-aspect logits. We then apply cross-entropy loss to the class logits and binary cross-entropy loss to the multi-aspect logits. Through our method, the model can learn not only knowledge about visual aspects but also abstract and complex aspects that require deeper understanding. We primarily apply our method to image classification and, to explore the potential for extending our model, expand it to other tasks such as object detection. In all experiments, our method improves the performance of the baselines. Additionally, we analyze the effect of multi-aspect knowledge distillation. These results demonstrate that our method can transfer knowledge about various aspects to the model, and this aspect knowledge can enhance model performance in computer vision tasks. This paper demonstrates the great potential of multi-aspect knowledge distillation, and we believe it offers a promising direction for future research in computer vision and beyond.
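The combined objective reduces to two standard losses over a widened output head. A minimal sketch, assuming the MLLM's per-aspect scores are squashed to [0, 1] as soft targets:

```python
import torch
import torch.nn.functional as F

def multi_aspect_kd_loss(class_logits, aspect_logits, labels,
                         mllm_aspect_logits, lam=1.0):
    """Cross-entropy on class logits + BCE distilling multi-aspect logits.
    aspect_logits: (n, n_aspects) from the expanded student head;
    mllm_aspect_logits: teacher scores for the same aspects (assumption)."""
    ce = F.cross_entropy(class_logits, labels)
    bce = F.binary_cross_entropy_with_logits(
        aspect_logits, torch.sigmoid(mllm_aspect_logits))
    return ce + lam * bce
```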

554Rethinking the Bias of Foundation Model under Long-tailed Distribution

[openreview] [pdf]

Abstract Long-tailed learning has garnered increasing attention due to its practical significance. Among the various approaches, the fine-tuning paradigm has gained considerable interest with the advent of foundation models. However, most existing methods primarily focus on leveraging knowledge from these models, overlooking the inherent biases introduced by the imbalanced training data they rely on. In this paper, we examine how such imbalances affect long-tailed downstream tasks. Specifically, we refer to the biases in foundation models and downstream tasks as parameter imbalance and data imbalance, respectively. Through fine-tuning, we observe that parameter imbalance plays a more critical role, while data imbalance can be mitigated using existing re-balancing strategies. Moreover, we find that parameter imbalance cannot be effectively addressed by current re-balancing techniques, such as adjusting the logits during training, unlike data imbalance. To tackle both imbalances simultaneously, we construct a causal structure graph and view the partial semantic factor as a confounder, which introduces spurious correlations between input samples and labels. To resolve these negative effects, we propose a novel backdoor adjustment method that learns the true causal effect between input samples and labels, rather than merely fitting the correlations in the data. Experimental results validate the effectiveness of our method.

555100 instances is all you need: predicting LLM success by testing on a few instances

[openreview] [pdf]

Abstract Predicting if LLMs will succeed on individual task instances is essential to ensure their reliability in high-stakes applications. To do so, we can evaluate an LLM on a set of instances and train an “assessor” to predict its performance. However, this requires evaluating each new LLM on sufficiently many instances. In this work, we build a “generic assessor” predicting the performance of any LLM on an instance by using the LLM’s performance on a small set of reference instances and the features of the considered instance. In practice, we make use of existing evaluation results to extract the representative instances and train the assessor. Thus, the performance of a new LLM can be predicted by only testing it on the reference instances, leveraging the information contained in other LLMs’ evaluations. We conduct empirical studies on HELM-Lite and KindsOfReasoning, a new collection of existing reasoning datasets that we introduce, where we evaluate all instruction-fine-tuned OpenAI models up to gpt4-0125-preview. We find that a few instances (around 100) are enough to achieve predictive power comparable to the LLM-specific assessors trained on the complete set of several thousand instances. Interestingly, randomly selecting the reference instances performs comparably to the advanced selection methods we tested. Finally, we identify a sharp drop in the predictive power of the generic and specific assessors in out-of-distribution scenarios, suggesting that the inherent predictability of LLMs is low.
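The generic assessor amounts to a supervised model whose input concatenates an LLM "fingerprint" (its 0/1 outcomes on the ~100 reference instances) with features of the instance being predicted. A rough scikit-learn sketch under those assumptions:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def fit_generic_assessor(ref_success, inst_feats, success):
    """ref_success: (n_llms, n_ref) outcomes on reference instances;
    inst_feats: (n_inst, d) instance features; success: (n_llms, n_inst)
    binary outcomes. Rows pair each LLM fingerprint with each instance."""
    n_llms, n_inst = success.shape
    X = np.concatenate([np.repeat(ref_success, n_inst, axis=0),    # LLM fingerprint
                        np.tile(inst_feats, (n_llms, 1))], axis=1)  # instance features
    y = success.reshape(-1)
    return LogisticRegression(max_iter=1000).fit(X, y)

# Predicting for a new LLM then only requires running it on the reference
# instances and concatenating that fingerprint with each instance's features.
```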

556Aggregation of Multi Diffusion Models for Enhancing Learned Representations

[openreview] [pdf]

Abstract Diffusion models have achieved remarkable success in image generation, particularly with the various applications of classifier-free guidance conditional diffusion models. While many diffusion models perform well when controlling for a particular aspect among style, character, and interaction, they struggle with fine-grained control due to dataset limitations and intricate model architecture design. This paper introduces a novel algorithm, Aggregation of Multi Diffusion Models (AMDM), which synthesizes features from multiple diffusion models into a specified model, enhancing its learned representations to activate specific features for fine-grained control. AMDM consists of two key components: spherical aggregation and manifold optimization. Spherical aggregation merges intermediate variables from different diffusion models with minimal manifold deviation, while manifold optimization refines these variables to align with the intermediate data manifold, enhancing sampling quality. Experimental results demonstrate that AMDM significantly improves fine-grained control without additional training or inference time, proving its effectiveness. Additionally, it reveals that diffusion models initially focus on features such as position, attributes, and style, with later stages improving generation quality and consistency. AMDM offers a new perspective for tackling the challenges of fine-grained conditional control generation in diffusion models: we can fully utilize existing conditional diffusion models that control specific aspects, or develop new ones, and then aggregate them using the AMDM algorithm. This eliminates the need for constructing complex datasets, designing intricate model architectures, and incurring high training costs. Code is available at: https://github.com/Hammour-steak/AMDM
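Spherical aggregation of intermediate diffusion variables is plausibly a slerp-style merge; the exact rule in the paper may differ, and the manifold-optimization step is omitted here. A minimal sketch:

```python
import torch

def spherical_aggregate(x_a, x_b, w=0.5, eps=1e-8):
    """Slerp-style merge of intermediate latents from two diffusion models,
    staying closer to their shared (roughly spherical) manifold than a
    linear average would."""
    a, b = x_a.flatten(), x_b.flatten()
    cos = (a @ b) / (a.norm() * b.norm() + eps)
    theta = torch.arccos(cos.clamp(-1 + 1e-7, 1 - 1e-7))
    merged = (torch.sin((1 - w) * theta) * a + torch.sin(w * theta) * b) \
             / (torch.sin(theta) + eps)
    return merged.view_as(x_a)
```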

557OASIS Uncovers: High-Quality T2I Models, Same Old Stereotypes

[openreview] [pdf]

Abstract Images generated by text-to-image (T2I) models often exhibit visual biases and stereotypes of concepts such as culture and profession. Existing quantitative measures of stereotypes are based on statistical parity that does not align with the sociological definition of stereotypes and, therefore, incorrectly categorizes biases as stereotypes. Instead of oversimplifying stereotypes as biases, we propose a quantitative measure of stereotypes that aligns with its sociological definition. We then propose OASIS to measure the stereotypes in a generated dataset and understand their origins within the T2I model. OASIS includes two scores to measure stereotypes from a generated image dataset: (M1) Stereotype Score to measure the distributional violation of stereotypical attributes, and (M2) WALS to measure spectral variance in the images along a stereotypical attribute. OASIS also includes two methods to understand the origins of stereotypes in T2I models: (U1) StOP to discover attributes that the T2I model internally associates with a given concept, and (U2) SPI to quantify the emergence of stereotypical attributes in the latent space of the T2I model during image generation. Despite the considerable progress in image fidelity, using OASIS, we conclude that newer T2I models such as FLUX.1 and SDv3 contain strong stereotypical predispositions about concepts and still generate images with widespread stereotypical attributes. Additionally, the quantity of stereotypes worsens for nationalities with lower Internet footprints.

558Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning

[openreview] [pdf]

Abstract In this work, we address the problem of large language model (LLM) unlearning, aiming to remove unwanted data influences and associated model capabilities (e.g., copyrighted data or harmful content generation) while preserving essential model utilities, without the need for retraining from scratch. Despite the growing need for LLM unlearning, a principled optimization framework remains lacking. To this end, we revisit the state-of-the-art approach, negative preference optimization (NPO), and identify the issue of reference model bias, which could undermine NPO’s effectiveness, particularly when unlearning forget data of varying difficulty. Given that, we propose a simple yet effective unlearning optimization framework, called SimNPO, showing that "simplicity" in removing the reliance on a reference model (through the lens of simple preference optimization) benefits unlearning. We also provide deeper insights into SimNPO’s advantages, supported by analysis using mixtures of Markov chains. Furthermore, we present extensive experiments validating SimNPO’s superiority over existing unlearning baselines in benchmarks like TOFU and MUSE, and robustness against relearning attacks.
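Reading the abstract through the lens of simple preference optimization suggests a reference-free loss of roughly the following shape; this is an assumption about the formula, not the paper's verbatim objective. The reward is the length-normalized log-likelihood of the forget sample, pushed down through a negative log-sigmoid:

```python
import torch.nn.functional as F

def simnpo_style_loss(logprob_sum, length, beta=2.5, gamma=0.0):
    """Hedged sketch of a reference-free NPO-style unlearning loss.
    logprob_sum: summed token log-probs of the forget response y under the
    current model; length: |y|. No reference model appears anywhere."""
    reward = (beta / length) * logprob_sum - gamma   # length-normalized reward
    return -(2.0 / beta) * F.logsigmoid(-reward).mean()
```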

559Enforcing Interpretability in Time Series Transformers: A Concept Bottleneck Framework

[openreview] [pdf]

Abstract There has been a recent push of research on Transformer-based models for long-term time series forecasting, even though they are inherently difficult to interpret and explain. While there is a large body of work on interpretability methods for various domains and architectures, the interpretability of Transformer-based forecasting models remains largely unexplored. To address this gap, we develop a framework based on Concept Bottleneck Models to enforce interpretability of time series Transformers. We modify the training objective to encourage a model to develop representations similar to predefined interpretable concepts. In our experiments, we enforce similarity using Centered Kernel Alignment, and the predefined concepts include time features and an interpretable, autoregressive surrogate model (AR). We apply the framework to the Autoformer model, and present an in-depth analysis for a variety of benchmark tasks. We find that the model performance remains mostly unaffected, while the model shows much improved interpretability. Additionally, interpretable concepts become local, which makes the trained model easily intervenable. As a proof of concept, we demonstrate a successful intervention in the scenario of a time shift in the data, which eliminates the need to retrain.
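The similarity measure named in the abstract, Centered Kernel Alignment, has a standard linear form that is easy to reproduce; an auxiliary term maximizing it between a chosen layer's activations and concept features (time features, AR surrogate predictions) is one plausible way to realize the modified training objective.

```python
import torch

def linear_cka(X, Y):
    """Linear CKA between representation matrices X: (n, d1) and Y: (n, d2),
    e.g., Transformer activations vs. interpretable concept features."""
    X = X - X.mean(dim=0, keepdim=True)   # center each feature column
    Y = Y - Y.mean(dim=0, keepdim=True)
    num = (X.t() @ Y).norm(p="fro") ** 2
    den = (X.t() @ X).norm(p="fro") * (Y.t() @ Y).norm(p="fro")
    return num / (den + 1e-12)
```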

560KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models

[openreview] [pdf]

Abstract Recent advances in diffusion models have significantly improved text-to-image (T2I) generation, but they often struggle to balance fine-grained precision with high-level control. Methods like ControlNet and T2I-Adapter excel at following sketches by seasoned artists but tend to be overly rigid, replicating unintentional flaws in sketches from novice users. Meanwhile, coarse-grained methods, such as sketch-based abstraction frameworks, offer more accessible input handling but lack the precise control needed for detailed, professional use. To address these limitations, we propose KnobGen, a dual-pathway framework that democratizes sketch-based image generation by seamlessly adapting to varying levels of sketch complexity and user skill. KnobGen uses a Coarse-Grained Controller (CGC) module for high-level semantics and a Fine-Grained Controller (FGC) module for detailed refinement. The relative strength of these two modules can be adjusted through our knob inference mechanism to align with the user’s specific needs. These mechanisms ensure that KnobGen can flexibly generate images from both novice sketches and those drawn by seasoned artists. This maintains control over the final output while preserving the natural appearance of the image, as evidenced on the MultiGen-20M dataset and a newly collected sketch dataset.

561Bias Mitigation in Graph Diffusion Models

[openreview] [pdf]

Abstract Most existing graph generative diffusion models suffer from significant exposure bias during graph sampling. We observe that the forward diffusion’s maximum perturbation distribution in most models deviates from the standard normal distribution, while reverse sampling consistently starts from a standard normal distribution. This mismatch results in a reverse starting bias, which, together with the exposure bias, degrades generation quality. The exposure bias typically accumulates and propagates throughout the sampling process. In this paper, we effectively address both biases. To mitigate reverse starting bias, we employ a newly designed Langevin sampling algorithm to align with the forward maximum perturbation distribution, establishing a new reverse starting point. To address the exposure bias, we introduce a fraction correction mechanism based on a newly defined score difference. Our approach, which requires no network modifications, is validated across multiple models, datasets, and tasks, achieving state-of-the-art results.

562Forward Learning with Differential Privacy

[openreview] [pdf]

Abstract Differential privacy (DP) in deep learning is a critical concern as it ensures the confidentiality of training data while maintaining model utility. Existing DP training algorithms provide privacy guarantees by clipping each individual backpropagated gradient and then injecting noise. Different from backpropagation, forward-learning algorithms based on perturbation inherently utilize randomness to estimate the gradient of each sample in parallel. These algorithms offer high parallelizability, suitability for non-differentiable modules, and applicability in black-box settings. Moreover, the introduction of noise during the forward pass indirectly provides randomness protection to the model parameters and their gradients, suggesting its potential for naturally providing differential privacy. In this paper, we propose a forward-learning algorithm, Differential Private Unified Likelihood Ratio method (DP-ULR), and demonstrate its differential privacy guarantees. DP-ULR features a novel batch sampling operation with rejection, which we theoretically analyze in conjunction with classic differential privacy mechanisms. DP-ULR is also underpinned by a theoretically guided privacy controller that dynamically adjusts noise levels to manage privacy costs effectively in each training step. Our experiments indicate that DP-ULR achieves competitive performance compared to traditional differential privacy training algorithms based on backpropagation, maintaining the same privacy loss limits.

563Bridging Jensen Gap for Max-Min Group Fairness Optimization in Recommendation

[openreview] [pdf]

Abstract Group max-min fairness (MMF) is commonly used in fairness-aware recommender systems (RS) as an optimization objective, as it aims to protect marginalized item groups and ensures a fair competition platform. However, our theoretical analysis indicates that integrating the MMF constraint violates the assumption of sample independence during optimization, causing the loss function to deviate from linear additivity. This nonlinearity introduces a Jensen gap between the model’s convergence point and the optimal point when mini-batch sampling is applied. Both theoretical and empirical studies show that as the mini-batch size decreases and the group size increases, the Jensen gap widens accordingly. Some methods using heuristic re-weighting or debiasing strategies have the potential to bridge the Jensen gap. However, they either lack theoretical guarantees or suffer from heavy computational costs. To overcome these limitations, we first theoretically demonstrate that the MMF-constrained objective can be essentially reformulated as a group-weighted optimization objective. Then we present an efficient and effective algorithm named FairDual, which utilizes a dual optimization technique to minimize the Jensen gap. Our theoretical analysis demonstrates that FairDual can achieve a sub-linear convergence rate to the globally optimal solution and that the Jensen gap can be well bounded under a mini-batch sampling strategy with random shuffling. Extensive experiments conducted using three large-scale RS backbone models on two publicly available datasets demonstrate that FairDual outperforms all baselines in terms of both accuracy and fairness.

564Log-Sum-Exponential Estimator for Off-Policy Evaluation and Learning

[openreview] [pdf]

Abstract Off-policy learning and evaluation scenarios leverage logged bandit feedback datasets, which contain context, action, propensity score, and feedback for each data point. These scenarios face significant challenges due to high variance and poor performance with low-quality propensity scores and heavy-tailed reward distributions. We address these issues by introducing a novel estimator based on the log-sum-exponential (LSE) operator, which outperforms traditional inverse propensity score estimators. Our LSE estimator demonstrates variance reduction and robustness under heavy-tailed conditions. For off-policy evaluation, we derive upper bounds on the estimator’s bias and variance. In the off-policy learning scenario, we establish bounds on the regret—the performance gap between our LSE estimator and the optimal policy—assuming a bounded (1+ε)-th moment of the weighted reward. Notably, we achieve a convergence rate of O(n^{-ε/(1+ε)}) for the regret bounds, where n is the number of training samples. Theoretical analysis is complemented by comprehensive empirical evaluations in both off-policy learning and evaluation scenarios, confirming the practical advantages of our approach.
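The LSE operator itself is a one-liner. Below is a sketch of the estimator as the abstract describes it, with a numerically stabilized log-sum-exp; λ < 0 damps heavy-tailed weighted rewards (trading a small bias for lower variance), and λ → 0 recovers the plain inverse-propensity-score average. The exact parameterization is an assumption.

```python
import numpy as np

def lse_estimator(weights, rewards, lam=-1.0):
    """LSE off-policy value estimate: (1/lam) * log mean exp(lam * w * r).
    weights: importance weights pi_target/pi_logging; rewards: logged rewards."""
    z = lam * weights * rewards
    m = z.max()                                 # stabilize the log-sum-exp
    return (m + np.log(np.mean(np.exp(z - m)))) / lam

# Example: with heavy-tailed weights, the plain IPS average
# (weights * rewards).mean() can blow up, while the LSE estimate stays stable.
```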

565Longhorn: State Space Models are Amortized Online Learners

[openreview] [pdf]

Abstract The most fundamental capability of modern AI methods such as Large Language Models (LLMs) is the ability to predict the next token in a long sequence of tokens, known as “sequence modeling.” Although the Transformer is the current dominant approach to sequence modeling, its quadratic computational cost with respect to sequence length is a significant drawback. State-space models (SSMs) offer a promising alternative due to their linear decoding efficiency and high parallelizability during training. However, existing SSMs often rely on seemingly ad hoc linear recurrence designs. In this work, we explore SSM design through the lens of online learning, conceptualizing SSMs as meta-modules for specific online learning problems. This approach links SSM design to formulating precise online learning objectives, with state transition rules derived from optimizing these objectives. Based on this insight, we introduce a novel deep SSM architecture based on the implicit update for optimizing an online regression objective. Our experimental results show that our models outperform state-of-the-art SSMs, including the Mamba model, on standard sequence modeling benchmarks and language modeling tasks.

566Following the Human Thread in Social Navigation

[openreview] [pdf]

Abstract The success of collaboration between humans and robots in shared environments relies on the robot’s real-time adaptation to human motion. Specifically, in Social Navigation, the agent should be close enough to assist but ready to back up to let the human move freely, avoiding collisions. Human trajectories emerge as crucial cues in Social Navigation, but they are partially observable from the robot’s egocentric view and computationally complex to process. We present the first Social Dynamics Adaptation model (SDA) based on the robot’s state-action history to infer the social dynamics. We propose a two-stage Reinforcement Learning framework: the first stage learns to encode the human trajectories into social dynamics and learns a motion policy conditioned on this encoded information, the current status, and the previous action. Here, the trajectories are fully visible, i.e., assumed as privileged information. In the second stage, the trained policy operates without direct access to trajectories. Instead, the model infers the social dynamics solely from the history of previous actions and statuses in real time. Tested on the novel Habitat 3.0 platform, SDA sets a novel state-of-the-art (SotA) performance in finding and following humans. The code will be released upon acceptance.

567Open-World Reinforcement Learning over Long Short-Term Imagination

[openreview] [pdf]

Abstract Training visual reinforcement learning agents in a high-dimensional open world presents significant challenges. While various model-based methods have improved sample efficiency by learning interactive world models, these agents tend to be “short-sighted”, as they are typically trained on short snippets of imagined experiences. We argue that the primary obstacle in open-world decision-making is improving the efficiency of off-policy exploration across an extensive state space. In this paper, we present LS-Imagine, which extends the imagination horizon within a limited number of state transition steps, enabling the agent to explore behaviors that potentially lead to promising long-term feedback. The foundation of our approach is to build a long short-term world model. To achieve this, we simulate goal-conditioned jumpy state transitions and compute corresponding affordance maps by zooming in on specific areas within single images. This facilitates the integration of direct long-term values into behavior learning. Our method demonstrates significant improvements over state-of-the-art techniques in MineDojo.

568Inverse Constitutional AI: Compressing Preferences into Principles

[openreview] [pdf]

Abstract Feedback data is widely used to align or evaluate state-of-the-art AI models according to human preferences. Pairwise text preferences, where human (or AI) annotators select the “better” of two options, are particularly common. This data is typically used to train reward models or to compute aggregate statistics, asserting one model to be “better” than another. For many applications, however, it is desirable to understand human preferences in addition to modeling them. Neither black-box reward models nor statistics can answer why one model is better than another. Pairwise preference datasets, therefore, pose an interpretability challenge. The raw data consists of numerous (long) response pairs that are often infeasible to interpret manually. Prior work has demonstrated that human-annotated preference data often exhibits unintended biases, underscoring the urgent need for good interpretability tools to detect and alleviate such biases. In this paper, we introduce the Inverse Constitutional AI (ICAI) problem, formulating the interpretation of pairwise text preference data as a compression task. In constitutional AI, a set of principles (a constitution) is used to provide feedback and fine-tune AI models. ICAI inverts this process: given a feedback dataset, we aim to extract a constitution that best enables a large language model (LLM) to reconstruct the original annotations. We propose a corresponding algorithm and validate its generated constitutions quantitatively based on annotation reconstruction accuracy on a variety of datasets: (a) synthetic feedback data with known underlying principles; (b) AlpacaEval data with cross-annotated human feedback; (c) crowdsourced Chatbot Arena data; and (d) PRISM data from diverse demographic groups. As a short and interpretable representation of the original dataset, generated constitutions have many potential use cases — they may help identify undesirable annotator biases, better understand model performance, scale feedback to unseen data, or assist with adapting LLMs to individual user or group preferences. We release the code for our experiments at hidden url.

569FullDiffusion: Diffusion Models Without Time Truncation

[openreview] [pdf]

Abstract Diffusion models are predominantly used for generative modeling; they synthesize samples by simulating the reverse process of a stochastic differential equation (SDE) that diffuses data into Gaussian noise. However, when simulating the reverse SDE, the SDE solver suffers from numerical instability near the time boundary; hence, in practice, the simulation is terminated before reaching the boundary point. This heuristic time truncation hinders the rigorous formulation of diffusion models and requires additional hyperparameter tuning. Moreover, such numerical instability often occurs even in training, especially when using a maximum likelihood loss. Therefore, current diffusion models rely heavily on the time truncation technique in both training and inference. In this paper, we propose a method that completely eliminates the heuristic of time truncation. Our method eliminates numerical instability during maximum likelihood training by modifying the parameterization of the noise predictor and the noise schedule. We also propose a novel SDE solver that can simulate without time truncation by taking advantage of the semi-linear structure of the reverse SDE. These improvements enable stable training and sampling of diffusion models without relying on time truncation. In our experiments, we tested the effectiveness of our method on the CIFAR-10 and ImageNet-32 datasets by evaluating the test likelihood and the sample quality measured by the Fréchet inception distance (FID). We observe that our method consistently improves both test likelihood and FID compared to the DDPM++ baseline.

570Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning

[openreview] [pdf]

Abstract Diffusion Policies have become widely used in Imitation Learning, offering several appealing properties, such as generating multimodal and discontinuous behavior. As models are becoming larger to capture more complex capabilities, their computational demands increase, as shown by recent scaling laws. Therefore, continuing with the current architectures will present a computational roadblock. To address this gap, we propose Mixture-of-Denoising Experts (MoDE) as a novel policy for Imitation Learning. MoDE surpasses current state-of-the-art Transformer-based Diffusion Policies while enabling parameter-efficient scaling, reducing the inference cost significantly. To achieve this, MoDE uses sparse experts combined with a novel routing strategy that conditions the expert selection on the current noise level of the denoising process. This is combined with a noise-conditioned self-attention mechanism for further improvements. MoDE achieves state-of-the-art performance across 134 tasks in four established imitation learning benchmarks (CALVIN and LIBERO). It surpasses both CNN-based and Transformer Diffusion Policies by an average of 20% in all settings, while using 40% fewer FLOPs and fewer active parameters. Furthermore, we conduct comprehensive ablations on MoDE’s components, providing insights for designing efficient and scalable Transformer architectures for Diffusion Policies.
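
As a sketch of the routing idea (an illustrative design, not MoDE's exact architecture), a router that sees only a noise-level embedding can pick a sparse subset of expert MLPs per denoising step; because the route depends only on the noise level, the expert assignment for each step can in principle be precomputed, which is one way such a design keeps inference cheap.

```python
import torch
import torch.nn as nn

class NoiseConditionedMoE(nn.Module):
    """Sparse MoE block whose router is conditioned on a noise-level
    embedding rather than on the token content. Illustrative sketch only."""
    def __init__(self, dim, num_experts=4, top_k=2):
        super().__init__()
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))
            for _ in range(num_experts))
        self.router = nn.Linear(dim, num_experts)  # input: noise-level embedding
        self.top_k = top_k

    def forward(self, x, noise_emb):
        # route per sample using only the noise level of the denoising step
        weights, idx = torch.topk(self.router(noise_emb).softmax(-1), self.top_k, -1)
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e
                if mask.any():
                    w = weights[mask][:, slot].unsqueeze(-1)
                    out[mask] = out[mask] + w * expert(x[mask])
        return out
```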

571Rainbow Generator: Generating Diverse Data for Name Only Continual Learning

[openreview] [pdf]

Abstract Requiring extensive human supervision is often impractical for continual learning due to its cost, leading to the emergence of ‘name-only continual learning’, which provides only the name of new concepts (e.g., classes) without supervised samples. To address this task, a recent approach uses web-scraped data, but this introduces issues such as data imbalance, copyright, and privacy concerns. To overcome the limitations of both human supervision and webly supervision, we propose Generative name-only Continual Learning (GenCL), which uses generative models for name-only continual learning. However, naïve application of generative models yields limited diversity in the generated data. We therefore propose a diverse prompt generation method, HIerarchical Recurrent Prompt Generation (HIRPG), as well as a COmplexity-NAvigating eNsembler (CONAN) that selects samples with minimal overlap from multiple generative models. We empirically validate that the proposed GenCL outperforms prior art, and even a model trained with fully supervised data, on various tasks including image recognition and multi-modal visual reasoning. Data generated by GenCL is available at https://anonymous.4open.science/r/name-only-continual-E079.

572Sail into the Headwind: Alignment via Robust Rewards and Dynamic Labels against Reward Hacking

[openreview] [pdf]

Abstract Aligning AI systems with human preferences typically suffers from the infamous reward hacking problem, where optimization of an imperfect reward model leads to undesired behaviors. In this paper, we investigate reward hacking in offline preference optimization (PO), which aims to improve an initial model using a preference dataset. We identify two types of reward hacking stemming from statistical fluctuations in the dataset: Type I Reward Hacking, due to subpar choices appearing more favorable, and Type II Reward Hacking, due to decent choices appearing less desirable. We prove that many (mainstream or theoretical) PO methods suffer from both types of reward hacking. To address Type I Reward Hacking, we propose POWER, a new PO method that combines Guiasu’s Weighted Entropy with a Robust Reward maximization objective. POWER enjoys finite-sample guarantees under general function approximation, competing with the best covered policy in the data. To address Type II Reward Hacking, we analyze the learning dynamics of POWER and combine it with a novel technique that dynamically updates preference labels (POWER-DL) toward certain “stationary labels”, resulting in diminishing gradients for untrustworthy samples. Empirically, POWER-DL consistently outperforms state-of-the-art methods on alignment benchmarks, achieving improvements of up to 13.0 points on AlpacaEval 2 and 11.5 points on Arena Hard over DPO. Strong theoretical guarantees and empirical performance demonstrate the promise of POWER-DL in mitigating reward hacking.

573Federated Learning Can Find Friends That Are Advantageous

[openreview] [pdf]

Abstract In Federated Learning (FL), the distributed nature and heterogeneity of client data present both opportunities and challenges. While collaboration among clients can significantly enhance the learning process, not all collaborations are beneficial; some may even be detrimental. In this study, we introduce a novel algorithm that assigns adaptive aggregation weights to clients participating in FL training, identifying those with data distributions most conducive to a specific learning objective. We demonstrate that our aggregation method converges no worse than the method that aggregates only the updates received from clients with the same data distribution. Furthermore, empirical evaluations consistently reveal that collaborations guided by our algorithm outperform traditional FL approaches. This underscores the critical role of judicious client selection and lays the foundation for more streamlined and effective FL implementations in the coming years.

574STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning

[openreview] [pdf]

Abstract Robot learning is witnessing a significant increase in the size, diversity, and complexity of pre-collected datasets, mirroring trends in domains such as natural language processing and computer vision. Many robot learning methods treat such datasets as multi-task expert data and learn a multi-task, generalist policy by training broadly across them. Notably, while these generalist policies can improve the average performance across many tasks, the performance of generalist policies on any one task is often suboptimal due to negative transfer between partitions of the data, compared to task-specific specialist policies. In this work, we argue for the paradigm of training policies during deployment given the scenarios they encounter: rather than deploying pre-trained policies to unseen problems in a zero-shot manner, we non-parametrically retrieve and train models directly on relevant data at test time. Furthermore, we show that many robotics tasks share considerable amounts of low-level behaviors and that retrieval at the “sub”-trajectory granularity enables significantly improved data utilization, generalization, and robustness in adapting policies to novel problems. In contrast, existing full-trajectory retrieval methods tend to underutilize the data and miss out on shared cross-task content. This work proposes STRAP, a technique for leveraging pre-trained vision foundation models and dynamic time warping to retrieve sub-sequences of trajectories from large training corpora in a robust fashion. STRAP outperforms both prior retrieval algorithms and multi-task learning methods in simulated and real experiments, showing the ability to scale to much larger offline datasets in the real world as well as the ability to learn robust control policies with just a handful of real-world demonstrations.
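
To make the retrieval step concrete, below is a minimal subsequence-DTW scorer over feature sequences (e.g., frozen vision-foundation-model embeddings); STRAP's own formulation, feature choice, and any acceleration are as described in the paper, so treat this as an illustrative baseline.

```python
import numpy as np

def subsequence_dtw_cost(query, trajectory):
    """Cost of the best DTW alignment of `query` (m x d) against any
    sub-sequence of `trajectory` (n x d). Free start and end points along
    the trajectory axis are what make this *sub*-trajectory retrieval."""
    m, n = len(query), len(trajectory)
    dist = np.linalg.norm(query[:, None, :] - trajectory[None, :, :], axis=-1)
    D = np.full((m + 1, n + 1), np.inf)
    D[0, :] = 0.0                                   # free start on trajectory
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            D[i, j] = dist[i - 1, j - 1] + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    return D[m].min()                               # free end on trajectory

# Retrieval: rank the corpus by this cost and train on the best sub-trajectories.
```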

575Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

[openreview] [pdf]

Abstract Reinforcement Learning with Human Feedback (RLHF) has achieved great success in aligning large language models (LLMs) with human preferences. Prevalent RLHF approaches are reward-based, following the Bradley-Terry (BT) model assumption, which may not fully capture the complexity of human preferences. In this paper, we explore RLHF under a general preference framework and approach it from a game-theoretic perspective. Specifically, we formulate the problem as a two-player game and propose a novel online algorithm, iterative Nash policy optimization (INPO). The key idea is to let the policy play against itself via no-regret learning, thereby approximating the Nash policy. Unlike previous methods, INPO bypasses the need for estimating the expected win rate for individual responses, which typically incurs high computational or annotation costs. Instead, we introduce a new loss objective that is directly minimized over a preference dataset. We provide theoretical analysis for our approach and demonstrate its effectiveness through experiments on various representative benchmarks. With an LLaMA-3-8B-based SFT model, INPO achieves a 42.6% length-controlled win rate on AlpacaEval 2.0 and a 37.8% win rate on Arena-Hard, showing substantial improvement over the state-of-the-art online RLHF algorithms.

576Towards Machine Theory of Mind with Large Language Model-Augmented Inverse Planning

[openreview] [pdf]

Abstract We propose a hybrid approach to machine Theory of Mind (ToM) that uses large language models (LLMs) as a mechanism for generating hypotheses and likelihood functions with a Bayesian inverse planning model that computes posterior probabilities for an agent’s likely mental states given its actions. Bayesian inverse planning models can accurately predict human reasoning on a variety of ToM tasks, but these models are constrained in their ability to scale these predictions to scenarios with a large number of possible hypotheses and actions. Conversely, LLM-based approaches have recently demonstrated promise in solving ToM benchmarks, but can exhibit brittleness and failures on reasoning tasks even when they pass otherwise structurally identical versions. By combining these two methods, our approach leverages the strengths of each component, closely matching optimal results on a task inspired by prior inverse planning models and improving performance relative to models that utilize LLMs alone or with chain-of-thought prompting. We also exhibit the model’s potential to predict mental states on open-ended tasks, offering a promising direction for future development of ToM models and the creation of socially intelligent generative agent models.
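
The Bayesian half of the hybrid is ordinary inverse planning; here is a minimal sketch in which the LLM is assumed to supply both the hypothesis set and the likelihood function (`likelihood_fn` is a stand-in name):

```python
import numpy as np

def inverse_planning_posterior(actions, hypotheses, prior, likelihood_fn):
    """Bayesian inverse planning: P(goal | actions) ∝ P(actions | goal) P(goal).
    `likelihood_fn(action, h)` returns P(action | hypothesis h), e.g. as
    elicited from an LLM; the Bayesian update itself stays exact."""
    log_post = np.log(np.asarray(prior, dtype=float))
    for a in actions:                    # accumulate evidence sequentially
        log_post += np.log([likelihood_fn(a, h) for h in hypotheses])
    log_post -= log_post.max()           # normalize in log space for stability
    post = np.exp(log_post)
    return post / post.sum()
```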

577In Search of Forgotten Domain Generalization

[openreview] [pdf]

Abstract Out-of-Domain (OOD) generalization is the ability of a model trained on one or more domains to generalize to unseen domains. In the ImageNet era of computer vision, evaluation sets for measuring a model’s OOD performance were designed to be strictly OOD with respect to style. However, the emergence of foundation models and expansive web-scale datasets has obfuscated this evaluation process, as datasets cover a broad range of domains and risk test domain contamination. In search of the forgotten domain generalization, we create large-scale datasets subsampled from LAION—LAION-Natural and LAION-Rendition—that are strictly OOD to corresponding ImageNet and DomainNet test sets in terms of style. Training CLIP models on these datasets reveals that a significant portion of their performance is explained by in-domain examples. This indicates that the OOD generalization challenges from the ImageNet era still prevail and that training on web-scale data merely creates the illusion of OOD generalization. Furthermore, through a systematic exploration of combining natural and rendition datasets in varying proportions, we identify optimal mixing ratios for model generalization across these domains. Our datasets and results re-enable meaningful assessment of OOD robustness at scale—a crucial prerequisite for improving model robustness.

578Test-Time Training for Out-of-Distribution Industrial Anomaly Detection via Robust Distribution Alignment

[openreview] [pdf]

Abstract Detecting anomalous patterns is essential for quality control in industrial applications, with state-of-the-art methods relying on large defect-free datasets to model normal distributions. However, robustness under domain shift, such as changes in lighting or sensor drift, remains a critical challenge in real-world deployment. An existing work, Generalized Normality Learning (GNL), addresses domain shifts by enforcing feature consistency through training-time augmentation, but its reliance on prior knowledge of target distributions and access to training data at inference limits flexibility. To overcome these limitations, we propose a memory bank-based anomaly detection method that avoids retraining or access to training data during inference. We improve robustness to distribution shifts via distribution-alignment-based test-time training. Our approach leverages a modified Sinkhorn distance to align distributions and handle outliers, offering a more resilient solution for industrial anomaly detection under realistic constraints. Extensive evaluations on out-of-distribution anomaly detection benchmarks demonstrate the effectiveness of our approach.
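
For reference, these are the standard balanced Sinkhorn iterations that the paper's modified, outlier-robust variant builds on (names illustrative):

```python
import numpy as np

def sinkhorn_plan(a, b, cost, eps=0.1, n_iters=200):
    """Entropic-OT transport plan between histograms a (e.g., memory-bank
    features) and b (test-time features) with cost matrix `cost`.
    Standard balanced Sinkhorn; the paper's robust modification differs."""
    K = np.exp(-cost / eps)              # Gibbs kernel
    u = np.ones_like(a)
    for _ in range(n_iters):
        v = b / (K.T @ u)                # alternate scaling updates
        u = a / (K @ v)
    return u[:, None] * K * v[None, :]   # plan; alignment cost = (plan * cost).sum()
```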

579A Theoretical Perspective: When and How Self-consuming Training Loops Generalize

[openreview] [pdf]

Abstract High-quality data is essential for training large generative models, yet the vast reservoir of real data available online has become nearly depleted. Consequently, models increasingly generate their own data for further training, forming Self-consuming Training Loops (STLs). However, the empirical results have been strikingly inconsistent: some models degrade or even collapse, while others successfully avoid these failures, leaving a significant gap in theoretical understanding to explain this discrepancy. This paper introduces the intriguing notion of recursive stability and presents the first theoretical generalization analysis, revealing how both model architecture and the proportion between real and synthetic data influence the success of STLs. We further extend this analysis to transformers in in-context learning, showing that even a constant-sized proportion of real data ensures convergence, while also providing insights into optimal synthetic data sizing.

580Rethinking Diffusion Posterior Sampling: From Conditional Score Estimator to Maximizing a Posterior

[openreview] [pdf]

Abstract Recent advancements in diffusion models have been leveraged to address inverse problems without additional training, and Diffusion Posterior Sampling (DPS) (Chung et al., 2022a) is among the most popular approaches. Previous analyses suggest that DPS accomplishes posterior sampling by approximating the conditional score. In this paper, however, we demonstrate that the conditional score approximation employed by DPS is not as effective as previously assumed; rather, it aligns more closely with the principle of maximizing a posterior (MAP). This assertion is substantiated through an examination of DPS on 512×512 ImageNet images, revealing that: 1) DPS’s conditional score estimation significantly diverges from the score of a well-trained conditional diffusion model and is even inferior to the unconditional score; 2) The mean of DPS’s conditional score estimation deviates significantly from zero, rendering it an invalid score estimation; 3) DPS generates high-quality samples with significantly lower diversity. In light of the above findings, we posit that DPS more closely resembles MAP than a conditional score estimator, and accordingly propose the following enhancements to DPS: 1) we explicitly maximize the posterior through multi-step gradient ascent and projection; 2) we utilize a lightweight conditional score estimator trained with only 100 images and 8 GPU hours. Extensive experimental results indicate that these proposed improvements significantly enhance DPS’s performance. The source code for these improvements is provided in the supplementary material.

581Provably Efficient Multi-Objective Bandit Algorithms under Preference-Centric Customization

[openreview] [pdf]

Abstract Existing multi-objective multi-armed bandit (MO-MAB) approaches mainly focus on achieving Pareto optimality. However, a Pareto optimal arm that receives a high score from one user may lead to a low score from another, since in real-world scenarios, users often have diverse preferences across different objectives. Instead, these preferences should inform customized learning, a factor usually neglected in prior research. To address this need, we study a preference-aware MO-MAB framework in the presence of explicit user preferences, where each user’s overall reward is modeled as the inner product of the user preference and the arm reward. This new framework shifts the focus from merely achieving Pareto optimality to further optimizing within the Pareto front under preference-centric customization. To the best of our knowledge, this is the first theoretical exploration of customized MO-MAB optimization based on explicit user preferences. This framework introduces new and unique challenges for algorithm design for customized optimization. To address these challenges, we incorporate preference estimation and preference-aware optimization as key mechanisms for preference adaptation, and develop new analytical techniques to rigorously account for the impact of preference estimation errors on overall performance. Under this framework, we consider three preference structures inspired by practical applications, with tailored algorithms that are proven to achieve near-optimal regret, and show good numerical performance.
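
To illustrate the overall-reward model, here is a one-round sketch of a UCB-style rule that scalarizes per-objective empirical means with the user's preference vector; this is our own illustration, not one of the paper's three tailored algorithms:

```python
import numpy as np

def preference_ucb_round(means, counts, t, preference, c=2.0):
    """Pick an arm by preference-weighted value plus exploration bonus.

    means:      (n_arms, n_objectives) empirical per-objective rewards
    counts:     (n_arms,) pull counts so far
    preference: (n_objectives,) known user preference vector
    """
    scalarized = means @ preference                       # inner-product reward
    bonus = np.sqrt(c * np.log(t + 1) / np.maximum(counts, 1))
    return int(np.argmax(scalarized + bonus))
```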

582Statistical Tractability of Off-policy Evaluation of History-dependent Policies in POMDPs

[openreview] [pdf]

Abstract We investigate off-policy evaluation (OPE), a central and fundamental problem in reinforcement learning (RL), in the challenging setting of Partially Observable Markov Decision Processes (POMDPs) with large observation spaces. Recent works of Uehara et al. (2023a); Zhang & Jiang (2024) developed a model-free framework and identified important coverage assumptions (called belief and outcome coverage) that enable accurate OPE of memoryless policies with polynomial sample complexities, but handling more general target policies that depend on the entire observable history remained an open problem. In this work, we prove information-theoretic hardness for model-free OPE of history-dependent policies in several settings, characterized by additional assumptions imposed on the behavior policy (memoryless vs. history-dependent) and/or the state-revealing property of the POMDP (single-step vs. multi-step revealing). We further show that some hardness can be circumvented by a natural model-based algorithm—whose analysis has surprisingly eluded the literature despite the algorithm’s simplicity—demonstrating provable separation between model-free and model-based OPE in POMDPs.

583Diffusion Attribution Score: Which Training Sample Determines Your Generation?

[openreview] [pdf]

Abstract As diffusion models advance, the scientific community is actively developing methods to curb the misuse of generative models, aiming to prevent the reproduction of copyrighted, explicitly violent, or personally sensitive information in generated images. One strategy is to identify the contribution of training samples in generative models by evaluating their influence on the generated images, a task known as data attribution. Existing data attribution approaches on diffusion models suggest representing the contribution of a specific training sample by evaluating the change in the diffusion loss when the sample is included versus excluded from the training process. However, we argue that directly using the diffusion loss cannot represent such a contribution accurately, owing to how the diffusion loss is calculated. Specifically, these approaches measure the divergence between predicted and ground truth distributions, which leads to an indirect comparison between the predicted distributions and cannot represent the variances between model behaviors. To address these issues, we aim to measure the direct comparison between predicted distributions with an attribution score that analyses the importance of training samples, which is achieved by the Diffusion Attribution Score (DAS). Underpinned by rigorous theoretical analysis, we elucidate the effectiveness of DAS. Additionally, we explore strategies to accelerate DAS calculations, facilitating its application to large-scale diffusion models. Our extensive experiments across various datasets and diffusion models demonstrate that DAS significantly surpasses previous benchmarks in terms of the linear data-modelling score, establishing new state-of-the-art performance. Code is available at https://anonymous.4open.science/r/Diffusion-Attribution-Score-411F.

584Comparing Targeting Strategies for Maximizing Social Welfare with Limited Resources

[openreview] [pdf]

Abstract Machine learning is increasingly used to select which individuals receive limited-resource interventions in domains such as human services, education, development, and more. However, it is often not apparent what the right quantity is for models to predict. In particular, policymakers rarely have access to data from a randomized controlled trial (RCT) that would enable accurate estimates of treatment effects – which individuals would benefit more from the intervention. Observational data is more likely to be available, creating a substantial risk of bias in treatment effect estimates. Practitioners instead commonly use a technique termed “risk-based targeting”, where the model is used simply to predict each individual’s status quo outcome (an easier, non-causal task). Those with higher predicted risk are offered treatment. There is currently almost no empirical evidence to inform which choices lead to the most effective machine-learning-informed targeting strategies in social domains. In this work, we use data from 5 real-world RCTs in a variety of domains to empirically assess such choices. We find that risk-based targeting is almost always inferior to targeting based on even biased estimates of treatment effects. Moreover, these results hold even when the policymaker has strong normative preferences for assisting higher-risk individuals. Our results imply that, despite the widespread use of risk prediction models in applied settings, practitioners may be better off incorporating even weak evidence about heterogeneous causal effects to inform targeting.

585Efficient and Accurate Explanation Estimation with Distribution Compression

[openreview] [pdf]

Abstract Exact computation of various machine learning explanations requires numerous model evaluations and in extreme cases becomes impractical. The computational cost of approximation increases with the ever-increasing size of data and model parameters. Many heuristics have been proposed to approximate post-hoc explanations efficiently. This paper shows that the standard i.i.d. sampling used in a broad spectrum of algorithms for explanation estimation leads to an approximation error that leaves substantial room for improvement. To this end, we introduce compress then explain (CTE), a new paradigm for more efficient and accurate explanation estimation. CTE uses distribution compression through kernel thinning to obtain a data sample that best approximates the marginal distribution. We show that CTE improves the estimation of removal-based local and global explanations with negligible computational overhead. It often achieves an on-par explanation approximation error using 2-3x fewer samples, i.e. requiring 2-3x fewer model evaluations. CTE is a simple yet powerful plug-in for any explanation method that currently relies on i.i.d. sampling.
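
A minimal sketch of the compress-then-explain flow, with a greedy kernel-herding selector standing in for proper kernel thinning (the paper uses kernel thinning itself; all names below are illustrative):

```python
import numpy as np

def greedy_compress(X, m, bandwidth=1.0):
    """Greedily pick m points whose Gaussian-kernel mean embedding best
    matches the full sample's -- a simple stand-in for kernel thinning."""
    k = lambda A, B: np.exp(-np.linalg.norm(A[:, None] - B[None, :], axis=-1) ** 2
                            / (2 * bandwidth ** 2))
    G = k(X, X)
    target = G.mean(axis=1)              # kernel mean of the full sample
    chosen = []
    for _ in range(m):
        best_i, best = None, np.inf
        for i in range(len(X)):
            if i in chosen:
                continue
            score = np.abs(target - G[:, chosen + [i]].mean(axis=1)).sum()
            if score < best:
                best_i, best = i, score
        chosen.append(best_i)
    return X[chosen]

# Usage idea: background = greedy_compress(X_train, m=128), then pass
# `background` (instead of an i.i.d. subsample) to your explainer of choice.
```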

586Adaptive Priors from Learning Trajectories for Function-Space Bayesian Neural Networks

[openreview] [pdf]

Abstract Tractable Function-space Variational Inference (T-FVI) provides a way to estimate the function-space Kullback-Leibler (KL) divergence between a random prior function and its posterior. This allows the optimization of the function-space KL divergence via Stochastic Gradient Descent (SGD) and thus simplifies the training of function-space Bayesian Neural Networks (BNNs). However, function-space BNNs on high-dimensional datasets typically require deep neural networks (DNN) with numerous parameters, and thus defining suitable function-space priors remains challenging. For instance, the Gaussian Process (GP) prior suffers from scalability issues, and DNNs do not provide a clear way to set appropriate weight parameters to achieve meaningful function-space priors. To address this issue, we propose an explicit form of function-space priors that can be easily integrated into widely-used DNN architectures, while adaptively incorporating different levels of uncertainty based on the function’s inputs. To achieve this, we consider DNNs as Bayesian last-layer models to obtain the explicit mean and variance functions of our prior. The parameters of these explicit functions are determined using the weight statistics over the learning trajectory. Our empirical experiments show improved uncertainty estimation in image classification, transfer learning, and UCI regression tasks.

587Generalization Performance Gap Analysis between Centralized and Federated Learning: How to Bridge this Gap?

[openreview] [pdf]

Abstract The rising interest in decentralized data and privacy protection has led to the emergence of Federated Learning. Many studies have compared federated training with classical centralized training and found empirically that models trained in a federated setup with equal resources perform worse on tasks. However, these studies have generally been empirical and have not explored the performance gap from a theoretical perspective. The lack of theoretical understanding prevents determining whether federated algorithms are necessarily inferior to centralized algorithms in performance, and how large this gap is under given training settings. It also hinders identifying valid ways to close this performance gap. This paper fills this theoretical gap by formulating federated training as an SGD (Stochastic Gradient Descent) optimization problem over decentralized data and defining the performance gap within the PAC-Bayes (Probably Approximately Correct Bayesian) framework. Through theoretical analysis, we derive non-vacuous bounds on this performance gap, revealing that a difference in generalization performance necessarily exists when training resources are equal for both setups and that variations in the training parameters affect the gap. Moreover, we prove that completely eliminating the performance gap is only possible by introducing new clients or adding new data to existing clients; advantages in other training resources, such as giving larger models or more communication rounds to federated scenarios, cannot close the gap. Our theoretical findings are validated by extensive experimental results from different model architectures and datasets.

588FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors

[openreview] [pdf]

Abstract Neural Radiance Fields (NeRF) face significant challenges in few-shot scenarios, particularly due to overfitting and long training times for high-fidelity rendering. While current approaches like FreeNeRF and SparseNeRF use frequency regularization or pre-trained priors, they can be limited by complex scheduling or potential biases. We introduce FrugalNeRF, a novel few-shot NeRF framework that leverages weight-sharing voxels across multiple scales to efficiently represent scene details. Our key contribution is a cross-scale geometric adaptation training scheme that selects pseudo ground truth depth based on reprojection error from both training and novel views across scales. This guides training without relying on externally learned priors, allowing FrugalNeRF to fully utilize available data. While not dependent on pre-trained priors, FrugalNeRF can optionally integrate them for enhanced quality without affecting convergence speed. Our method generalizes effectively across diverse scenes and converges more rapidly than state-of-the-art approaches. Our experiments on standard LLFF, DTU, and RealEstate-10K datasets demonstrate that FrugalNeRF outperforms existing few-shot NeRF models, including those using pre-trained priors, while significantly reducing training time, making it a practical solution for efficient and accurate 3D scene reconstruction.

589Sampling Process Brings Additional Bias for Debiased Recommendation

[openreview] [pdf]

Abstract In recommender systems, selection bias arises from the users’ selective interactions with items, which poses a widely-recognized challenge for unbiased evaluation and learning for recommendation models. Recently, doubly robust learning and its variants have been widely studied to achieve debiased learning of prediction models. However, if the users and items in the training set are not exactly the same as those in the test set, even if the imputed errors and learned propensities are accurate, all previous doubly robust based debiasing methods are biased. To tackle this problem, in this paper, we first derive the bias of doubly robust learning methods and provide alternative unbiasedness conditions when users and items are sampled from a superpopulation. Then we propose a novel superpopulation doubly robust target learning approach (SuperDR), which is unbiased when either the imputation model or propensity model is correctly specified. We further derive the generalization error bound of the proposed method under superpopulation, and show that it can be effectively controlled by the proposed target learning approach. We conduct extensive experiments on three real-world datasets, including a large-scale industrial dataset, to demonstrate the effectiveness of our method.
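
For context, the classical doubly robust error estimator that SuperDR generalizes fits in a few lines (names illustrative; the superpopulation-targeted correction is the paper's contribution and is not shown):

```python
import numpy as np

def doubly_robust_error(observed, errors, imputed_errors, propensities):
    """Classical DR estimate of average prediction error over all user-item
    pairs: imputed error everywhere, plus inverse-propensity-weighted
    residual corrections on observed pairs. `errors` only needs to be valid
    where observed == 1; all arrays are flattened over user-item pairs."""
    correction = observed * (errors - imputed_errors) / propensities
    return np.mean(imputed_errors + correction)
```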

590Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher

[openreview] [pdf]

Abstract Knowledge distillation aims to transfer knowledge from a large teacher model to a compact student counterpart, often with a significant performance gap between them. We find that a too-large performance gap can hamper the training process, which is also verified in recent studies. To address this, we propose a Gap Preserving Distillation (GPD) method that trains an additional dynamic teacher model from scratch alongside the student to bridge this gap. In this way, it becomes possible to maintain a reasonable performance gap between teacher and student during the whole distillation process. To further strengthen distillation from the dynamic teacher to the student, we develop a hard strategy by enforcing them to share parameters and encouraging parameter inheritance. Besides the hard strategy, we also build soft bidirectional mappings between them, based on an Inverse Reparameterization (IR) method and a Channel-Branch Reparameterization (CBR) strategy. We highlight that our IR is able to initialize a larger dynamic teacher with an arbitrary expansion ratio while preserving exactly the same accuracy as the given student model. In this way, it guarantees that the dynamic teacher and student start from the same point, avoiding a too-large gap in the early stage of training. As for our CBR, with parameter sharing, it directly extracts an effective student model from the well-learned dynamic teacher without any post-training, making our method highly flexible for model deployment. In the experiments, GPD significantly outperforms existing distillation methods on both CNN and transformer architectures, achieving up to 1.58% accuracy improvement. Interestingly, GPD also generalizes well to scenarios without a pretrained teacher, including training from scratch and fine-tuning, yielding a large improvement of 1.80% and 0.89% on ResNet18, respectively.

591High dimensional Bayesian Optimization via Condensing-Expansion Projection

[openreview] [pdf]

Abstract In high-dimensional settings, Bayesian optimization (BO) can be expensive and infeasible. The random embedding Bayesian optimization algorithm is commonly used to address high-dimensional BO challenges. However, this method relies on the effective subspace assumption on the optimization problem’s objective function, which limits its applicability. In this paper, we introduce Condensing-Expansion Projection Bayesian optimization (CEPBO), a novel random projection-based approach for high-dimensional BO that does not rely on the effective subspace assumption. The approach is both simple to implement and highly practical. We present two algorithms based on different random projection matrices: the Gaussian projection matrix and the hashing projection matrix. Experimental results demonstrate that both algorithms outperform existing random embedding-based algorithms in most cases, achieving superior performance on high-dimensional BO problems. The code is available at https://anonymous.4open.science/r/CEPBO-14429.

592Learning system dynamics without forgetting

[openreview] [pdf]

Abstract Observation-based trajectory prediction for systems with unknown dynamics is essential in fields such as physics and biology. Most existing approaches are limited to learning within a single system with fixed dynamics patterns. However, many real-world applications require learning across systems with evolving dynamics patterns, a challenge that has been largely overlooked. To address this, we systematically investigate the problem of Continual Dynamics Learning (CDL), examining task configurations and evaluating the applicability of existing techniques, while identifying key challenges. In response, we propose the Mode-switching Graph ODE (MS-GODE) model, which integrates the strengths of LG-ODE and sub-network learning with a mode-switching module, enabling efficient learning over varying dynamics. Moreover, we construct a novel benchmark of biological dynamic systems for CDL, Bio-CDL, featuring diverse systems with disparate dynamics and significantly enriching the research field of machine learning for dynamic systems. Our code and benchmark datasets will be publicly available.

593Expected Sliced Transport Plans

[openreview] [pdf]

Abstract The optimal transport (OT) problem has gained significant traction in modern machine learning for its ability to: (1) provide versatile metrics, such as Wasserstein distances and their variants, and (2) determine optimal couplings between probability measures. To reduce the computational complexity of OT solvers, methods like entropic regularization and sliced optimal transport have been proposed. The sliced OT framework improves efficiency by comparing one-dimensional projections (slices) of high-dimensional distributions. However, despite their computational efficiency, sliced-Wasserstein approaches lack a transportation plan between the input measures, limiting their use in scenarios requiring explicit coupling. In this paper, we address two key questions: Can a transportation plan be constructed between two probability measures using the sliced transport framework? If so, can this plan be used to define a metric between the measures? We propose a ‘lifting’ operation to extend one-dimensional optimal transport plans back to the original space of the measures. By computing the expectation of these lifted plans, we derive a new transportation plan, termed expected sliced transport (EST) plans. We further prove that using the EST plan to weight the sum of the individual Euclidean costs |x − y|^p for moving from x to y results in a valid metric between the input discrete probability measures. Finally, we demonstrate the connection between our approach and the recently proposed min-SWGG, along with illustrative numerical examples that support our theoretical findings.

594LEARN TO LEARN CONSISTENTLY

[openreview] [pdf]

Abstract In the few-shot learning problem, a model trained on a disjoint meta-train dataset is required to address novel tasks with limited novel examples. A key challenge in few-shot learning is the model’s propensity to learn biased shortcut features (e.g., background, noise, shape, color), which are sufficient to distinguish the few examples during fast adaptation but lead to poor generalization. In our work, we observed that when the model learns with higher consistency, it tends to be less influenced by shortcut features, resulting in better generalization. Based on this observation, we propose a simple yet effective meta-learning method named Meta Self-Distillation. By maximizing the consistency of the learned knowledge during the meta-train phase, the model initialized by our method shows better generalization in the meta-test phase. Extensive experiments demonstrate that our method improves the model’s generalization across various few-shot classification scenarios and enhances the model’s ability to learn consistently.

595RecFlow: An Industrial Full Flow Recommendation Dataset

[openreview] [pdf]

Abstract Industrial recommendation systems (RS) rely on the multi-stage pipeline to balance effectiveness and efficiency when delivering items from a vast corpus to users. Existing RS benchmark datasets primarily focus on the exposure space, where novel RS algorithms are trained and evaluated. However, when these algorithms transition to real-world industrial RS, they face a critical challenge: handling unexposed items—a significantly larger space than the exposed one. This discrepancy profoundly impacts their practical performance. Additionally, these algorithms often overlook the intricate interplay between multiple RS stages, resulting in suboptimal overall system performance. To address this issue, we introduce RecFlow—an industrial full-flow recommendation dataset designed to bridge the gap between offline RS benchmarks and the real online environment. Unlike existing datasets, RecFlow includes samples not only from the exposure space but also unexposed items filtered at each stage of the RS funnel. Our dataset comprises 38M interactions from 42K users across nearly 9M items, with an additional 1.9B stage samples collected from 9.3M online requests over 37 days, spanning 6 stages. Leveraging the RecFlow dataset, we conduct exploratory experiments, showcasing its potential in designing new algorithms that enhance effectiveness by incorporating stage-specific samples. Some of these algorithms have already been deployed online, consistently yielding significant gains. We propose RecFlow as the first comprehensive benchmark dataset for the RS community, supporting research on designing algorithms at any stage, study of selection bias, debiased algorithms, multi-stage consistency and optimality, multi-task recommendation, and user behavior modeling. The RecFlow dataset, along with the corresponding source code, is publicly available at https://github.com/RecFlow-ICLR/RecFlow. The dataset is licensed under CC-BY-NC-SA-4.0 International License.

596OOD-Chameleon: Is Algorithm Selection for OOD Generalization Learnable?

[openreview] [pdf]

Abstract Out-of-distribution (OOD) generalization is challenging because distribution shifts come in many forms. A multitude of learning algorithms exist and each can improve performance in specific OOD situations. We posit that much of the challenge of OOD generalization lies in choosing the right algorithm for the right dataset. However, such algorithm selection is often elusive under complex real-world shifts. In this work, we formalize the task of algorithm selection for OOD generalization and investigate whether it could be approached by learning. We propose a solution, dubbed OOD-Chameleon, that treats the task as a supervised classification over candidate algorithms. We construct a dataset of datasets to learn from, which represents diverse types, magnitudes and combinations of shifts (covariate shift, label shift, spurious correlations). We train the model to predict the relative performance of algorithms given a dataset’s characteristics. This enables a priori selection of the best learning strategy, i.e. without training various models as needed with traditional model selection. Our experiments show that the adaptive selection outperforms any individual algorithm and simple selection heuristics, on unseen datasets of controllable and realistic image data. Inspecting the model shows that it learns non-trivial data/algorithm interactions, and reveals the conditions for any one algorithm to surpass another. This opens new avenues for (1) enhancing OOD generalization with existing algorithms instead of designing new ones, and (2) gaining insights into the applicability of existing algorithms with respect to datasets’ properties.

597Breaking Free: Hacking Diffusion Models for Generating Adversarial Examples and Bypassing Safety Guardrails

[openreview] [pdf]

Abstract Deep neural networks can be exploited using natural adversarial samples, which do not impact human perception. Current approaches often rely on synthetically altering the distribution of adversarial samples compared to the training distribution. In contrast, we propose EvoSeed, a novel evolutionary strategy-based algorithmic framework that uses auxiliary Conditional Diffusion and Classifier models to generate photo-realistic natural adversarial samples. We employ CMA-ES to optimize the initial seed vector search, which, when processed by the Conditional Diffusion Model, results in the natural adversarial sample misclassified by the Classifier Model. Experiments show that generated adversarial images are of high image quality, raising concerns about generating harmful content bypassing safety classifiers. We also show that beyond generating adversarial images, EvoSeed can also be used as a red-teaming tool to understand classification systems’ misclassification. Our research opens new avenues for understanding the limitations of current safety mechanisms and the risk of plausible attacks against classifier systems using image generation.
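
A minimal sketch of the seed-space search loop using the `cma` package, with stand-in `generate`/`classify` callables (EvoSeed's actual objective and constraints are richer):

```python
import numpy as np
import cma  # pip install cma

def evolve_adversarial_seed(generate, classify, true_label, seed_dim, iters=50):
    """Search the diffusion model's initial-seed space with CMA-ES so the
    generated image is misclassified. `generate(seed) -> image` and
    `classify(image) -> class probabilities` are illustrative stand-ins."""
    es = cma.CMAEvolutionStrategy(np.zeros(seed_dim), 0.5)
    for _ in range(iters):
        seeds = es.ask()
        # fitness = probability of the true label: lower means "more adversarial"
        fitness = [float(classify(generate(np.asarray(s)))[true_label]) for s in seeds]
        es.tell(seeds, fitness)
    return np.asarray(es.result.xbest)
```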

598Imagine to Ensure Safety in Hierarchical Reinforcement Learning

[openreview] [pdf]

Abstract This work investigates the safe exploration problem, where an agent must maximize performance while satisfying safety constraints. To address this problem, we propose a method that includes a learnable world model and two policies, a high-level policy and a low-level policy, that ensure safety at both levels. The high-level policy generates safe subgoals for the low-level policy, which progressively guide the agent towards the final goal. Through trajectory imagination, the low-level policy learns to safely reach these subgoals. The proposed method was evaluated on the standard benchmark, SafetyGym, and demonstrated superior performance while maintaining a comparable number of safety violations relative to state-of-the-art approaches. In addition, we investigated an alternative implementation of safety in hierarchical reinforcement learning (HRL) algorithms using Lagrange multipliers, and demonstrated in the custom long-horizon environment SafeAntMaze that our approach achieves comparable performance while satisfying safety constraints more effectively, whereas the flat safe policy fails to accomplish this task.

599Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration

[openreview] [pdf]

Abstract We study imitation learning, where the goal is to learn a policy that mimics the expert’s behavior. In practice, it is often challenging to learn the expert policy accurately from a limited number of demonstrations due to the complexity of the state space. Moreover, it is essential to explore the environment and collect data to achieve beyond-expert performance. To overcome these challenges, we propose a novel imitation learning algorithm, namely Imitation Learning with Double Exploration (ILDE), which implements exploration in two aspects: (1) optimistic policy optimization via an exploration bonus that rewards state-action pairs with high uncertainty, to potentially improve the convergence to the expert policy; and (2) curiosity-driven exploration of the states that deviate from the demonstration trajectories, to potentially yield beyond-expert performance. Empirically, we demonstrate that ILDE outperforms state-of-the-art imitation learning algorithms in terms of sample efficiency and achieves beyond-expert performance on Atari and MuJoCo tasks with fewer demonstrations than those used in previous work. We also provide a theoretical justification of ILDE as an uncertainty-regularized policy optimization method with optimistic exploration, leading to a regret that grows sublinearly in the number of episodes.

600Inertial Confinement Fusion Forecasting via Large Language Models

[openreview] [pdf]

Abstract Controlled fusion energy is deemed pivotal for the advancement of human civilization. In this study, we introduce LPI-LLM, a novel integration of Large Language Models (LLMs) with classical reservoir computing paradigms tailored to address a critical challenge, Laser-Plasma Instabilities (LPI), in Inertial Confinement Fusion (ICF). Our approach offers several key contributions: Firstly, we propose the LLM-anchored Reservoir, augmented with a Fusion-specific Prompt, enabling accurate forecasting of LPI-generated hot-electron dynamics during implosion. Secondly, we develop Signal-Digesting Channels to temporally and spatially describe the driver laser intensity across time, capturing the unique characteristics of ICF inputs. Lastly, we design the Confidence Scanner to quantify the confidence level in forecasting, providing valuable insights for domain experts to design the ICF process. Extensive experiments demonstrate the superior performance of our method, achieving 1.90 CAE, 0.14 top-1 MAE, and 0.11 top-5 MAE in predicting Hard X-ray (HXR) energies emitted by the hot electrons in ICF implosions, which presents state-of-the-art comparisons against concurrent best systems. Additionally, we present LPI4AI, the first LPI benchmark based on physical experiments, aimed at fostering novel ideas in LPI research and enhancing the utility of LLMs in scientific exploration. Overall, our work strives to forge an innovative synergy between AI and ICF for advancing fusion energy.

601Can a Bayesian oracle prevent harm from an agent?

[openreview] [pdf]

Abstract Is there a way to design powerful AI systems based on machine learning methods that would satisfy probabilistic safety guarantees? With the long-term goal of obtaining a probabilistic guarantee that would apply in every context, we consider estimating a context-dependent bound on the probability of violating a given safety specification. Such a risk evaluation would need to be performed at run-time to provide a guardrail against dangerous actions of an AI. Noting that different plausible hypotheses about the world could produce very different outcomes, and because we do not know which one is right, we derive bounds on the safety violation probability predicted under the true but unknown hypothesis. Such bounds could be used to reject potentially dangerous actions. Our main results involve searching for cautious but plausible hypotheses, obtained by a maximization that involves Bayesian posteriors over hypotheses. We consider two forms of this result, in the i.i.d. case and in the non-i.i.d. case, and conclude with open problems towards turning such theoretical results into practical AI guardrails.

602Pan for gold

[openreview] [pdf]

Abstract Training a deep model is fundamentally about reducing loss, and we often believe that a “good model” is one trained with a “good loss.” This paper investigates that belief. We show that even when learning with unstructured, randomized labels, models can still discover generalized features. We propose that generalization in deep learning is not about learning the structure of data through a well-structured loss, but rather a process akin to “pan for gold,” where gradient descent shakes through the function space, naturally stabilizing useful features. To support this, we present quantitative and qualitative experimental evidence, and introduce the Panning through Unstructured Label (PUL) algorithm. We demonstrate its effectiveness across various fields, showing improvements in unsupervised domain adaptation, state-of-the-art performance in object discovery, and its ability to mitigate massive attention issues. Finally, we offer a new interpretation of existing deep learning assumptions, challenging conventional beliefs in the field.

603Synthetic Theorem Generation in Lean

[openreview] [pdf]

Abstract The application of large language models (LLMs) to theorem proving presents a promising avenue for advancing formal mathematics. Interactive theorem provers, such as Lean, offer a rigorous framework within which these models can assist in or automate proof discovery, grounding their reasoning capabilities in a sound, verifiable formal system. However, the potential of LLMs in this domain is constrained by the limited availability of formal proof corpora for training. To address this limitation, we introduce a synthetic theorem generator capable of producing novel Lean theorems and their corresponding proofs. Our approach employs forward reasoning to synthesize new propositions from premises drawn from existing Lean libraries. We explore candidate reasoning steps using a search strategy that optimizes for diversity of output, apply them in a linear fashion that avoids irrelevant proof steps, and assess their effect by meta-programmatically executing corresponding Lean tactics. These methods enable the generation of an arbitrary number of new theorems and proofs across various mathematical domains, using common Lean proof tactics while ensuring the correctness of generated theorems by construction. We demonstrate the efficacy of the generated theorems and training data by fine-tuning models on synthetic theorems and evaluating them on the miniF2F-test benchmark. Our results show improvements in theorem-proving capabilities, with accuracy increasing from 37.3% to 38.5% for the Falcon2-11B model trained solely on Mathlib, and from 38.1% to 39.3% for the same model trained on a mix of rich datasets. These improvements highlight the value of our diverse synthetic data in augmenting limited existing corpora of formal proofs, providing complementary information that enhances LLMs’ performance on theorem-proving tasks even when combined with other datasets.
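
To give a flavor of correct-by-construction synthesis, here is a toy Lean 4 theorem of the kind a forward-reasoning generator might emit by chaining two core-library premises; the theorem and its name are our own illustration, not an output of the paper's system.

```lean
-- A "synthesized" proposition built by chaining library premises
-- (Nat.add_assoc, Nat.add_comm); the proof is correct by construction.
theorem synth_add_rot (a b c : Nat) : a + (b + c) = c + (a + b) := by
  rw [← Nat.add_assoc a b c, Nat.add_comm (a + b) c]
```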

604Do LLM Agents Have Regret? A Case Study in Online Learning and Games

[openreview] [pdf]

Abstract Large language models (LLMs) have been increasingly employed for (interactive) decision-making, via the development of LLM-based autonomous agents. Despite their emerging successes, the performance of LLM agents in decision-making has not been fully investigated through quantitative metrics, especially in the multi-agent setting when they interact with each other, a typical scenario in real-world LLM-agent applications. To better understand the limits of LLM agents in these interactive environments, we propose to study their interactions in benchmark decision-making settings in online learning and game theory, through the performance metric of regret. We first empirically study the no-regret behaviors of LLMs in canonical non-stochastic online learning problems, as well as the emergence of equilibria when LLM agents interact through playing repeated games. We then provide some theoretical insights into the no-regret behaviors of LLM agents, under certain assumptions on the supervised pre-training and the rationality model of human decision-makers who generate the data. Notably, we also identify (simple) cases where advanced LLMs such as GPT-4 fail to be no-regret. To further promote the no-regret behaviors, we propose a novel unsupervised training loss of regret-loss, which, in contrast to the supervised pre-training loss, does not require the labels of (optimal) actions. Finally, we establish the statistical guarantee of generalization bound for regret-loss minimization, and more importantly, the optimization guarantee that minimizing such a loss may automatically lead to known no-regret learning algorithms, when single-layer self-attention models are used. Our further experiments demonstrate the effectiveness of our regret-loss, especially in addressing the above “regrettable” cases.
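
For reference, the regret notion at the heart of the study is the standard external regret of online learning (the paper's game-theoretic variants build on the same quantity):

```latex
% External regret of an agent playing a_1, ..., a_T against losses \ell_t:
\mathrm{Regret}_T \;=\; \sum_{t=1}^{T} \ell_t(a_t) \;-\; \min_{a \in \mathcal{A}} \sum_{t=1}^{T} \ell_t(a)
% "No-regret" means sublinear growth: \mathrm{Regret}_T / T \to 0 as T \to \infty.
```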

605MissDiff: Training Diffusion Models on Tabular Data with Missing Values

[openreview] [pdf]

Abstract The diffusion model has shown remarkable performance in modeling data distributions and synthesizing data. However, the vanilla diffusion model requires complete or fully observed training data. Incomplete data is a common issue in various real-world applications, including healthcare and finance, particularly when dealing with tabular datasets. This work considers learning from data with missing values for missing value imputations and generating synthetic complete data in a unified framework. With minimal assumptions on the missing mechanisms, our method models the score of complete data distribution by denoising score matching on data with missing values. We prove that the proposed method can recover the score of the complete data distribution, and the proposed training objective serves as an upper bound for the negative likelihood of observed data. Extensive experiments on imputation tasks together with generation tasks demonstrate that our proposed framework outperforms existing state-of-the-art approaches on multiple tabular datasets.
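
A minimal sketch of what a masked denoising-score-matching objective of this flavor can look like, assuming a score network that receives the observation mask; the noise scale and the masking interface are illustrative choices, not the paper's exact recipe:

```python
import torch

def masked_dsm_loss(score_net, x, obs_mask, sigma=0.1):
    """Denoising score matching evaluated only on observed entries.
    `score_net`, `sigma`, and the masking scheme are assumptions."""
    noise = torch.randn_like(x)
    x_tilde = x + sigma * noise
    target = -noise / sigma  # score of the Gaussian perturbation kernel
    pred = score_net(x_tilde * obs_mask, obs_mask)
    # Missing entries contribute nothing to the loss.
    return (((pred - target) ** 2) * obs_mask).sum() / obs_mask.sum()
```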

606Unified Perspectives on Signal-to-Noise Diffusion Models

[openreview] [pdf]

Abstract Diffusion models (DM) have become essential components of generative modeling, demonstrating exceptional performance in domains like image synthesis, audio generation, and complex data interpolation. Signal-to-Noise diffusion models represent a broad family encompassing many state-of-the-art models. Although several efforts have been made to explore Signal-to-Noise (S2N) diffusion models from different angles, a comprehensive study that connects these viewpoints and introduces new insights is still needed. In this work, we provide an in-depth perspective on noise schedulers, analyzing their role through the lens of the signal-to-noise ratio (SNR) and its relationship to information theory. Based on this framework, we introduce a generalized backward equation to improve the efficiency of the inference process.
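
For reference, the standard signal-to-noise parameterization this model family shares can be written as follows; the notation matches common usage rather than the paper's exact symbols:

```latex
% Forward process scales the signal by \alpha_t and adds noise of scale \sigma_t;
% the SNR is the squared ratio of the two and decreases monotonically in t.
q(x_t \mid x_0) = \mathcal{N}\!\left(x_t;\ \alpha_t x_0,\ \sigma_t^2 I\right),
\qquad
\mathrm{SNR}(t) = \frac{\alpha_t^2}{\sigma_t^2}.
```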

607Causally Motivated Diffusion Sampling Frameworks for Harnessing Contextual Bias

[openreview] [pdf]

Abstract Diffusion models have shown remarkable performance in text-guided image generation when trained on large-scale datasets, usually collected from the Internet. These large-scale datasets have contextual biases (e.g., co-occurrence of objects) which will naturally cascade into the diffusion model. For example, given a text prompt of "a photo of the living room", diffusion models frequently generate a couch, a rug, and a lamp together while rarely generating objects that do not commonly occur in a living room. Intuitively, contextual bias can be helpful because it naturally draws the scene even without detailed information (i.e., visual autofill). On the other hand, contextual bias can limit the diversity of generated images (e.g., diverse object combinations) to focus on common image compositions. To have the best of both worlds, we argue that contextual bias needs to be strengthened or weakened depending on the situation. Previous causally-motivated studies have tried to deal with such issues by analyzing confounders (i.e., contextual bias) and augmenting training data or designing their models to directly learn the interventional distribution. However, due to the large-scale nature of these models, obtaining and analyzing the data or training the huge model from scratch is beyond reach in practice. To tackle this problem, we propose two novel frameworks for strengthening or weakening the contextual bias of pretrained diffusion models without training any parameters or accessing training data. Briefly, we first propose causal graphs to explicitly model contextual bias in the generation process. We then sample the hidden confounder due to contextual bias by sampling from a chain of pretrained large-scale models. Finally, we use samples from the confounder to strengthen or weaken the contextual bias based on methods from causal inference. Experimental results show that our proposed methods are effective in generating more realistic and diverse images than the regular sampling method.

608Critique-out-Loud Reward Models

[openreview] [pdf]

Abstract Traditionally, reward models used for reinforcement learning from human feedback (RLHF) are trained to directly predict preference scores without leveraging the generation capabilities of the underlying large language model (LLM). This limits the capabilities of reward models as they must reason implicitly about the quality of a response, i.e., preference modeling must be performed in a single forward pass through the model. To enable reward models to reason explicitly about the quality of a response, we introduce Critique-out-Loud (CLoud) reward models. CLoud reward models operate by first generating a natural language critique of the assistant’s response that is then used to predict a scalar reward for the quality of the response. We demonstrate the success of CLoud reward models for both Llama-3-8B and 70B base models: compared to classic reward models, CLoud reward models improve pairwise preference classification accuracy on RewardBench by 4.65 and 5.84 percentage points for the 8B and 70B base models respectively. Furthermore, CLoud reward models lead to a Pareto improvement for win rate on ArenaHard when used as the scoring model for Best-of-N. Finally, we explore how to exploit the dynamic inference compute capabilities of CLoud reward models by performing self-consistency decoding for reward prediction.

609Federated Learning in Streaming Subspace

[openreview] [pdf]

Abstract Federated learning (FL) has received widespread attention due to its distributed training and privacy protection. However, existing federated learning methods encounter significant challenges, such as increased communication costs and degraded model performance, when processing non-independently and identically distributed (non-IID) data. This paper jointly alleviates these problems by analyzing and exploiting the low-rank properties of global model trajectories. Primarily, we introduce a streaming subspace update strategy and then propose a general federated learning framework, Federated Learning in Streaming Subspace (FLSS). In FLSS, local model updates are restricted to the global streaming subspace, resulting in low-dimensional trajectories. The server then aggregates these trajectories to update the global model. Comprehensive experiments verify the effectiveness of our framework. On CIFAR-100, the FLSS-equipped FL method outperforms the baseline by 2.14% and reduces the communication cost by 80%. FLSS utilizes the early training information of the global model to simultaneously improve the performance and communication efficiency of federated learning.

610Taming Transformer Without Using Learning Rate Warmup

[openreview] [pdf]

Abstract Scaling Transformer to a large scale without using technical tricks such as learning rate warmup and an obviously lower learning rate is an extremely challenging task, and is increasingly gaining more attention. In this paper, we provide a theoretical analysis of training Transformers and reveal a key problem behind the model crash phenomenon in training, i.e., the spectral energy concentration of $W_q^{\top} W_k$, which is the reason for a malignant entropy collapse. To remedy this problem, motivated by Weyl's Inequality, we present a novel optimization strategy: making weight updates in successive steps smooth. That is, if the ratio $\frac{\sigma_{1}(\nabla W_t)}{\sigma_{1}(W_{t-1})}$ is larger than a threshold, where $\nabla W_t$ is the update quantity in step $t$, we automatically bound the learning rate to a weighted multiple of $\frac{\sigma_{1}(W_{t-1})}{\sigma_{1}(\nabla W_t)}$. Our optimization strategy prevents the rapid concentration of spectral energy in only a few directions, and thus avoids the malignant entropy collapse that would trigger a model crash. We conduct extensive experiments using ViT, Swin-Transformer and GPT, showing that our optimization strategy can effectively and stably train these (Transformer) models without using learning rate warmup.
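
A minimal sketch of the described safeguard in PyTorch, treating the threshold `tau` and the weight `gamma` as assumed hyperparameters; the paper's exact bounding rule may differ:

```python
import torch

def bounded_lr(base_lr, W_prev, dW, tau=0.1, gamma=1.0):
    """Shrink the step when the top singular value of the update grows
    too fast relative to that of the current weights."""
    s_update = torch.linalg.matrix_norm(dW, ord=2)      # sigma_1 of the update
    s_weight = torch.linalg.matrix_norm(W_prev, ord=2)  # sigma_1 of the weights
    ratio = s_update / (s_weight + 1e-12)
    if ratio > tau:
        # Bound the learning rate by a weighted multiple of the inverse ratio.
        return base_lr * gamma * (s_weight / s_update).item()
    return base_lr
```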

611Reward Learning From Preference With Ties

[openreview] [pdf]

Abstract Reward learning plays a pivotal role in Reinforcement Learning from Human Feedback (RLHF), ensuring the alignment of language models. The Bradley-Terry (BT) model stands as the prevalent choice for capturing human preferences from datasets containing pairs of chosen and rejected responses. In preference modeling, the focus is not on absolute values but rather on the reward difference between chosen and rejected responses, referred to as preference strength. Thus, precise evaluation of preference strength holds paramount importance in preference modeling. However, an easily overlooked factor significantly affecting preference strength measurement is that human attitudes towards two responses may not solely indicate a preference for one over the other and ties are also a common occurrence. To address this, we propose the adoption of the generalized Bradley-Terry model -- the Bradley-Terry model with ties (BTT) -- to accommodate tied preferences, thus leveraging additional information. We prove that even with the access to the true distributions of prompt and response, disregarding ties can lead to a notable bias in preference strength measurement. Comprehensive experiments further validate the advantages of incorporating ties in preference modeling. Notably, fine-tuning with BTT significantly outperforms fine-tuning with BT on synthetic preference datasets with ties, labeled by state-of-the-art open-source LLMs.
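
One standard Bradley-Terry-with-ties formulation is the Rao-Kupper model, sketched below with rewards $r_1, r_2$ and a tie parameter $\theta \ge 1$ (where $\theta = 1$ recovers plain BT); the paper's BTT variant may differ in details:

```latex
P(y_1 \succ y_2) = \frac{e^{r_1}}{e^{r_1} + \theta e^{r_2}}, \qquad
P(y_1 \sim y_2) = \frac{(\theta^2 - 1)\, e^{r_1 + r_2}}
                       {\left(e^{r_1} + \theta e^{r_2}\right)\left(e^{r_2} + \theta e^{r_1}\right)}.
```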

612Enhancing Group Fairness in Federated Learning through Personalization

[openreview] [pdf]

Abstract Personalized Federated Learning (FL) algorithms collaboratively train customized models for each client, enhancing the accuracy of the learned models on the client’s local data (e.g., by clustering similar clients, by fine-tuning models locally, or by imposing regularization terms). In this paper, we investigate the impact of such personalization techniques on the group fairness of the learned models, and show that personalization can also lead to improved (local) fairness as an unintended benefit. We begin by illustrating these benefits of personalization through numerical experiments comparing several classes of personalized FL algorithms against a baseline FedAvg algorithm, elaborating on the reasons behind improved fairness using personalized FL, and then providing analytical support. Motivated by these, we then show how to build on this (unintended) fairness benefit, by further integrating a fairness metric into the cluster-selection procedure of clustering-based personalized FL algorithms, and improve the fairness-accuracy trade-off attainable through them. Specifically, we propose two new fairness-aware federated clustering algorithms, Fair-FCA and Fair-FL+HC, extending the existing IFCA and FL+HC algorithms, and demonstrate their ability to strike a (tuneable) balance between accuracy and fairness at the client level.

613Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias

[openreview] [pdf]

Abstract Score-based diffusion models have achieved incredible performance in generating realistic images, audio, and video data. While these models produce high-quality samples with impressive details, they often introduce unrealistic artifacts, such as distorted fingers or hallucinated texts with no meaning. This paper focuses on textual hallucinations, where diffusion models correctly generate individual symbols but assemble them in a nonsensical manner. Through experimental probing, we consistently observe that this phenomenon is attributable to the network’s local generation bias. Denoising networks tend to produce outputs that rely heavily on highly correlated local regions, particularly when different dimensions of the data distribution are nearly pairwise independent. This behavior leads to a generation process that decomposes the global distribution into separate, independent distributions for each symbol, ultimately failing to capture the global structure, including underlying grammar. Intriguingly, this bias persists across various denoising network architectures, including MLPs and transformers, which have the capacity to model global dependencies. These findings also provide insights into understanding other types of hallucinations, extending beyond text, as a result of implicit biases in the denoising models. Additionally, we theoretically analyze the training dynamics for a specific case involving a two-layer MLP learning parity points on a hypercube, offering an explanation of its underlying mechanism.

614Accelerated Diffusion using Closed-form Discriminator Guidance

[openreview] [pdf]

Abstract Diffusion models are a state-of-the-art generative modeling framework that transform noise to images via Langevin sampling, guided by the score, which is the gradient of the logarithm of the data distribution. Recent works have shown empirically that the generation quality can be improved when guided by classifier network, which is typically the discriminator trained in a generative adversarial network (GAN) setting. In this paper, we propose a theoretical framework to analyze the effect of the GAN discriminator on Langevin-based sampling, and show that in IPM GANs, the optimal generator matches {\it score-like} functions, involving the flow-field of the kernel associated with a chosen IPM constraint space. Further, we show that IPM-GAN optimization can be seen as one of smoothed score-matching, where the scores of the data and the generator distributions are convolved with the kernel associated with the constraint. The proposed approach serves to unify score-based training and optimization of IPM-GANs. Based on these insights, we demonstrate that closed-form discriminator guidance, using a kernel-based implementation, results in improvements (in terms of CLIP-FID and KID metrics) when applied atop baseline diffusion models. We demonstrate these results by applying closed-form discriminator guidance to denoising diffusion implicit model (DDIM) and latent diffusion model (LDM) settings on the FFHQ and CelebA-HQ datasets. We also demonstrate improvements to accelerated time-step-shifted diffusion, when coupled with a wavelet-based noise estimator for latent-space image generation.

615FreqPrior: Improving Diffusion Models with Frequency Filtering Gaussian Noise as Prior

[openreview] [pdf]

Abstract Text-driven video generation has advanced significantly due to developments in diffusion models. Beyond the training and sampling phases, recent studies have investigated noise priors of diffusion models, as improved noise priors yield better generation results. One recent approach employs Fourier transform to manipulate noise, marking the initial exploration of frequency operations in this context. However, it often generates videos that lack motion dynamics and imaging details. In this work, we provide a comprehensive theoretical analysis of the variance decay issue present in existing methods, contributing to the loss of details and motion dynamics. Recognizing the critical impact of noise distribution on generation quality, we introduce FreqPrior, a novel noise initialization strategy that refines noise in the frequency domain. Our method features a novel filtering technique designed to address different frequency signals while maintaining the noise prior distribution that closely approximates a standard Gaussian distribution. Additionally, we propose a partial sampling process by perturbing the latent at an intermediate timestep during finding the noise prior, significantly reducing inference time without compromising quality. Extensive experiments on VBench demonstrate that our method achieves the highest scores in both quality and semantic assessments, resulting in the best overall total score. These results highlight the superiority of our proposed noise prior.

616Going Beyond Static: Understanding Shifts with Time-Series Attribution

[openreview] [pdf]

Abstract Distribution shifts in time-series data are complex due to temporal dependencies, multivariable interactions, and trend changes. However, robust methods often rely on structural assumptions that lack thorough empirical validation, limiting their practical applicability. In order to support an empirically grounded inductive approach to research, we introduce our Time-Series Shift Attribution (TSSA) framework, which analyzes application-specific patterns of distribution shifts. Our framework attributes performance degradation from various types of shifts to each temporal data property in a detailed manner, supported by theoretical analysis of unbiasedness and asymptotic properties. Empirical studies in real-world healthcare applications highlight how the TSSA framework enhances the understanding of time-series shifts, facilitating reliable model deployment and driving targeted improvements from both algorithmic and data-centric perspectives.

617SelKD: Selective Knowledge Distillation via Optimal Transport Perspective

[openreview] [pdf]

Abstract Knowledge Distillation (KD) has been a popular paradigm for training a (smaller) student model from its teacher model. However, little research has been done on the practical scenario where only a subset of the teacher’s knowledge needs to be distilled, which we term selective KD (SelKD). This demand is especially pronounced in the era of foundation models, where the teacher model can be significantly larger than the student model. To address this issue, we propose to rethink the knowledge distillation problem from the perspective of Inverse Optimal Transport (IOT). Previous Bayesian frameworks mapped each sample to the probabilities of corresponding labels in an end-to-end manner, which fixed the number of classification categories and hindered effective local knowledge transfer. In contrast, IOT calculates from the standpoint of transportation or matching, allowing for the flexible selection of samples and their quantities for matching. Traditional logit-based KD can be viewed as a special case within the IOT framework. Building on this IOT foundation, we formalize this setting in the context of classification, where only selected categories from the teacher’s category space are required to be recognized by the student under closed-set recognition, which we call closed-set SelKD, enhancing the student’s performance on specific subtasks. Furthermore, we extend the closed-set SelKD, introducing an open-set version of SelKD, where the student model is required to provide a "not selected" response for categories outside its assigned task. Experimental results on standard benchmarks demonstrate the superiority of our approach.

618Subsampled Ensemble Can Improve Generalization Tail Exponentially

[openreview] [pdf]

Abstract Ensemble learning is a popular technique to improve the accuracy of machine learning models. It hinges on the rationale that aggregating multiple weak models can lead to better models with lower variance and hence higher stability, especially for discontinuous base learners. In this paper, we provide a new perspective on ensembling. By selecting the best model trained on subsamples via majority voting, we can attain exponentially decaying tails for the excess risk, even if the base learner suffers from slow (i.e., polynomial) decay rates. This tail enhancement power of ensembling is agnostic to the underlying base learner and is stronger than variance reduction in the sense of exhibiting rate improvement. We demonstrate how our ensemble methods can substantially improve out-of-sample performances in a range of examples involving heavy-tailed data or intrinsically slow rates.
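
A minimal sketch of prediction by majority vote over base learners trained on independent subsamples, the style of aggregation analyzed here; training of the base models (each exposing a `.predict` method) is assumed to happen elsewhere, and the paper's exact selection rule may differ:

```python
import numpy as np

def subsample_vote_predict(models, X):
    """Majority vote across base learners trained on random subsamples."""
    votes = np.stack([m.predict(X) for m in models])  # (n_models, n_points)
    # Per test point, return the most common predicted class.
    return np.apply_along_axis(
        lambda v: np.bincount(v).argmax(), axis=0, arr=votes.astype(int)
    )
```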

619Glauber Generative Model: Discrete Diffusion Models via Binary Classification

[openreview] [pdf]

Abstract We introduce the Glauber Generative Model (GGM), a new class of discrete diffusion models, to obtain new samples from a distribution given samples from a discrete space. GGM deploys a discrete Markov chain called the heat bath dynamics (or the Glauber dynamics) to denoise a sequence of noisy tokens to a sample from a joint distribution of discrete tokens. Our novel conceptual framework provides an exact reduction of the task of learning the denoising Markov chain to solving a class of binary classification tasks. More specifically, the model learns to classify a given token in a noisy sequence as signal or noise. In contrast, prior works on discrete diffusion models either solve regression problems to learn importance ratios, or minimize loss functions given by variational approximations. We apply GGM to language modeling and image generation, where images are discretized using image tokenizers like VQGANs. We show that it outperforms existing discrete diffusion models in language generation, and demonstrates strong performance for image generation without using dataset-specific image tokenizers. We also show that our model is capable of performing well in zero-shot control settings like text and image infilling.

620Training on the Test Task Confounds Evaluation and Emergence

[openreview] [pdf]

Abstract We study a fundamental problem in the evaluation of large language models that we call training on the test task. Unlike wrongful practices like training on the test data, leakage, or data contamination, training on the test task is not a malpractice. Rather, the term describes a growing set of techniques to include task-relevant data in the pretraining stage of a language model. We demonstrate that training on the test task confounds both relative model evaluations and claims about emergent capabilities. We argue that the seeming superiority of one model family over another may be explained by a different degree of training on the test task. To this end, we propose an effective method to adjust for the effect of training on the test task on benchmark evaluations. Put simply, we fine-tune each model under comparison on the same task-relevant data before evaluation. Lastly, we show that instances of emergent behavior disappear gradually as models train on the test task. Our work promotes a new perspective on the evaluation of large language models with broad implications for benchmarking and the study of emergent capabilities.

621Denoising Task Difficulty-based Curriculum for Training Diffusion Models

[openreview] [pdf]

Abstract Diffusion-based generative models have emerged as powerful tools in the realm of generative modeling. Despite extensive research on denoising across various timesteps and noise levels, a conflict persists regarding the relative difficulties of the denoising tasks. While various studies argue that lower timesteps present more challenging tasks, others contend that higher timesteps are more difficult. To address this conflict, our study undertakes a comprehensive examination of task difficulties, focusing on convergence behavior and changes in relative entropy between consecutive probability distributions across timesteps. Our observational study reveals that denoising at earlier timesteps poses challenges characterized by slower convergence and higher relative entropy, indicating increased task difficulty at these lower timesteps. Building on these observations, we introduce an easy-to-hard learning scheme, drawing from curriculum learning, to enhance the training process of diffusion models. By organizing timesteps or noise levels into clusters and training models with ascending orders of difficulty, we facilitate an order-aware training regime, progressing from easier to harder denoising tasks, thereby deviating from the conventional approach of training diffusion models simultaneously across all timesteps. Our approach leads to improved performance and faster convergence by leveraging benefits of curriculum learning, while maintaining orthogonality with existing improvements in diffusion training techniques. We validate these advantages through comprehensive experiments in image generation tasks, including unconditional, class-conditional, and text-to-image generation.
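
A minimal sketch of such an easy-to-hard timestep schedule, taking the abstract's finding that lower timesteps are the harder denoising tasks; the linear unlocking rule and cluster count are illustrative choices:

```python
import numpy as np

def curriculum_timestep_sampler(step, total_steps, T=1000, n_clusters=5):
    """Sample a diffusion timestep: high-timestep (easy) clusters are
    unlocked first, low-timestep (hard) clusters last."""
    bounds = np.linspace(0, T, n_clusters + 1, dtype=int)  # cluster edges
    # Fraction of training elapsed decides how many clusters are unlocked,
    # starting from the easiest (highest-timestep) cluster.
    unlocked = 1 + int((step / total_steps) * (n_clusters - 1))
    lo = bounds[n_clusters - unlocked]  # lowest currently unlocked timestep
    return np.random.randint(lo, T)

# Early in training only t in [800, 1000) is sampled; later the range widens.
print(curriculum_timestep_sampler(step=0, total_steps=10000))
```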

622DICE: Data Influence Cascade in Decentralized Learning

[openreview] [pdf]

Abstract Decentralized learning offers a promising approach to crowdsource computational workloads across geographically distributed compute interconnected through peer-to-peer networks, accommodating the exponentially increasing compute demands in the era of large models. However, the absence of proper incentives in locally connected decentralized networks poses significant risks of free riding and malicious behaviors. Data influence, which ensures fair attribution of data source contributions, holds great potential for establishing effective incentive mechanisms. Despite the importance, little effort has been made to analyze data influence in decentralized scenarios, due to non-trivial challenges arising from the distributed nature and the localized connections inherent in decentralized networks. To overcome this fundamental incentive problem, we propose DICE, the first comprehensive framework for analyzing Data Influence CascadEs in decentralized environments. Our framework characterizes how data influence cascades across the communication network and highlights the interplay between original data and network structure in shaping data influence in decentralized learning. We anticipate that DICE can open new avenues for incentive mechanism design and enable impactful applications of influence in decentralized learning, including anomaly detection, collaborator selection and machine unlearning.

623Breaking the Detection-Generalization Paradox on Out-Of-Distribution Data

[openreview] [pdf]

Abstract This work studies the trade-off between out-of-distribution (OOD) detection and generalization. We identify the Detection-Generalization Paradox in OOD data, where optimizing one objective can degrade the other. We investigate this paradox by analyzing the behaviors of models trained under different paradigms, focusing on representation, logits, and loss across in-distribution, covariate-shift, and semantic-shift data. Based on our findings, we propose Distribution-Robust Sharpness-Aware Minimization (DR-SAM), an optimization framework that balances OOD detection and generalization. Extensive experiments demonstrate the method’s effectiveness, offering a clear, empirically validated approach for improving detection and generalization ability across different benchmarks.

624Combining Analytical Smoothing with Surrogate Losses for Improved Decision-Focused Learning

[openreview] [pdf]

Abstract Many combinatorial optimization problems in routing, scheduling, and assignment involve parameters such as price or travel time that must be predicted from data; so-called predict-then-optimize (PtO) problems. Decision-focused learning (DFL) is a family of successful end-to-end techniques for PtO that trains machine learning models to minimize the error of the downstream optimization problems. For each instance, this requires computing the derivative of the optimization problem’s solution with respect to the predicted input parameters. Previous works in DFL employ two main approaches when the parameters appear linearly in the objective: (a) using a differentiable surrogate loss instead of regret; or (b) turning the combinatorial optimization problem into a differentiable mapping by smoothing the optimization to a quadratic program or other smooth convex optimization problem and minimizing the regret of that. We argue that while smoothing makes the optimization differentiable, for a large part, the derivative remains approximately zero almost everywhere, with highly non-zero values near the transition points. To address this plateau effect, we propose minimizing a surrogate loss even after smoothing. We experimentally demonstrate the advantage of minimizing surrogate losses instead of the regret after smoothing across a series of problems. Furthermore, we show that by minimizing a surrogate loss, a recently developed fast, fully neural optimization layer matches state-of-the-art performance while dramatically reducing training time up to five-fold. Thus, our paper opens new avenues for efficient and scalable DFL techniques.

625Outcome-based Semifactual Explanation For Reinforcement Learning

[openreview] [pdf]

Abstract Counterfactual explanations in reinforcement learning (RL) aim to answer what-if questions by showing sparse and minimal changes to states, which results in the probability mass moving from one action to another. Although these explanations are effective in classification tasks that look for the presence of concepts, RL brings new challenges that current counterfactual methods for RL still need to solve. These challenges include defining similarity in RL, out-of-distribution states, and lack of discriminative power. Given a state of interest called the query state, we solve these problems by asking how long the agent can execute the query state action without incurring a negative outcome regarding the expected return. We coin this outcome-based semifactual (OSF) explanation and find the OSF state by simulating trajectories from the query state. The last state in a subtrajectory where we can take the same action as in the query state without incurring a negative outcome is the OSF state. This state is discriminative, plausible, and similar to the query state. It abstracts away unimportant action switching with little explanatory value and shows the boundary between positive and negative outcomes. Qualitatively, we show that our method explains when it is necessary to switch actions. As a result, it is easier to understand the agent’s behavior. Quantitatively, we demonstrate that our method can increase policy performance and, at the same time, reduce how often the agent switches its action across six environments. The code and trained models are available at https://anonymous.4open.science/r/osf-explanation-for-rl-E312/.

626Efficient Online Reinforcement Learning Fine-Tuning Should Not Retain Offline Data

[openreview] [pdf]

Abstract The modern paradigm in machine learning involves pre-training models on diverse data, followed by task-specific fine-tuning. In reinforcement learning (RL), this translates to learning via offline RL on a static dataset, followed by rapid online RL fine-tuning using autonomous interaction data. Most RL fine-tuning methods require continued training on offline data for stability and performance. This is undesirable because retaining offline data is both slow and expensive for large datasets, but has been inevitable so far. In this paper, we show that retaining offline data is completely unnecessary as long as we use a correctly-designed online RL approach for fine-tuning offline RL initializations. We start by analyzing the role of retaining offline data in online fine-tuning. We find that continued training on offline data is mostly useful for preventing a sudden unlearning of the offline RL value function at the onset of fine-tuning, caused by a distribution mismatch between the offline data and online rollouts. As a result, this unlearning erases the benefits of offline pre-training. Our approach, WSRL, mitigates this sudden unlearning by using a warmup phase that seeds the online RL run with a very small number of rollouts from the pre-trained policy. The data collected during warmup helps "recalibrate" the offline Q-function to the online data, allowing us to completely discard offline data without the risk of destabilizing online RL training. We show that WSRL is able to fine-tune without retaining any offline data, learns faster, and attains higher performance than existing algorithms irrespective of whether they do or do not retain offline data.

627Replay concurrently or sequentially? A theoretical perspective on replay in continual learning

[openreview] [pdf]

Abstract Replay-based methods have shown superior performance to address catastrophic forgetting in continual learning (CL), where a subset of past data is stored and generally replayed together with new data in current task learning. While seemingly natural, it is questionable, though rarely questioned, if such a concurrent replay strategy is always the right way for replay in CL. Inspired by the fact in human learning that revisiting very different courses sequentially before final exams is more effective for students, an interesting open question to ask is whether a sequential replay can benefit CL more compared to a standard concurrent replay. However, answering this question is highly nontrivial considering a major lack of theoretical understanding in replay-based CL methods. To this end, we investigate CL in overparameterized linear models and provide a comprehensive theoretical analysis to compare two replay schemes: 1) Concurrent Replay, where the model is trained on replay data and new data concurrently; 2) Sequential Replay, where the model is trained first on new data and then sequentially on replay data for each old task. By characterizing the explicit form of forgetting and generalization error, we show in theory that sequential replay tends to outperform concurrent replay when tasks are less similar, which is corroborated by our simulations in linear models. More importantly, our results inspire a novel design of a hybrid replay method, where only replay data of similar tasks are used concurrently with the current data and dissimilar tasks are sequentially revisited using their replay data. As depicted in our experiments on real datasets using deep neural networks, such a hybrid replay method improves the performance of standard concurrent replay by leveraging sequential replay for dissimilar tasks. By providing the first comprehensive theoretical analysis on replay, our work has great potential to open up more principled designs for replay-based CL.

628Invariance to Planning in Goal-Conditioned RL

[openreview] [pdf]

Abstract We study goal-conditioned RL through the lens of generalization, but not in the traditional sense of random augmentations and domain randomization. Rather, we aim to learn goal-directed policies that generalize with respect to the horizon: after training to reach nearby goals (which are easy to learn), these policies should succeed in reaching distant goals (which are quite challenging to learn). In the same way that invariance is closely linked with generalization in other areas of machine learning (e.g., normalization layers make a network invariant to scale, and therefore generalize to inputs of varying scales), we show that this notion of horizon generalization is closely linked with invariance to planning: a policy navigating towards a goal will select the same actions as if it were navigating to a waypoint en route to that goal. Horizon generalization and invariance to planning are appealing because of their potential reach: they imply that a policy trained to reach nearby goals would succeed at reaching goals that are arbitrarily more distant. Our theoretical analysis proves that both horizon generalization and planning invariance are possible, under some assumptions. We present new experimental results, as well as recalling results from prior work, in support of our theoretical results. Taken together, our results open the door to studying how techniques for invariance and generalization developed in other areas of machine learning might be adapted to achieve this alluring property.

629FairDropout: Using Example-Tied Dropout to Enhance Generalization for Minority Groups

[openreview] [pdf]

Abstract Deep learning models frequently exploit spurious features in training data to achieve low training error, often resulting in poor generalization when faced with shifted testing distributions. To address this issue, various methods from imbalanced learning, representation learning, and classifier recalibration have been proposed to enhance the robustness of deep neural networks against spurious correlations. In this paper, we observe that models trained with empirical risk minimization tend to generalize well for examples from the majority groups while memorizing instances from minority groups. Building on recent findings that show memorization can be localized to a limited number of neurons, we apply example-tied dropout as a method we term FairDropout, aimed at redirecting this memorization to specific neurons that we subsequently drop out during inference. We empirically evaluate FairDropout using the subpopulation benchmark suite encompassing vision, language, and healthcare tasks, demonstrating that it significantly reduces reliance on spurious correlations.
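
A minimal PyTorch sketch of example-tied dropout as described: a block of "memorization" channels is only active for its own training examples and is dropped entirely at inference. The tied-channel fraction and the id-based slot assignment are illustrative assumptions:

```python
import torch
import torch.nn as nn

class ExampleTiedDropout(nn.Module):
    def __init__(self, dim, tied_frac=0.2):
        super().__init__()
        self.n_tied = int(dim * tied_frac)  # channels reserved for memorization

    def forward(self, h, example_ids=None):
        mask = torch.ones_like(h)
        if self.training and example_ids is not None:
            # Each example activates only its own slot among the tied channels.
            slot = example_ids % self.n_tied
            tied = torch.zeros(h.size(0), self.n_tied, device=h.device)
            tied[torch.arange(h.size(0)), slot] = 1.0
            mask[:, :self.n_tied] = tied
        else:
            # Inference: drop the tied channels so memorized features are unused.
            mask[:, :self.n_tied] = 0.0
        return h * mask
```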

630Offline-to-Online Reinforcement Learning with Prioritized Experience Selection

[openreview] [pdf]

Abstract Offline-to-online reinforcement learning (O2O RL) offers a promising paradigm that first pre-trains an offline policy and fine-tunes it with further online interactions. Nevertheless, the distribution shift between the offline and online phase often hinders the fine-tuning performance, sometimes even incurring performance collapse. Existing methods mitigate this by enhancing training robustness with Q-ensemble, training a density ratio estimator to balance offline and online data, etc. But they often rely on components like ensembles and have higher training costs. In this paper, we address this issue by establishing a concrete performance bound for the optimal policies between two consecutive online steps. Motivated by the theoretical insight, we propose a simple yet effective fine-tuning method, Prioritized Experience Selection (PES). During the online stage, PES maintains a dynamically updated priority queue containing a portion of high-return trajectories, and only selects online samples that are close to the samples in the queue for fine-tuning. In this way, the distribution shift issue can be mitigated and the fine-tuning performance can be boosted. PES is computationally efficient and compatible with numerous approaches. Experimental results on a variety of D4RL datasets show that PES can benefit different offline and O2O RL algorithms and enhance Q-value estimates. Our code is available and will be open-source.
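
A minimal sketch of the selection mechanism as described: a bounded priority queue of high-return trajectories, with an online sample accepted for fine-tuning only when it is close to something in the queue. Euclidean distance in state space is an illustrative choice, not necessarily the paper's metric:

```python
import heapq
import numpy as np

class PrioritizedExperienceSelection:
    def __init__(self, capacity=100, dist_threshold=1.0):
        self.queue = []  # min-heap of (return, id, states); lowest return on top
        self.capacity = capacity
        self.dist_threshold = dist_threshold
        self._id = 0  # tie-breaker so heap never compares state arrays

    def add_trajectory(self, traj_return, states):
        heapq.heappush(self.queue, (traj_return, self._id, states))
        self._id += 1
        if len(self.queue) > self.capacity:
            heapq.heappop(self.queue)  # evict the lowest-return trajectory

    def select(self, state):
        """Should this online sample be used for fine-tuning?"""
        if not self.queue:
            return True
        all_states = np.concatenate([s for _, _, s in self.queue])
        return np.linalg.norm(all_states - state, axis=1).min() <= self.dist_threshold
```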

631Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling

[openreview] [pdf]

Abstract Recent advances in knowledge distillation (KD) have enabled smaller student models to approach the performance of larger teacher models. However, popular methods such as supervised KD and on-policy KD are adversely impacted by the knowledge gap between teacher and student in practical scenarios. Supervised KD suffers from a distribution mismatch between training with a static dataset and inference over final student-generated outputs. Conversely, on-policy KD, which uses student-generated samples for training, can suffer from low-quality training examples with which teacher models are not familiar, resulting in inaccurate teacher feedback. To address these limitations, we introduce Speculative Knowledge Distillation (SKD), a novel approach that leverages cooperation between student and teacher models to generate high-quality training data on-the-fly while aligning with the student’s inference-time distribution. In SKD, the student proposes tokens, and the teacher replaces poorly ranked ones based on its own distribution, transferring high-quality knowledge adaptively. We evaluate SKD on various text generation tasks, including translation, summarization, math, and instruction following, and show that SKD consistently outperforms existing KD methods across different domains, data sizes, and model initialization strategies.
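
A minimal single-position sketch of the interleaved sampling idea, assuming 1-D logits; the top-k acceptance rule below stands in for however the paper ranks student proposals:

```python
import torch

@torch.no_grad()
def skd_step(student_logits, teacher_logits, top_k=25):
    """Sample a token from the student; keep it if the teacher ranks it
    in its top-k, otherwise resample from the teacher."""
    s_tok = torch.multinomial(torch.softmax(student_logits, dim=-1), 1)
    teacher_topk = torch.topk(teacher_logits, top_k).indices
    if s_tok.item() in teacher_topk.tolist():
        return s_tok.item(), "student"   # accepted student proposal
    t_tok = torch.multinomial(torch.softmax(teacher_logits, dim=-1), 1)
    return t_tok.item(), "teacher"       # teacher replacement
```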

632PREDICT: Preference Reasoning by Evaluating Decomposed preferences Inferred from Candidate Trajectories

[openreview] [pdf]

Abstract Accommodating human preferences is essential for creating AI agents that deliver personalized and effective interactions. Recent work has shown the potential for LLMs to infer preferences from user interactions, but they often produce broad and generic preferences, failing to capture the unique and individualized nature of human preferences. This paper introduces PREDICT, a method designed to enhance the precision and adaptability of inferring preferences. PREDICT incorporates three key elements: (1) iterative refinement of inferred preferences, (2) decomposition of preferences into constituent components, and (3) validation of preferences across multiple trajectories. We evaluate PREDICT on two distinct environments: a gridworld setting and a new text-domain environment (PLUME). PREDICT more accurately infers nuanced human preferences, improving over existing baselines by 66.2% (gridworld environment) and 41.0% (PLUME).

633Online Bandit Nonlinear Control with Dynamic Batch Length and Adaptive Learning Rate

[openreview] [pdf]

Abstract This paper is concerned with online bandit nonlinear control, which aims to learn the best stabilizing controller from a pool of stabilizing and destabilizing controllers of unknown types for a given nonlinear dynamical system. We develop an algorithm, named Dynamic Batch length and Adaptive learning Rate (DBAR), and study its stability and regret. Unlike the existing Exp3 algorithm requiring an exponentially stabilizing controller, DBAR only needs a significantly weaker notion of controller stability, in which case substantial time may be required to certify the system stability. Dynamic batch length in DBAR effectively addresses this issue and enables the system to attain asymptotic stability, where the algorithm behaves as if there were no destabilizing controllers. Moreover, adaptive learning rate in DBAR only uses the state norm information to achieve a tight regret bound even when none of the stabilizing controllers in the pool are exponentially stabilizing.

634AlphaQCM: Alpha Discovery with Distributional Reinforcement Learning

[openreview] [pdf]

Abstract Finding synergistic formulaic alphas is very important but challenging for researchers and practitioners in finance. In this paper, we reconsider the discovery of formulaic alphas from the viewpoint of sequential decision-making, and conceptualize the entire alpha-mining process as a non-stationary and reward-sparse Markov decision process. To overcome the challenges of non-stationarity and reward-sparsity, we propose the AlphaQCM method, a novel distributional reinforcement learning method designed to search for synergistic formulaic alphas efficiently. The AlphaQCM method first learns the Q function and quantiles via a Q network and a quantile network, respectively. Then, the AlphaQCM method applies the quantiled conditional moment method to learn unbiased variance from the potentially biased quantiles. Guided by the learned Q function and variance, the AlphaQCM method navigates the non-stationarity and reward-sparsity to explore the vast search space of formulaic alphas with high efficacy. Empirical applications to real-world datasets demonstrate that our AlphaQCM method significantly outperforms its competitors, particularly when dealing with large datasets comprising numerous stocks.

635Federated Learning with Dynamic Client Arrival and Departure: Convergence and Rapid Adaptation via Initial Model Construction

[openreview] [pdf]

Abstract While most existing federated learning (FL) approaches assume a fixed set of clients in the system, in practice, clients can dynamically leave or join the system depending on their needs or interest in the specific task. This dynamic FL setting introduces several key challenges: (1) the objective function dynamically changes depending on the current set of clients, unlike traditional FL approaches that maintain a static optimization goal; (2) the current global model may not serve as the best initial point for the next FL rounds and could potentially lead to slow adaptation, given the possibility of clients leaving or joining the system. In this paper, we consider a dynamic optimization objective in FL that seeks the optimal model tailored to the currently active set of clients. Building on our probabilistic framework that provides direct insights into how the arrival and departure of different types of clients influence the shifts in optimal points, we establish an upper bound on the optimality gap, accounting for factors such as stochastic gradient noise, local training iterations, non-IIDness of data distribution, and deviations between optimal points caused by dynamic client patterns. We also propose an adaptive initial model construction strategy that employs weighted averaging guided by gradient similarity, prioritizing models trained on clients whose data characteristics align closely with the current one, thereby enhancing adaptability to the current clients. The proposed approach is validated on various datasets and FL algorithms, demonstrating robust performance across diverse client arrival and departure patterns, underscoring its effectiveness in dynamic FL environments.
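
A minimal sketch of the weighted initial-model construction as described, using cosine similarity between flattened client updates; the softmax temperature and the flattened-parameter representation are assumptions:

```python
import torch
import torch.nn.functional as F

def adaptive_initial_model(stored_updates, stored_models, current_update, temp=1.0):
    """Weight previously trained client models by how similar their updates
    are to the current client's update, then average."""
    sims = torch.stack([
        F.cosine_similarity(u.flatten(), current_update.flatten(), dim=0)
        for u in stored_updates
    ])
    w = torch.softmax(sims / temp, dim=0)
    # Weighted average of flattened model parameter vectors.
    return sum(wi * m for wi, m in zip(w, stored_models))
```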

636Beyond the Boundaries of Proximal Policy Optimization

[openreview] [pdf]

Abstract Proximal policy optimization (PPO) is a widely-used algorithm for on-policy reinforcement learning. This work offers an alternative perspective of PPO, in which it is decomposed into the inner-loop estimation of update vectors, and the outer-loop application of updates using gradient ascent with unity learning rate. Using this insight we propose outer proximal policy optimization (outer-PPO); a framework wherein these update vectors are applied using an arbitrary gradient-based optimizer. The decoupling of update estimation and update application enabled by outer-PPO highlights several implicit design choices in PPO that we challenge through empirical investigation. In particular we consider non-unity learning rates and momentum applied to the outer loop, and a momentum-bias applied to the inner estimation loop. Methods are evaluated against an aggressively tuned PPO baseline on Brax, Jumanji and MinAtar environments; non-unity learning rates and momentum both achieve statistically significant improvement on Brax and Jumanji, given the same hyperparameter tuning budget.
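
A minimal PyTorch sketch of the decomposition: the inner loop estimates an update vector, which the outer loop applies through an arbitrary optimizer. With plain SGD at unity learning rate and no momentum this reduces to vanilla PPO; `inner_update_fn` is an assumed callable running standard PPO epochs in place:

```python
import torch

def outer_ppo_step(policy, inner_update_fn, outer_opt):
    theta_old = [p.detach().clone() for p in policy.parameters()]
    inner_update_fn(policy)                  # standard PPO epochs on a copy of theta
    outer_opt.zero_grad()
    for p, old in zip(policy.parameters(), theta_old):
        update = p.detach() - old            # the estimated update vector
        p.data.copy_(old)                    # rewind to theta_old
        p.grad = -update                     # ascent direction as a "gradient"
    outer_opt.step()                         # e.g. torch.optim.SGD(lr=1.2, momentum=0.9)
```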

637Foundation Models for Enhanced Exploration in Reinforcement Learning

[openreview] [pdf]

Abstract Reinforcement learning agents often struggle with sample inefficiency, requiring extensive interactions with the environment to develop effective policies. This inefficiency is partly due to the challenge of balancing exploration and exploitation without the abstract reasoning and prior knowledge that humans use to quickly identify rewarding actions. Recent advancements in foundation models, such as large language models (LLMs) and vision-language models (VLMs), have shown human-level reasoning capabilities in some domains but have been underutilized in directly selecting low-level actions for exploration in reinforcement learning. In this paper, we investigate the potential of foundation models to enhance exploration in reinforcement learning tasks. We conduct an in-depth analysis of their exploration behaviour in multi-armed bandit problems and Gridworld environments, comparing their performance against traditional exploration strategies and reinforcement learning agents. Our empirical results suggest foundation models can significantly improve exploration efficiency by leveraging their reasoning abilities to infer optimal actions. Building on these findings, we introduce Foundation Model Exploration (FME), a novel exploration scheme that integrates foundation models into the reinforcement learning framework for intelligent exploration behaviour. We use VLMs and demonstrate that they can infer environment dynamics and objectives from raw image observations. This means FME only requires the action space as environment-specific manual text input. We find that agents equipped with FME achieve superior performance in sparse reward Gridworld environments and scale to more complex tasks like Atari games. Moreover, the effectiveness of FME increases with the capacity of the VLM used, indicating that future advancements in foundation models will further enhance such exploration strategies.

638Is multitask learning all you need in continual learning?

[openreview] [pdf]

Abstract Continual Learning solutions often treat multitask learning as an upper-bound of what the learning process can achieve. This is a natural assumption, given that this objective directly addresses the catastrophic forgetting problem, which has been a central focus in early works. However, depending on the nature of the distributional shift in the data, the multi-task solution is not always optimal for the broader continual learning problem. In this work, we draw on principles from online learning to formalize the limitations of multitask objectives, especially when viewed through the lens of cumulative loss, which also serves as an indicator of forward transfer. We provide empirical evidence on when multi-task solutions are suboptimal, and argue that continual learning solutions should not and do not have to adhere to this assumption. Moreover, we argue for the utility of estimating the distributional drift as the data is being received and show preliminary results of how this could be exploited by a simple replay based method to move beyond the multitask solution.

639Contextual Bandits with Entropy-based Human Feedback

[openreview] [pdf]

Abstract In recent years, preference-based human feedback mechanisms have become integral to improving model performance across a range of applications, including conversational AI systems like ChatGPT. However, existing methodologies often overlook critical factors such as model uncertainty and variability in feedback quality. To address these limitations, we propose an innovative entropy-based human feedback framework designed for contextual bandits, which balances exploration and exploitation by soliciting expert feedback when model entropy surpasses a predefined threshold. Our method is model-agnostic and adaptable to any contextual bandit agent employing stochastic policies. Through rigorous experimentation, we demonstrate that our approach requires minimal human feedback to achieve significant performance gains, even with suboptimal feedback quality. Our work not only introduces a novel feedback solicitation strategy but also underscores the robustness of integrating human guidance into machine learning systems. Our code is publicly available: \url{https://anonymous.4open.science/r/CBHF-33C5}
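
A minimal sketch of the entropy-gated solicitation rule; the normalized-entropy threshold and the expert interface are illustrative assumptions:

```python
import numpy as np

def maybe_query_expert(action_probs, expert_fn, threshold=0.5):
    """Ask the expert when the agent's action distribution is too uncertain;
    otherwise act greedily. `expert_fn` returns the expert's action."""
    p = np.asarray(action_probs)
    entropy = -(p * np.log(p + 1e-12)).sum() / np.log(len(p))  # normalized to [0, 1]
    if entropy > threshold:
        return expert_fn(), True      # solicited expert feedback
    return int(p.argmax()), False     # act on the agent's own policy
```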

640Preference Optimization for Reasoning with Pseudo Feedback

[openreview] [pdf]

Abstract Preference optimization techniques, such as Direct Preference Optimization (DPO), are frequently employed to enhance the reasoning capabilities of large language models (LLMs) in domains like mathematical reasoning and coding, typically following supervised fine-tuning. These methods rely on high-quality labels for reasoning tasks to generate preference pairs; however, the availability of reasoning datasets with human-verified labels is limited. In this study, we introduce a novel approach to generate pseudo feedback for reasoning tasks by framing the labeling of solutions to reasoning problems as an evaluation against associated test cases. We explore two forms of pseudo feedback based on test cases: one generated by frontier LLMs and the other by extending self-consistency to multiple test cases. We conduct experiments on both mathematical reasoning and coding tasks using pseudo feedback for preference optimization, and observe improvements across both tasks. Specifically, using Mathstral-7B as our base model, we improve MATH results from 58.3 to 68.6, surpassing both NuminaMath-72B and GPT-4-Turbo-1106-preview. In GSM8K and College Math, our scores increase from 85.6 to 90.3 and from 34.3 to 42.3, respectively. Building on Deepseek-coder-7B-v1.5, we achieve a score of 24.3 on LiveCodeBench (from 21.1), surpassing Claude-3-Haiku.
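
A minimal sketch of turning test-case execution into preference pairs; the executor interface `run_fn(solution, case) -> bool` and the pass-rate scoring are assumptions standing in for the paper's exact labeling scheme:

```python
def pseudo_feedback_pairs(candidates, test_cases, run_fn):
    """Score each candidate solution by the fraction of test cases it passes,
    then pair higher scorers ("chosen") against lower scorers ("rejected")."""
    scored = sorted(
        ((sum(run_fn(c, t) for t in test_cases) / len(test_cases), c)
         for c in candidates),
        reverse=True, key=lambda x: x[0],
    )
    return [
        {"chosen": hi, "rejected": lo}
        for (s_hi, hi) in scored for (s_lo, lo) in scored if s_hi > s_lo
    ]
```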

641Differential Transformer

[openreview] [pdf]

Abstract Transformer tends to overallocate attention to irrelevant context. In this work, we introduce Diff Transformer, which amplifies attention to the relevant context while canceling noise. Specifically, the differential attention mechanism calculates attention scores as the difference between two separate softmax attention maps. The subtraction cancels noise, promoting the emergence of sparse attention patterns. Experimental results on language modeling show that Diff Transformer outperforms Transformer in various settings of scaling up model size and training tokens. More intriguingly, it offers notable advantages in practical applications, such as long-context modeling, key information retrieval, hallucination mitigation, in-context learning, and reduction of activation outliers. By being less distracted by irrelevant context, Diff Transformer can mitigate hallucination in question answering and text summarization. For in-context learning, Diff Transformer not only enhances accuracy but is also more robust to order permutation, which was considered a chronic robustness issue. The results position Diff Transformer as a highly effective and promising architecture for large language models.
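
A minimal single-head sketch of the differential attention map, with a fixed `lam` standing in for the paper's learnable balance scalar:

```python
import torch
import torch.nn.functional as F

def differential_attention(x, Wq1, Wk1, Wq2, Wk2, Wv, lam=0.8):
    """Compute two softmax attention maps from separate query/key projections
    and subtract them before applying the result to the values."""
    d = Wq1.shape[1]
    a1 = F.softmax((x @ Wq1) @ (x @ Wk1).transpose(-1, -2) / d**0.5, dim=-1)
    a2 = F.softmax((x @ Wq2) @ (x @ Wk2).transpose(-1, -2) / d**0.5, dim=-1)
    return (a1 - lam * a2) @ (x @ Wv)   # noise-cancelling attention map
```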

642Mobility Networked Time-Series Forecasting Benchmark Datasets

[openreview] [pdf]

Abstract Human mobility is crucial for urban planning (e.g., public transportation) and epidemic response strategies. However, existing research often neglects integrating comprehensive perspectives on spatial dynamics, temporal trends, and other contextual views due to the limitations of existing mobility datasets. To bridge this gap, we introduce MOBINS (MOBIlity Networked time Series), a novel dataset collection designed for networked time-series forecasting of dynamic human movements. MOBINS features diverse and explainable datasets that capture various mobility patterns across different transportation modes in four cities and two countries and cover both transportation and epidemic domains at the administrative area level. Our experiments with nine baseline methods reveal the significant impact of different model backbones on the proposed six datasets. We provide a valuable resource for advancing urban mobility research, and our dataset collection is available at https://anonymous.4open.science/r/MOBINS.

643Avoiding Catastrophe in Online Learning by Asking for Help

[openreview] [pdf]

Abstract Most learning algorithms with formal regret guarantees assume that no mistake is irreparable and essentially rely on trying all possible behaviors. This approach is problematic when some mistakes are catastrophic, i.e., irreparable. We propose an online learning problem where the goal is to minimize the chance of catastrophe. Specifically, we assume that the payoff in each round represents the chance of avoiding catastrophe that round and try to maximize the product of payoffs (the overall chance of avoiding catastrophe) while allowing a limited number of queries to a mentor. We first show that in general, any algorithm either constantly queries the mentor or is nearly guaranteed to cause catastrophe. However, in settings where the mentor policy class is learnable in the standard online model, we provide an algorithm whose regret and rate of querying the mentor both approach 0 as the time horizon grows. Conceptually, if a policy class is learnable in the absence of catastrophic risk, it is learnable in the presence of catastrophic risk if the agent can ask for help.

644Convergence of Distributed Adaptive Optimization with Local Updates

[openreview] [pdf]

Abstract We study distributed adaptive algorithms with local updates (intermittent communication). Despite the great empirical success of adaptive methods in distributed training of modern machine learning models, the theoretical benefits of local updates within adaptive methods, particularly in terms of reducing communication complexity, have not been fully understood yet. In this paper, we prove that Local SGD with momentum (Local SGDM) and Local Adam can outperform their minibatch counterparts in convex and weakly convex settings, respectively. Our analysis relies on a novel technique to prove contraction during local iterations, which is a crucial yet challenging step in showing the advantages of local updates, under a generalized smoothness assumption and a gradient clipping strategy.

645Linear Multistep Solver Distillation for Fast Sampling of Diffusion Models

[openreview] [pdf]

Abstract Sampling from diffusion models can be seen as solving the corresponding probability flow ordinary differential equation (ODE). The solving process requires a significant number of function evaluations (NFE), making it time-consuming. Recently, several solver search frameworks have attempted to find better-performing model-specific solvers. However, predicting the impact of intermediate solving strategies on final sample quality remains challenging, rendering the search process inefficient. In this paper, we propose a novel method for designing solving strategies. We first introduce a unified prediction formula for linear multistep solvers. Subsequently, we present a solver distillation framework, which enables a student solver to mimic the sampling trajectory generated by a teacher solver with more steps. We utilize the mean Euclidean distance between the student and teacher sampling trajectories as a metric, facilitating rapid adjustment and optimization of intermediate solving strategies. The design space of our framework encompasses multiple aspects, including prediction coefficients, time step schedules, and time scaling factors. Our framework can complete a solver search for Stable-Diffusion in less than 10 total GPU hours. Compared to previous reinforcement learning-based search frameworks, our approach achieves over a 10× increase in search efficiency. With just 5 NFE, we achieve FID scores of 3.23 on CIFAR10, 7.16 on ImageNet-64, 5.44 on LSUN-Bedroom, and 15.69 on MS-COCO, resulting in a 2× sampling acceleration ratio compared to handcrafted solvers.
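A rough sketch of the two ingredients the abstract names, under stated assumptions: a linear multistep update whose coefficients form part of the search space, and the trajectory-matching distillation loss. The function names and exact parameterization here are hypothetical simplifications.

```python
import torch

def multistep_predict(x_t, eps_history, coeffs, dt):
    """Linear multistep update: extrapolate the next latent from a learned
    linear combination of past model outputs (coeffs are searched/distilled)."""
    eps = sum(c * e for c, e in zip(coeffs, eps_history))
    return x_t + dt * eps

def distillation_loss(student_traj, teacher_traj):
    """Mean Euclidean distance between the few-step student trajectory and
    the teacher trajectory subsampled at matching time points."""
    dists = [(s - t).flatten(1).norm(dim=1).mean()
             for s, t in zip(student_traj, teacher_traj)]
    return torch.stack(dists).mean()
```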

646Distributed In-Context Learning under Non-IID Among Clients

[openreview] [pdf]

Abstract Advancements in large language models (LLMs) have shown their effectiveness in multiple complicated natural language reasoning tasks. A key challenge remains in adapting these models efficiently to new or unfamiliar tasks. In-context learning (ICL) provides a promising solution for few-shot adaptation by retrieving a set of data points relevant to a query, called in-context examples (ICE), from a training dataset and providing them during the inference as context. Most existing studies utilize a centralized training dataset, yet many real-world datasets may be distributed among multiple clients, and remote data retrieval can be associated with costs. Especially when the client data follow non-identical distributions (non-IID), retrieving from the clients a proper set of ICEs for a test query presents critical challenges. In this paper, we first show that in this challenging setting, test queries will have different preferences among clients because of non-IIDness, and equal contribution often leads to suboptimal performance. We then introduce a novel approach to tackle the distributed non-IID ICL problem when a data usage budget is present. The principle is that each client’s proper contribution (budget) should be designed according to the preference of each query for that client. Our approach allocates a budget for each client in a data-driven manner, tailored to each test query. Through extensive empirical studies on diverse datasets, our framework demonstrates superior performance relative to competing baselines.

647Accelerated Online Reinforcement Learning using Auxiliary Start State Distributions

[openreview] [pdf]

Abstract Learning a robust policy that is performant across the state space, in a sample efficient manner, is a long-standing problem in online reinforcement learning (RL). This challenge arises from the inability of algorithms to explore the environment efficiently. Most attempts at efficient exploration tackle this problem in a setting where learning begins from scratch, without prior information available to bootstrap learning. However, such approaches often fail to fully leverage expert demonstrations and simulators that can reset to arbitrary states. These affordances are valuable resources that offer enormous potential to guide exploration and speed up learning. In this paper, we explore how a small number of expert demonstrations and a simulator allowing arbitrary resets can accelerate learning during online RL. We show that by leveraging expert state information to form an auxiliary start state distribution, we significantly improve sample efficiency. Specifically, we show that using a notion of safety to inform the choice of auxiliary distribution significantly accelerates learning. We highlight the effectiveness of our approach by matching or exceeding state-of-the-art performance in sparse reward and dense reward setups, even when competing with algorithms with access to expert actions and rewards. Moreover, we find that the improved exploration ability facilitates learning more robust policies in sparse reward, hard exploration environments.

648Towards Marginal Fairness Sliced Wasserstein Barycenter

[openreview] [pdf]

Abstract The Sliced Wasserstein barycenter (SWB) is a widely acknowledged method for efficiently generalizing the averaging operation within probability measure spaces. However, achieving marginal fairness SWB, ensuring approximately equal distances from the barycenter to marginals, remains unexplored. The uniform weighted SWB is not necessarily the optimal choice to obtain the desired marginal fairness barycenter due to the heterogeneous structure of marginals and the non-optimality of the optimization. As the first attempt to tackle the problem, we define the marginal fairness sliced Wasserstein barycenter (MFSWB) as a constrained SWB problem. Due to the computational disadvantages of the formal definition, we propose two hyperparameter-free and computationally tractable surrogate MFSWB problems that implicitly minimize the distances to marginals and encourage marginal fairness at the same time. To further improve the efficiency, we perform slicing distribution selection and obtain the third surrogate definition by introducing a new slicing distribution that focuses more on marginally unfair projecting directions. We discuss the relationships among the three proposed problems and their connection to the sliced multi-marginal Wasserstein distance. Finally, we conduct experiments on 3D point-cloud averaging, color harmonization, and training of sliced Wasserstein autoencoders with class-fairness representation to show the favorable performance of the proposed surrogate MFSWB problems.
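The surrogates build on the sliced Wasserstein distance, which is cheap to estimate by Monte Carlo over random one-dimensional projections. A minimal sketch for equal-size point clouds follows; the fairness-aware slicing distribution the paper introduces is not reproduced here.

```python
import torch

def sliced_wasserstein(x, y, n_proj=128, p=2):
    """Monte Carlo sliced Wasserstein distance between point clouds x, y.

    x, y: (n, d) tensors with the same n; directions are uniform on the sphere.
    """
    d = x.shape[1]
    theta = torch.randn(n_proj, d)
    theta = theta / theta.norm(dim=1, keepdim=True)   # random projection directions
    xp = torch.sort(x @ theta.T, dim=0).values        # sorted 1D projections
    yp = torch.sort(y @ theta.T, dim=0).values        # = 1D optimal transport plan
    return ((xp - yp).abs() ** p).mean() ** (1 / p)
```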

649On Generalization Within Multi-Objective Reinforcement Learning Algorithms

[openreview] [pdf]

Abstract Real-world sequential decision-making tasks often require balancing trade-offs between multiple conflicting objectives, making Multi-Objective Reinforcement Learning (MORL) an increasingly prominent field of research. Despite recent advances, existing MORL literature has narrowly focused on performance within static environments, neglecting the importance of generalizing across diverse settings. Conversely, existing research on generalization in RL has always assumed scalar rewards, overlooking the inherent multi-objectivity of real-world problems. Generalization in the multi-objective context is fundamentally more challenging, as it requires learning a Pareto set of policies addressing varying preferences across multiple objectives. In this paper, we formalize the concept of generalization in MORL and how it can be evaluated. We then contribute a novel testbed featuring diverse multi-objective domains with parameterized environment configurations to facilitate future studies in this area. Our baseline evaluations of state-of-the-art MORL algorithms on this testbed reveal limited generalization capabilities, suggesting significant room for improvement. Our empirical findings also expose limitations in the expressivity of scalar rewards, emphasizing the need for multi-objective specifications to achieve effective generalization. We further analyze the algorithmic complexities within current MORL approaches that could impede the transfer in performance from the single- to multiple-environment settings. This work fills a critical gap and lays the groundwork for future research that brings together two key areas in reinforcement learning: solving multi-objective decision-making problems and generalizing across diverse environments.

650Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization

[openreview] [pdf]

Abstract Recent advancements in timestep-distilled diffusion models have enabled high-quality image generation that rivals non-distilled multi-step models, but with significantly fewer inference steps. While such models are attractive for applications due to the low inference cost and latency, fine-tuning them with a naive diffusion objective would result in degraded and blurry outputs. An intuitive alternative is to repeat the diffusion distillation process with a fine-tuned teacher model, which produces good results but is cumbersome and computationally intensive: the distillation training usually requires orders of magnitude more training compute than fine-tuning for specific image styles. In this paper, we present an algorithm named pairwise sample optimization (PSO), which enables the direct fine-tuning of an arbitrary timestep-distilled diffusion model. PSO introduces additional reference images sampled from the current time-step distilled model, and increases the relative likelihood margin between the training images and reference images. This enables the model to retain its few-step generation ability, while allowing for fine-tuning of its output distribution. We also demonstrate that PSO is a generalized formulation which can be flexibly extended to both offline-sampled and online-sampled pairwise data, covering various popular objectives for diffusion model preference optimization. We evaluate PSO in both preference optimization and other fine-tuning tasks, including style transfer and concept customization. We show that PSO can directly adapt distilled models to human-preferred generation with both offline and online-generated pairwise preference image data. PSO also demonstrates effectiveness in style transfer and concept customization by directly tuning timestep-distilled diffusion models.
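One way to read the pairwise-margin idea is as a DPO-style logistic objective on per-sample diffusion losses, comparing target training images against reference images sampled from the current distilled model, each measured relative to a frozen copy of the starting model. The sketch below is an interpretation under that assumption, not the paper's exact objective.

```python
import torch.nn.functional as F

def pso_pairwise_loss(loss_train, loss_ref, loss_train_frozen, loss_ref_frozen, beta=1.0):
    """Sketch of a pairwise-margin objective: push the fine-tuned model's
    denoising loss down on training images and up on self-generated reference
    images, relative to the frozen starting model.

    All arguments are per-sample diffusion losses of shape (batch,).
    """
    margin = (loss_train_frozen - loss_train) - (loss_ref_frozen - loss_ref)
    return -F.logsigmoid(beta * margin).mean()
```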

651Learn Your Reference Model for Real Good Alignment

[openreview] [pdf]

Abstract Despite the fact that offline methods for Large Language Model (LLM) alignment do not require a direct reward model, they remain susceptible to overoptimization. This issue arises when the trained model deviates excessively from the reference policy, leading to a decrease in sample quality. We propose a new paradigm of offline alignment methods, called Trust Region (including variants TR-DPO, TR-IPO, TR-KTO), which dynamically updates the reference policy throughout the training process. Our results show that TR alignment methods effectively mitigate overoptimization, enabling models to maintain strong performance even when substantially deviating from the initial reference policy. We demonstrate the efficacy of these approaches not only through toy examples that exhibit reduced overoptimization, but also through direct, side-by-side comparisons in specific tasks such as helpful and harmless dialogue, as well as summarization, where they surpass conventional methods. Additionally, we report significant improvements in general-purpose assistant setups with the Llama3 model on the AlpacaEval 2 and Arena-Hard benchmarks, highlighting the advantages of Trust Region methods over classical approaches.
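The distinguishing mechanism is refreshing the reference policy during training rather than freezing it. A minimal sketch of a soft (EMA-style) refresh is below; a hard-update variant would instead copy the trained weights into the reference every fixed number of steps. The update rate `alpha` is an illustrative hyperparameter.

```python
import torch

@torch.no_grad()
def soft_update_reference(policy, ref_policy, alpha=0.01):
    """Soft refresh of the reference policy toward the trained policy,
    one plausible instantiation of a dynamically updated reference."""
    for p, r in zip(policy.parameters(), ref_policy.parameters()):
        r.mul_(1 - alpha).add_(alpha * p)  # r <- (1 - alpha) * r + alpha * p
```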

652Reward Dimension Reduction for Scalable Multi-Objective Reinforcement Learning

[openreview] [pdf]

Abstract In this paper, we introduce a simple yet effective reward dimension reduction method to tackle the scalability challenges of multi-objective reinforcement learning algorithms. While most existing approaches focus on optimizing two to four objectives, their ability to scale to environments with more objectives remains uncertain. Our method uses a dimension reduction approach to enhance learning efficiency and policy performance in multi-objective settings. While most traditional dimension reduction methods are designed for static datasets, our approach is tailored for online learning and preserves Pareto-optimality after transformation. We propose a new training and evaluation framework for reward dimension reduction in multi-objective reinforcement learning and demonstrate the superiority of our method in an environment with sixteen objectives, significantly outperforming existing online dimension reduction methods.

653Generalized Anomaly Detection with Knowledge Exposure: The Dual Effects of Augmentation

[openreview] [pdf]

Abstract Anomaly detection involves identifying samples that deviate from the training data. While previous methods have demonstrated significant performance, our experiments reveal that their generalization ability declines substantially when faced with slight shifts in the test data. This limitation stems from an underlying assumption: these methods generally expect the distribution of normal test samples to closely resemble that of the training set, while anomalies are presumed to be far from this distribution. However, in real-world scenarios, test samples often experience varying degrees of distributional shift while retaining their semantic consistency. The ability to generalize to semantics-preserving transformations, while accurately flagging as anomalies those samples whose semantic meaning has changed, is critical for a model’s trustworthiness and reliability. For instance, while a rotation may alter the semantic meaning of a car in the context of anomaly detection, it typically preserves the meaning of an apple. Yet, current methods, particularly those based on contrastive learning, are likely to detect both as anomalies. This complexity underscores the need for dynamic learning procedures grounded in a deeper understanding of outliers. To address this, we propose a novel approach called Knowledge Exposure (KE), which incorporates external knowledge to interpret concept dynamics and distinguish between transformations that induce semantic shifts. Our approach improves generalization by leveraging insights from a pre-trained CLIP model to assess the significance of anomalies for each concept. Evaluations on datasets such as CIFAR-10, CIFAR-100, and SVHN demonstrate superior performance compared to previous methods, validating the effectiveness of our approach.

654Strategic Classification With Externalities

[openreview] [pdf]

Abstract We propose a new variant of the strategic classification problem: a principal reveals a classifier, and $n$ agents report their (possibly manipulated) features to be classified. Motivated by real-world applications, our model crucially allows the manipulation of one agent to affect another; that is, it explicitly captures inter-agent externalities. The principal-agent interactions are formally modeled as a Stackelberg game, with the resulting agent manipulation dynamics captured as a simultaneous game. We show that under certain assumptions, the pure Nash Equilibrium of this agent manipulation game is unique and can be efficiently computed. Leveraging this result, PAC learning guarantees are established for the learner: informally, we show that it is possible to learn classifiers that minimize loss on the distribution, even when a random number of agents are manipulating their way to a pure Nash Equilibrium. We also comment on the optimization of such classifiers through gradient-based approaches. This work sets the theoretical foundations for a more realistic analysis of classifiers that are robust against multiple strategic actors interacting in a common environment.
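Since the agent manipulation game has a unique pure Nash equilibrium under the stated assumptions, a natural way to compute it is simultaneous best-response iteration until a fixed point. The sketch below assumes a user-supplied `best_response` oracle (a hypothetical interface; the paper's efficient computation may differ).

```python
import numpy as np

def best_response_dynamics(x0, best_response, max_iter=1000, tol=1e-8):
    """Iterate simultaneous best responses until a fixed point (a pure NE).

    x0: (n, d) initial reported features; best_response(i, x) returns agent
    i's optimal report given everyone else's current reports x.
    """
    x = x0.copy()
    for _ in range(max_iter):
        x_new = np.stack([best_response(i, x) for i in range(len(x))])
        if np.linalg.norm(x_new - x) < tol:   # no agent wants to deviate
            return x_new
        x = x_new
    return x
```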

655Mitigating Dialogue Hallucination for Large Vision Language Models via Adversarial Instruction Tuning

[openreview] [pdf]

Abstract Mitigating hallucinations of Large Vision Language Models (LVLMs) is crucial to enhance their reliability for general-purpose assistants. This paper shows that such hallucinations of LVLMs can be significantly exacerbated by preceding user-system dialogues. To precisely measure this, we first present an evaluation benchmark by extending popular multi-modal benchmark datasets with prepended hallucinatory dialogues powered by our novel Adversarial Question Generator (AQG), which can automatically generate image-related yet adversarial dialogues by adopting adversarial attacks on LVLMs. On our benchmark, the zero-shot performance of state-of-the-art LVLMs drops significantly for both the VQA and Captioning tasks. Next, we further reveal this hallucination is mainly due to the prediction bias toward preceding dialogues rather than visual content. To reduce this bias, we propose Adversarial Instruction Tuning (AIT) that robustly fine-tunes LVLMs against hallucinatory dialogues. Extensive experiments show our proposed approach successfully reduces dialogue hallucination while maintaining performance.

656Concept-driven Off Policy Evaluation

[openreview] [pdf]

Abstract Evaluating a set of decisions from batch data, as in off-policy evaluation (OPE), is challenging: high variance and limited sample sizes can severely hinder reliable evaluation. Identifying and addressing the sources of this variance is essential for improving OPE performance. Recent work on Concept Bottleneck Models (CBMs) shows how a set of human-explainable concepts can be used for predictions, enabling clearer understanding and inspection of these models. Our work proposes incorporating concepts into OPE to identify and reduce variance through targeted interventions. For example, concepts such as shared disease characteristics could help predict better treatments, despite differing vital signs among two patients. We introduce a family of concept-based OPE estimators, and provide theoretical guarantees that when given a set of known concepts, these estimators are unbiased and reduce variance compared to traditional methods. However, in many real-world applications, these concepts are often unknown and need to be estimated. We develop an end-to-end algorithm for learning parameterized concepts that are interpretable, concise, diverse, and optimized for variance reduction in OPE. Through extensive experiments on synthetic and real-world datasets, we demonstrate that both known and learned concept-based estimators significantly improve OPE performance. Crucially, we show that unlike other methods for OPE, concept-based estimators can easily be interpreted and offer opportunities for targeted interventions on specific concepts of interest to further improve the quality of these estimators.
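As a rough illustration of how concepts can enter an OPE estimator, the sketch below marginalizes the importance weights onto a low-dimensional concept `c = concept_fn(x)`, which can reduce variance when concepts suffice to predict behavior. This is an interpretation of the idea, with hypothetical callables, not the paper's estimator family.

```python
import numpy as np

def concept_ips(rewards, actions, contexts, concept_fn, pi_e, pi_b):
    """Importance-sampling OPE estimate with policies marginalized onto
    concepts. pi_e(a, c) / pi_b(a, c) are evaluation/behavior probabilities
    conditioned on the concept rather than the full context.
    """
    c = [concept_fn(x) for x in contexts]
    w = np.array([pi_e(a, ci) / pi_b(a, ci) for a, ci in zip(actions, c)])
    return float((w * np.asarray(rewards)).mean())
```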

657FreeVS: Generative View Synthesis on Free Driving Trajectory

[openreview] [pdf]

Abstract Existing reconstruction-based novel view synthesis methods for driving scenes focus on synthesizing camera views along the recorded trajectory of the ego vehicle. Their image rendering performance will severely degrade on viewpoints falling out of the recorded trajectory, where camera rays are untrained. We propose FreeVS, a novel fully generative approach that can synthesize camera views on free new trajectories in real driving scenes. To control the generation results to be 3D consistent with the real scenes and accurate in viewpoint pose, we propose the pseudo-image representation of view priors to control the generation process. Viewpoint translation simulation is applied on pseudo-images to simulate camera movement in each direction. Once trained, FreeVS can be applied to any validation sequence without a reconstruction process and can synthesize views on novel trajectories. Moreover, we propose two new challenging benchmarks tailored to driving scenes, which are novel camera synthesis and novel trajectory synthesis, emphasizing the freedom of viewpoints. Given that no ground truth images are available on novel trajectories, we also propose to evaluate the consistency of images synthesized on novel trajectories with 3D perception models. Experiments on the Waymo Open Dataset show that FreeVS has a strong image synthesis performance on both the recorded trajectories and novel trajectories. The code will be released.

658A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals

[openreview] [pdf]

Abstract In this paper, we present empirical evidence of skills and directed exploration emerging from a simple RL algorithm long before any successful trials are observed. For example, in a manipulation task, the agent is given a single observation of the goal state (see Fig. 1) and learns skills, first for moving its end-effector, then for pushing the block, and finally for picking up and placing the block. These skills emerge before the agent has ever successfully placed the block at the goal location and without the aid of any reward functions, demonstrations, or manually-specified distance metrics. Once the agent has learned to reach the goal state reliably, exploration is reduced. Implementing our method involves a simple modification of prior work and does not require density estimates, ensembles, or any additional hyperparameters. Intuitively, the proposed method seems like it should be terrible at exploration, and we lack a clear theoretical understanding of why it works so effectively, though our experiments provide some hints.

659DIMS: Channel-Dependent and Seasonal-Trend Independent Transformer Using Multi-Stage Training for Time Series Forecasting

[openreview] [pdf]

Abstract Due to the limited size of real-world time series data, current transformer-based time series forecasting algorithms often struggle with overfitting. Common techniques used to mitigate overfitting include channel independence and seasonal-trend decomposition. However, channel independence inevitably results in the loss of inter-channel dependencies, and existing seasonal-trend decomposition methods are insufficient in effectively mitigating overfitting. In this study, we propose DIMS, a time series forecasting model that uses multi-stage training to capture inter-channel dependencies while ensuring the independence of seasonal and trend components. The computation of channel dependency is postponed to the later stage, following the channel-independent training, while the seasonal and trend components remain fully independent during the early training phases. This approach enables the model to effectively capture inter-channel dependencies while minimizing overfitting. Experiments show that our model outperforms the state-of-the-art transformer-based models on several datasets.

660Safety Alignment Should be Made More Than Just a Few Tokens Deep

[openreview] [pdf]

Abstract The safety alignment of current Large Language Models (LLMs) is vulnerable. Simple attacks, or even benign fine-tuning, can jailbreak aligned models. We note that many of these vulnerabilities are related to a shared underlying issue: safety alignment can take shortcuts, wherein the alignment adapts a model’s generative distribution primarily over only its very first few output tokens. We refer to this issue collectively as shallow safety alignment. In this paper, we present case studies to explain why shallow safety alignment can exist and show how this issue universally contributes to multiple recently discovered vulnerabilities in LLMs, including the susceptibility to adversarial suffix attacks, prefilling attacks, decoding parameter attacks, and fine-tuning attacks. The key contribution of this work is that we demonstrate how this consolidated notion of shallow safety alignment sheds light on promising research directions for mitigating these vulnerabilities. We show that deepening the safety alignment beyond the first few tokens can meaningfully improve robustness against some common exploits. We also design a regularized fine-tuning objective that makes the safety alignment more persistent against fine-tuning attacks by constraining updates on initial tokens. Overall, we advocate that future safety alignment should be made more than just a few tokens deep.

661Low Variance: A Bottleneck in Diffusion-Based Graph Imputation

[openreview] [pdf]

Abstract In this paper, we tackle learning tasks on graphs with missing features, improving the applicability of graph neural networks to real-world graph-structured data. Existing imputation methods based upon graph diffusion produce channels that have nearly identical values within each channel, and these low-variance channels contribute very little to performance in graph learning tasks. To prevent diffusion-based imputation from producing low-variance channels, we introduce synthetic features that address the cause of the production, thereby increasing variance in low-variance channels. Since the synthetic features prevent diffusion-based imputation models from generating meaningless feature values shared across all nodes, our synthetic feature propagation design prevents significant performance degradation, even under extreme missing rates. Extensive experiments demonstrate the effectiveness of our scheme across various graph learning tasks with missing features, ranging from low to extremely high missing rates. Moreover, we provide empirical evidence and theoretical proof that validate the low-variance problem.
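For context, diffusion-based imputation of the kind analyzed here repeatedly propagates features over the normalized adjacency while clamping observed entries; the paper's synthetic features would be injected into low-variance channels before this loop to keep per-channel variance from collapsing toward a constant. A minimal sketch of the base imputation:

```python
import torch

def diffusion_impute(x, adj_norm, known_mask, n_iters=40):
    """Feature-propagation-style imputation: diffuse features over the
    normalized adjacency, resetting observed entries each iteration.

    x: (n, d) features with zeros at missing entries;
    adj_norm: (n, n) normalized adjacency; known_mask: (n, d) bool.
    """
    out = x.clone()
    for _ in range(n_iters):
        out = adj_norm @ out               # graph diffusion step
        out[known_mask] = x[known_mask]    # clamp observed values
    return out
```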

662A Primal-Dual Approach for Dynamic Pricing of Sequentially Displayed Complementary Items under Sale Constraints

[openreview] [pdf]

Abstract We address the challenging problem of dynamically pricing complementary items that are sequentially displayed to customers. An illustrative example is the online sale of flight tickets, where customers navigate through multiple web pages. Initially, they view the ticket cost, followed by ancillary expenses such as insurance and additional luggage fees. Coherent pricing policies for complementary items are essential because optimizing the pricing of each item individually is ineffective. Our scenario also involves a sales constraint, which specifies a minimum number of items to sell, and uncertainty regarding customer demand curves. To tackle this problem, we formulate it as a constrained Markov decision process. Leveraging online learning tools, we design a primal-dual online optimization algorithm. We empirically evaluate our approach using synthetic settings randomly generated from real-world data, covering various configurations from stationary to non-stationary, and compare its performance in terms of constraint violation and regret against well-known baselines that optimize each state individually.

663Are LLMs Prescient? A Continuous Evaluation using Daily News as the Oracle

[openreview] [pdf]

Abstract Existing evaluation benchmarks of Large Language Models (LLMs) can become outdated due to continuous model updates and the evolving information landscape. This presents a significant challenge: How can we effectively evaluate LLMs in a way that remains relevant over time? To address this, we explore the potential of future event prediction as a continuous evaluation for LLMs, assessing their ability to make predictions about real-world events and exhibit temporal generalization. Towards this goal, we propose a continuous LLM evaluation using daily news. We automatically generate question-answer (QA) pairs from daily news, constructing our Daily Oracle dataset, which challenges LLMs to predict “future” events based on its pre-training data. Our findings show that as pre-training data becomes outdated, LLMs exhibit performance degradation over time. While the Retrieval Augmented Generation (RAG) technique can enhance prediction accuracy, the performance degradation pattern still exists, underscoring the necessity for ongoing model updates.

664Emerging Tracking from Video Diffusion

[openreview] [pdf]

Abstract We find that video diffusion models, renowned for their generative capabilities, surprisingly excel at pixel-level object tracking without any explicit training for this task. We introduce a simple and effective method to extract motion representations from video diffusion models, achieving state-of-the-art tracking results. Our approach enables the tracking of identical objects, overcoming limitations of previous methods reliant on intra-frame appearance correspondence. Visualizations and empirical results show that our approach outperforms recent supervised and self-supervised tracking methods, including the state-of-the-art, by up to 6 points. Our work demonstrates that video generative models can learn the intrinsic temporal dynamics of video and excel in tracking tasks beyond original video synthesis.

665Accelerating Diffusion Transformers with Token-wise Feature Caching

[openreview] [pdf]

Abstract Diffusion transformers have shown significant effectiveness in both image and video synthesis at the expense of huge computation costs. To address this problem, feature caching methods have been introduced to accelerate diffusion transformers by caching the features in previous timesteps and reusing them in the following timesteps. However, previous caching methods ignore that different tokens exhibit different sensitivities to feature caching, and feature caching on some tokens may degrade the overall generation quality up to 10X more than caching other tokens. In this paper, we introduce token-wise feature caching, allowing us to adaptively select the most suitable tokens for caching, and further enable us to apply different caching ratios to neural layers in different types and depths. Extensive experiments on PixArt-alpha, OpenSora, and DiT demonstrate our effectiveness in both image and video generation with no requirements for training. For instance, 2.36X and 1.93X acceleration are achieved on OpenSora and PixArt-alpha with almost no drop in generation quality. Code is included in the supplementary material and will be released on GitHub.
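A simplified sketch of token-wise selection: score each token by how much its feature moved between recently computed timesteps, recompute only the most sensitive fraction, and reuse cached features for the rest. The change-based score is an illustrative proxy, not necessarily the paper's criterion.

```python
import torch

def select_tokens_to_recompute(feat_prev, feat_curr, ratio=0.3):
    """Pick the tokens most sensitive to caching.

    feat_prev, feat_curr: (batch, tokens, dim) features from the two most
    recent fully computed timesteps. Returns indices of tokens to recompute;
    all other tokens reuse their cached features.
    """
    score = (feat_curr - feat_prev).norm(dim=-1)   # (batch, tokens)
    k = max(1, int(ratio * score.shape[1]))
    return score.topk(k, dim=1).indices
```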

666On the Convergence of No-Regret Dynamics in Information Retrieval Games with Proportional Ranking Functions

[openreview] [pdf]

Abstract Publishers who publish their content on the web act strategically, in a behavior that can be modeled within the online learning framework. Regret, a central concept in machine learning, serves as a canonical measure for assessing the performance of learning agents within this framework. We prove that any proportional content ranking function with a concave activation function induces games in which no-regret learning dynamics converge. Moreover, for proportional ranking functions, we prove the equivalence of the concavity of the activation function, the social concavity of the induced games, and the concavity of the induced games. We also study the empirical trade-offs between publishers’ and users’ welfare, under different choices of the activation function, using a state-of-the-art no-regret dynamics algorithm. Furthermore, we demonstrate how the choice of the ranking function and changes in the ecosystem structure affect these welfare measures, as well as the dynamics’ convergence rate.

667High Probability Bounds for Cross-Learning Contextual Bandits with Unknown Context Distributions

[openreview] [pdf]

Abstract Motivated by applications in online bidding and sleeping bandits, we examine the problem of contextual bandits with cross learning, where the learner observes the loss associated with the action across all possible contexts, not just the current round’s context. Our focus is on a setting where losses are chosen adversarially, and contexts are sampled i.i.d. from a specific distribution. This problem was first studied by Balseiro et al. (2019), who proposed an algorithm that achieves near-optimal regret under the assumption that the context distribution is known in advance. However, this assumption is often unrealistic. To address this issue, Schneider & Zimmert (2023) recently proposed a new algorithm that achieves nearly optimal expected regret. It is well-known that expected regret can be significantly weaker than high-probability bounds. In this paper, we present a novel, in-depth analysis of their algorithm and demonstrate that it actually achieves near-optimal regret with high probability. There are steps in the original analysis by Schneider & Zimmert (2023) that lead only to an expected bound by nature. In our analysis, we introduce several new insights. Specifically, we make extensive use of the weak dependency structure between different epochs, which was overlooked in previous analyses. Additionally, standard martingale inequalities are not directly applicable, so we refine martingale inequalities to complete our analysis.

668Bayesian Policy Distillation via Offline RL for Lightweight and Fast Inference

[openreview] [pdf]

Abstract High-performance deep reinforcement learning faces tremendous challenges when implemented on cost-effective low-end embedded systems due to its heavy computational burden. To address this issue, we propose a policy distillation method called Bayesian Policy Distillation (BPD), which effectively retrains small-sized neural networks through an offline reinforcement learning approach. BPD exploits Bayesian neural networks to distill already designed high-performance policy networks by adopting value-optimizing, behavior-cloning, and sparsity-inducing strategies. Simulation results reveal that the proposed BPD successfully compresses the policy networks, making them lighter and achieving faster inference time. Furthermore, the proposed approach is demonstrated on a real inverted pendulum system, reducing the inference time and memory size by 78% and 98%, respectively.

669Revisiting Source-Free Domain Adaptation: a New Perspective via Uncertainty Control

[openreview] [pdf]

Abstract Source-Free Domain Adaptation (SFDA) seeks to adapt a pre-trained source model to the target domain using only unlabeled target data, without access to the original source data. While current state-of-the-art (SOTA) methods rely on leveraging weak supervision from the source model to extract reliable information for self-supervised adaptation, they often overlook the uncertainty that arises during the transfer process. In this paper, we conduct a systematic and theoretical analysis of the uncertainty inherent in existing SFDA methods and demonstrate its impact on transfer performance through the lens of Distributionally Robust Optimization (DRO). Building upon the theoretical results, we propose a novel instance-dependent uncertainty control algorithm for SFDA. Our method is designed to quantify and exploit the uncertainty during the adaptation process, significantly improving the model performance. Extensive experiments on benchmark datasets and empirical analyses confirm the validity of our theoretical findings and the effectiveness of the proposed method. This work offers new insights into understanding and advancing SFDA performance.

670Positive Mining in Graph Contrastive Learning

[openreview] [pdf]

Abstract Graph Contrastive Learning (GCL), which aims to capture representations from unlabeled graphs, has made significant progress in recent years. In GCL, InfoNCE-based loss functions play a crucial role by ensuring that positive node pairs—those that are similar—are drawn closer together in the representational space, while negative pairs, which are dissimilar, are pushed apart. The primary focus of recent research has been on refining the contrastive loss function, particularly by adjusting the weighting of negative nodes. This is achieved by changing the weight between negative node pairs, or by using node similarity to select the positive node associated with the anchor node. Despite the substantial success of these GCL techniques, there remains a belief that the nodes identified as positive or negative may not accurately reflect the true positives and negatives. To tackle this challenge, we introduce an innovative method known as Positive Mining Graph Contrastive Learning (PMGCL). This method consists of calculating the probability of positive samples between the anchor node and other nodes using a mixture model, thereby identifying nodes that have a higher likelihood of being true positives in relation to the anchor node. We have conducted a comprehensive evaluation of PMGCL on a range of real-world graph datasets. The experimental findings indicate that PMGCL significantly outperforms traditional GCL methods. Our method not only achieves state-of-the-art results in unsupervised learning benchmarks but also exceeds the performance of supervised learning benchmarks in certain scenarios.

671DPaI: Differentiable Pruning at Initialization with Node-Path Balance Principle

[openreview] [pdf]

Abstract Pruning at Initialization (PaI) is a technique in neural network optimization characterized by the proactive elimination of weights before the network’s training on designated tasks. This innovative strategy potentially reduces the costs for training and inference, significantly advancing computational efficiency. A key element of PaI’s effectiveness is that it considers the significance of weights in an untrained network. It prioritizes the trainability and optimization potential of the pruned subnetworks. Recent methods can effectively prevent the formation of hard-to-optimize networks, e.g., through iterative adjustments at each network layer. However, this often results in large-scale discrete optimization problems, which can make PaI even more challenging. This paper introduces a novel method, called DPaI, that involves a differentiable optimization of the pruning mask. DPaI adopts a dynamic and adaptable pruning process, allowing easier optimization processes and better solutions. More importantly, our differentiable formulation enables ready use of the existing rich body of efficient gradient-based methods for PaI. Our empirical results demonstrate that DPaI significantly outperforms current state-of-the-art PaI methods on various architectures, such as Convolutional Neural Networks and Vision-Transformers.

672POTEC: Off-Policy Contextual Bandits for Large Action Spaces via Policy Decomposition

[openreview] [pdf]

Abstract We study off-policy learning (OPL) of contextual bandit policies in large discrete action spaces where existing methods -- most of which rely crucially on reward-regression models or importance-weighted policy gradients -- fail due to excessive bias or variance. To overcome these issues in OPL, we propose a novel two-stage algorithm, called Policy Optimization via Two-Stage Policy Decomposition (POTEC). It leverages clustering in the action space and learns two different policies via policy- and regression-based approaches, respectively. In particular, we derive a novel low-variance gradient estimator that enables learning a first-stage policy for cluster selection efficiently via a policy-based approach. To select a specific action within the cluster sampled by the first-stage policy, POTEC uses a second-stage policy derived from a regression-based approach within each cluster. We show that a local correctness condition, which only requires that the regression model preserves the relative expected reward differences of the actions within each cluster, ensures that our policy-gradient estimator is unbiased and the second-stage policy is optimal. We also show that POTEC provides a strict generalization of policy- and regression-based approaches and their associated assumptions. Comprehensive experiments demonstrate that POTEC provides substantial improvements in OPL effectiveness particularly in large and structured action spaces.

673Long-Term Fairness in Reinforcement Learning with Bisimulation Metrics

[openreview] [pdf]

Abstract Ensuring long-term fairness is crucial when developing automated decision making systems, specifically in dynamic and sequential environments. By maximizing their reward without consideration of fairness, AI agents can introduce disparities in their treatment of groups or individuals. In this paper, we establish the connection between bisimulation metrics and group fairness in reinforcement learning. We propose a novel approach that leverages bisimulation metrics to learn reward functions and observation dynamics, ensuring that learners treat groups fairly while reflecting the original problem. We demonstrate the effectiveness of our method in addressing disparities in sequential decision making problems through empirical evaluation on a standard fairness benchmark consisting of lending and college admission scenarios.

674Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching

[openreview] [pdf]

Abstract In inverse reinforcement learning (IRL), an agent seeks to replicate expert demonstrations through interactions with the environment. Traditionally, IRL is treated as an adversarial game, where an adversary searches over reward models, and a learner optimizes the reward through repeated RL procedures. This game-solving approach is both computationally expensive and difficult to stabilize. In this work, we propose a novel approach to IRL by direct policy optimization: exploiting a linear factorization of the return as the inner product of successor features and a reward vector, we design an IRL algorithm by policy gradient descent on the gap between the learner and expert features. Our non-adversarial method does not require learning a reward function and can be solved seamlessly with existing actor-critic RL algorithms. Remarkably, our approach works in state-only settings without expert action labels, a setting which behavior cloning (BC) cannot solve. Empirical results demonstrate that our method learns from as few as a single expert demonstration and achieves improved performance on various control tasks.
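The return factorization makes the objective concrete: estimate successor features for learner and expert and descend on their gap. A minimal sketch with Monte Carlo estimates follows; the actor-critic plumbing that takes the policy gradient of this gap is omitted.

```python
import torch

def successor_features(phi, gamma=0.99):
    """Monte Carlo successor-feature estimate for one trajectory:
    discounted sum of per-step state features phi of shape (T, d)."""
    disc = gamma ** torch.arange(phi.shape[0], dtype=phi.dtype)
    return (disc[:, None] * phi).sum(dim=0)

def sf_matching_loss(psi_learner, psi_expert):
    """Squared gap between learner and expert successor features.
    No reward function or adversary is needed: driving this gap to zero
    matches expert behavior (even without expert action labels)."""
    return (psi_learner - psi_expert).pow(2).sum()
```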

675You Can Train from Scratch: Further Discussion on the Long Range Arena

[openreview] [pdf]

Abstract Despite their success, Transformers suffer from quadratic complexity in the sequence length, limiting their applicability to long-range dependency problems and making them expensive to train and run. After many proposals to address this issue, the Long Range Arena (LRA) was suggested as a benchmark to evaluate the performance of new models in long-range dependency modeling tasks. The Transformer and its variants performed poorly on this benchmark, and a new series of architectures such as State Space Models (SSMs) gained some traction, greatly outperforming Transformers in the LRA. Recent work has shown that with a denoising pretraining phase, Transformers can achieve competitive results in the LRA with these new architectures. In this work, we show that one can achieve the same result without a separate pretraining phase, using other training techniques. This reduces the computational burden of training and eliminates the risk of representation collapse during fine-tuning. We argue that LRA tasks are very positional and provide evidence that short-range dependencies account for a significant portion of the performance. This explains prior differences in LRA accuracy between the Transformer and new architectures, which have better positional and local biases. Our training techniques alleviate these differences up to a point, and rotary embeddings add further improvements by including these positional biases. Given these insights, LRA results should be interpreted with caution, and should be analyzed given the model’s inductive biases and the nature of the tasks.

676ContextGNN: Beyond Two-Tower Recommendation Systems

[openreview] [pdf]

Abstract Recommendation systems predominantly utilize two-tower architectures, which evaluate user-item rankings through the inner product of their respective embeddings. However, one key limitation of two-tower models is that they learn a pair-agnostic representation of users and items. In contrast, pair-wise representations either scale poorly due to their quadratic complexity or are too restrictive on the candidate pairs to rank. To address these issues, we introduce Context-based Graph Neural Networks (ContextGNNs), a novel deep learning architecture for link prediction in recommendation systems. The method employs a pair-wise representation technique for familiar items situated within a user’s local subgraph, while leveraging two-tower representations to facilitate the recommendation of exploratory items. A final network then predicts how to fuse both pair-wise and two-tower recommendations into a single ranking of items. We demonstrate that ContextGNN is able to adapt to different data characteristics and outperforms existing methods, both traditional and GNN-based, on a diverse set of practical recommendation tasks, improving performance by 20% on average.

677Everyone Deserves Recourse: Feasible Recourse Paths Using Data Augmentation

[openreview] [pdf]

Abstract Decisions made using machine learning models can negatively impact individuals in critical applications such as healthcare and finance by denying essential services or access to opportunity. Algorithmic recourse supplements a negative AI decision by providing rejected individuals with advice on the changes they can make to their profiles, so that they may eventually achieve the desired outcome. Most existing recourse methods provide single-step changes by using counterfactual explanations. These counterfactual explanations are computed assuming a fixed (not learned) distance function. Further, few works consider providing more realistic multi-step changes in the form of recourse paths. However, such methods may fail to provide any recourse path for some individuals or provide paths that might not be feasible, since intermediate steps needed to reach the counterfactual explanation may not be realizable. We introduce a framework for learning an optimal distance function and threshold to compute multi-step recourse paths for all. First, we formalize the problem of finding multi-step recourse paths. Given a set of feasible transitions, we propose a data-driven framework for learning the optimal distance and threshold for each step with PAC (Probably Approximately Correct) guarantees. Finally, we provide a data augmentation algorithm to ensure that a solution exists for all individuals. Experiments on several datasets show that the proposed method learns feasible recourse paths for all individuals.

678Map to Optimal: Adapting Graph Out-of-Distribution in Test Time

[openreview] [pdf]

Abstract Based on topological proximity message passing, graph neural networks (GNNs) can quickly model data patterns on graphs. However, at test time, when the node feature and topological structure of the graph data are out-of-distribution (OOD), the performance of pre-trained GNNs will be hindered. Existing test-time methods either fine-tune the pre-trained model or overlook the discrepancy between the prior knowledge in pre-trained models and the test graph. We propose a novel self-supervised test-time adaptation paradigm GOAT (https://anonymous.4open.science/r/GOAT-5C0E), built on a graph augmentation-to-augmentation strategy, which enables a simple adapter to mitigate the distribution gap between training data and test-time data. GOAT reduces generalization error for node classification in various pre-trained settings through experiments on six benchmark datasets spanning three distinct real-world OOD scenarios. Remarkably, GOAT outperforms state-of-the-art test-time methods, and our empirical study further demonstrates the interpretability of the OOD representation generated from our method.

679Markovian Compression: Looking to the Past Helps Accelerate the Future

[openreview] [pdf]

Abstract This paper deals with distributed optimization problems that use compressed communication to achieve efficient performance and mitigate the communication bottleneck. We propose a family of compression schemes in which operators transform vectors fed to their input according to a Markov chain, i.e., the stochasticity of the compressors depends on previous iterations. Intuitively, this should accelerate the convergence of optimization methods, as considering previous iterations seems more natural and robust. The compressors are implemented in the vanilla Quantized Stochastic Gradient Descent (QSGD) algorithm. To further improve efficiency and convergence rate, we apply the momentum acceleration method. We prove convergence results for our algorithms with Markovian compressors and show theoretically that the accelerated method converges faster than the basic version. The analysis covers non-convex, Polyak-Lojasiewicz (PL), and strongly convex cases. Experiments are conducted to demonstrate the applicability of the results to distributed data-parallel optimization problems. Practical results demonstrate the superiority of methods utilizing our compressor design over several existing optimization algorithms.
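To make the idea concrete, the sketch below equips a QSGD-style stochastic quantizer with dither that mixes fresh noise with the previous iteration's noise, so the compressor's randomness forms a Markov chain. The specific correlation scheme is an illustrative assumption, not the paper's construction (and, unlike plain QSGD, it is not exactly unbiased).

```python
import numpy as np

class MarkovianQuantizer:
    """QSGD-style quantizer whose dither follows a Markov chain across
    iterations: u_t = rho * u_{t-1} + (1 - rho) * fresh uniform noise."""
    def __init__(self, levels=16, rho=0.9, seed=0):
        self.levels, self.rho = levels, rho
        self.rng = np.random.default_rng(seed)
        self.prev_u = None

    def compress(self, g):
        scale = np.linalg.norm(g) + 1e-12
        u = self.rng.random(g.shape)
        if self.prev_u is not None:
            u = self.rho * self.prev_u + (1 - self.rho) * u  # Markov dependence
        self.prev_u = u
        y = np.abs(g) / scale * self.levels
        q = np.floor(y + u)                                  # stochastic rounding
        return np.sign(g) * scale * q / self.levels
```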

680Improved Risk Bounds with Unbounded Losses for Transductive Learning

[openreview] [pdf]

Abstract In the transductive learning setting, we are provided with a labeled training set and an unlabeled test set, with the objective of predicting the labels of the test points. This framework differs from the standard problem of fitting an unknown distribution with a training set drawn independently from this distribution. In this paper, we primarily improve the generalization bounds in transductive learning. Specifically, we develop two novel concentration inequalities for the suprema of empirical processes sampled without replacement for unbounded functions, marking the first discussion of the generalization performance of unbounded functions in the context of sampling without replacement. We further provide two valuable applications of our new inequalities: on one hand, we first derive fast excess risk bounds for empirical risk minimization in transductive learning under unbounded losses. On the other hand, we establish high-probability bounds on the generalization error for graph neural networks when using stochastic gradient descent which improve the current state-of-the-art results.

681Trajectory-level Data Generation with Better Alignment for Offline Imitation Learning

[openreview] [pdf]

Abstract Offline reinforcement learning (RL) relies heavily on densely precise reward signals, which are labor-intensive and challenging to obtain in many real-world scenarios. To tackle this challenge, offline imitation learning (IL) extracts optimal policies from expert demonstrations and datasets without reward labels. However, the scarcity of expert data and the abundance of suboptimal trajectories within the dataset impede the application of supervised learning methods like behavior cloning (BC). While previous research has focused on learning importance weights for BC or reward functions to integrate with offline RL algorithms, these approaches often result in suboptimal policy performance due to training instabilities and inaccuracies in learned weights or rewards. To address this problem, we introduce Trajectory-level Data Generation with Better Alignment (TDGBA), an algorithm that leverages alignment measures between unlabeled trajectories and expert demonstrations to guide a diffusion model in generating highly aligned trajectories. With these trajectories, BC can be directly applied to extract optimal policies without the need for weight or reward learning. Moreover, to ensure high fidelity and diversity in the generated trajectories and to make the learning more stable, the implicit expert preference that can fully exploit the unlabeled data is employed in the training of the diffusion model. Experimental results on the D4RL benchmarks demonstrate that TDGBA significantly outperforms state-of-the-art offline IL methods. Additionally, the analysis of the generated trajectories shows the effectiveness of incorporating the diffusion model and implicit expert preference for trajectory-level data generation.

682GuideCO: Training Objective-Guided Diffusion Solver with Imperfect Data for Combinatorial Optimization

[openreview] [pdf]

Abstract Combinatorial optimization (CO) problems have widespread applications in science and engineering but they present significant computational challenges. Recent advancements in generative models, particularly diffusion models, have shown promise in bypassing traditional optimization solvers by directly generating near-optimal solutions. However, we observe an exponential scaling law between the optimality gap and the amount of training data needed for training diffusion-based solvers. Notably, the performance of existing diffusion solvers relies on both quantity and quality of training data: they perform well with abundant high quality training data labeled by exact or near-optimal solvers, while suffering when high-quality labels are scarce or unavailable. To address the challenge, we propose GuideCO, an objective-guided diffusion solver for combinatorial optimization, which can be trained on imperfectly labelled datasets. GuideCO is a two-stage generate-then-decode framework, featuring an objective-guided diffusion model that is further reinforced by classifier-free guidance for generating high-quality solutions on any given problem instance. Experiments demonstrate the improvements of GuideCO against baselines when trained on imperfect data, in a range of combinatorial optimization benchmark tasks such as TSP (Traveling Salesman Problem) and MIS (Maximum Independent Set).

683POMDiffuser: Long-Memory Meets Long-Planning for POMDPs

[openreview] [pdf]

Abstract Effective long-term planning in complex environments benefits from not only leveraging immediate information but also utilizing past experiences. Drawing inspiration from how humans use long-term memory in decision-making, we propose the POMDiffuser framework, an approach to planning in partially observable environments. While conventional Diffuser models often memorize specific environments, POMDiffuser explores the potential of learning to plan from memory, with the aim of generalizing to new scenarios. By incorporating a memory mechanism in POMDP scenarios, our model extends diffusion-based planning models into the realm of meta-learning with carefully designed tasks that require the diffusion planner to demonstrate both long-term planning and memory utilization. We investigate existing diffusion-based models, focusing on their applicability, computational efficiency, and performance trade-offs.

684Adversarial Attack Robust Dataset Pruning

[openreview] [pdf]

Abstract Dataset pruning, while effective for reducing training data size, often leads to models vulnerable to adversarial attacks. This paper introduces a novel approach to create adversarially robust coresets. We first theoretically analyze how existing pruning methods result in non-smooth loss surfaces, increasing susceptibility to attacks. To address this, we propose two key innovations: (1) a Frequency-Selective Excitation Network (FSE-Net) that dynamically selects important frequency components, smoothing the loss surface while reducing storage requirements, and (2) a “Joint-entropy” score for selecting stable and informative samples. Our method significantly outperforms state-of-the-art pruning algorithms across various adversarial attacks and pruning ratios. On CIFAR-10, our approach achieves up to 58.19% accuracy under AutoAttack with an 80% pruning ratio, compared to 42.98% for previous methods. Moreover, our frequency pruning technique improves robustness even on full datasets, demonstrating its potential for enhancing model security while reducing computational costs.

685Natural Policy Gradient for Average Reward Non-Stationary RL

[openreview] [pdf]

Abstract We consider the problem of non-stationary reinforcement learning (RL) in the infinite-horizon average-reward setting. We model it by a Markov Decision Process with time-varying rewards and transition probabilities, with a variation budget of $\Delta_T$. Existing non-stationary RL algorithms focus on model-based and model-free value-based methods. Policy-based methods, however, despite their flexibility in practice, are not theoretically well understood in non-stationary RL. We propose the first model-free policy-based algorithm, Non-Stationary Natural Actor-Critic (NS-NAC), a policy gradient method with a novel interpretation of learning rates as adapting factors. We present a dynamic regret of $\tilde{\mathcal{O}}(|\mathcal{S}|^{1/2}|\mathcal{A}|^{1/2}\Delta_T^{1/9}T^{8/9})$, where $T$ is the time horizon and $|\mathcal{S}|$, $|\mathcal{A}|$ are, respectively, the sizes of the state and action spaces. The regret analysis relies on adapting the Lyapunov-function-based analysis to dynamic environments and characterizing the effects of simultaneous changes in policy and the environment on estimates of the value function and average reward.

686Lasso Bandit with Compatibility Condition on Optimal Arm

[openreview] [pdf]

Abstract We consider a stochastic sparse linear bandit problem where only a sparse subset of context features affects the expected reward function, i.e., the unknown reward parameter has sparse structure. In the existing Lasso bandit literature, compatibility conditions, together with additional diversity conditions on the context features, are imposed to achieve regret bounds that only depend logarithmically on the ambient dimension $d$. In this paper, we demonstrate that even without the additional diversity assumptions, the compatibility condition on the optimal arm is sufficient to derive a regret bound that depends logarithmically on $d$, and our assumption is strictly weaker than those used in the Lasso bandit literature under the single-parameter setting. We propose an algorithm that adapts the forced-sampling technique and prove that the proposed algorithm achieves $\mathcal{O}(\mathrm{poly}\log dT)$ regret under the margin condition. To our knowledge, the proposed algorithm requires the weakest assumptions among Lasso bandit algorithms under the single-parameter setting that achieve $\mathcal{O}(\mathrm{poly}\log dT)$ regret. Through numerical experiments, we confirm the superior performance of our proposed algorithm.
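
A minimal sketch of the forced-sampling pattern the algorithm adapts: explore uniformly on a sparse schedule, otherwise act greedily on a Lasso estimate of the sparse reward parameter. The schedule and regularization choices below are placeholders, not the paper's:

```python
import numpy as np
from sklearn.linear_model import Lasso

def lasso_bandit_action(t, contexts, X_hist, r_hist, forced_rounds, lam=0.1):
    # contexts: (num_arms, d) feature vectors for the current round.
    if t in forced_rounds:                        # forced exploration round
        return np.random.randint(len(contexts))
    theta_hat = Lasso(alpha=lam).fit(X_hist, r_hist).coef_  # sparse estimate
    return int(np.argmax(contexts @ theta_hat))   # greedy on estimated rewards
```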

687Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning

[openreview] [pdf]

Abstract In domains such as finance, healthcare, and robotics, managing worst-case scenarios is critical, as failure to do so can lead to catastrophic outcomes. Distributional Reinforcement Learning (DRL) provides a natural framework to incorporate risk sensitivity into decision-making processes. However, existing approaches face two key limitations: (1) the use of fixed risk measures at each decision step often results in overly conservative policies, and (2) the interpretation and theoretical properties of the learned policies remain unclear. While optimizing a static risk measure addresses these issues, its use in the DRL framework has been limited to the simple static CVaR risk measure. In this paper, we present a novel DRL algorithm with convergence guarantees that optimizes for a broader class of static Spectral Risk Measures (SRM). Additionally, we provide a clear interpretation of the learned policy by leveraging the distribution of returns in DRL and the decomposition of static coherent risk measures. Extensive experiments demonstrate that our model learns policies aligned with the SRM objective, and outperforms existing risk-neutral and risk-sensitive DRL models in various settings.
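
Concretely, a static spectral risk measure is a weighted average of return quantiles, with non-increasing weights that emphasize the worst outcomes; CVaR is the special case of a flat weight on the lowest α-quantiles. A discretized sketch (the function name and discretization are ours, not the paper's):

```python
import numpy as np

def spectral_risk(returns, sigma):
    # sigma: weight function on quantile levels u in [0, 1]; non-negative and
    # non-increasing, emphasizing the worst (lowest) returns.
    sorted_r = np.sort(np.asarray(returns))       # ascending: worst first
    u = (np.arange(len(sorted_r)) + 0.5) / len(sorted_r)
    w = sigma(u)
    w = w / w.sum()                               # normalize discretized weights
    return float((w * sorted_r).sum())

# CVaR at level 0.1 as a spectral risk measure: average of the worst 10%.
cvar_10 = spectral_risk(np.random.randn(10_000), lambda u: (u <= 0.1) / 0.1)
```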

688TabWak: A Watermark for Tabular Diffusion Models

[openreview] [pdf]

Abstract Synthetic data offers alternatives for data augmentation and sharing. To date, it remains unknown how to use watermarking techniques to trace and audit synthetic tables generated by tabular diffusion models to mitigate potential misuses. In this paper, we design TabWak, the first watermarking method to embed invisible signatures that control the sampling of Gaussian latent codes used to synthesize table rows via the diffusion backbone. TabWak has two key features. Different from existing image watermarking techniques, TabWak uses self-cloning and shuffling to embed the secret key in positional information of random seeds that control the Gaussian latents, allowing different seeds to be used at each row for high inter-row diversity and enabling row-wise detectability. To further boost the robustness of watermark detection against post-editing attacks, TabWak uses a valid-bit mechanism that focuses on the tail of the latent code distribution for superior noise resilience. We provide theoretical guarantees on the row diversity and effectiveness of detectability. We evaluate TabWak on five datasets against baselines to show that the quality of watermarked tables remains nearly indistinguishable from non-watermarked tables while achieving high detectability in the presence of strong post-editing attacks, with a 100% true positive rate at a 0.1% false positive rate on synthetic tables with fewer than 300 rows. Our code is available at the following anonymized repository: https://anonymous.4open.science/r/TabWak-4E65/.

689Influence-based Attributions can be Manipulated

[openreview] [pdf]

Abstract Influence Functions are a standard tool for attributing predictions to training data in a principled manner and are widely used in applications such as data valuation and fairness. In this work, we present realistic incentives to manipulate influence-based attributions and investigate whether these attributions can be systematically tampered with by an adversary. We show that this is indeed possible for logistic regression models trained on ResNet feature embeddings and standard tabular fairness datasets, and we provide efficient attacks with backward-friendly implementations. Our work raises questions about the reliability of influence-based attributions in adversarial circumstances.
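
For context, the influence-function approximation being attacked is the standard one (Koh & Liang, 2017): up-weighting a training point changes the loss at a test point by roughly the quantity below, which is what an adversary would steer. The attack itself is not sketched here:

```python
import numpy as np

def influence(grad_test, grad_train, hessian):
    # Influence of training point z on a test prediction:
    #   I(z, z_test) = -grad_test^T H^{-1} grad_train,
    # where H is the Hessian of the training loss at the learned parameters.
    return -grad_test @ np.linalg.solve(hessian, grad_train)
```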

690Safe Meta-Reinforcement Learning via Dual-Method-Based Policy Adaptation: Near-Optimality and Anytime Safety Guarantee

[openreview] [pdf]

Abstract This paper studies the safe meta-reinforcement learning (safe meta-RL) problem where anytime safety is ensured during the meta-test. We develop a safe meta-RL framework that consists of two modules, safe policy adaptation and safe meta-policy training, and propose efficient algorithms for the two modules. Beyond existing safe meta-RL analyses, we prove the anytime safety guarantee of policy adaptation and provide a lower bound of the expected total reward of the adapted policies compared with the optimal policies, which shows that the adapted policies are nearly optimal. Our experiments demonstrate three key advantages over existing safe meta-RL methods: (i) superior optimality, (ii) anytime safety guarantee, and (iii) high computational efficiency.

691Eligibility Traces for Confounding Robust Off-Policy Evaluation: A Causal Approach

[openreview] [pdf]

Abstract A unifying theme in Artificial Intelligence is learning an effective policy to control an agent in an unknown environment in order to optimize a certain performance measure. Off-policy methods can significantly improve the sample efficiency during training since they allow an agent to learn from observed trajectories generated by different behavior policies, without directly deploying the target policies in the underlying environment. This paper studies off-policy evaluation from biased offline data where (1) unobserved confounding bias cannot be ruled out a priori; or (2) the observed trajectories do not overlap with intended behaviors of the learner, i.e., the target and behavior policies do not share a common support. Specifically, we first extend Bellman’s equation to derive effective closed-form bounds over value functions from the observational distribution contaminated with unobserved confounding and no-overlap. Second, we propose two novel algorithms that use eligibility traces to estimate these bounds from finite observational data. Compared to other partial identification methods for off-policy evaluation in sequential environments, these methods are model-free and do not rely on additional parametric knowledge about the system dynamics in the underlying environment.

692Numerical Pitfalls in Policy Gradient Updates

[openreview] [pdf]

Abstract Numerical instability, such as gradient explosion, is a fundamental problem in practical deep reinforcement learning (DRL) algorithms. Beyond anecdotal debugging heuristics, there is a lack of systematic understanding of the causes of the numerical sensitivity that leads to exploding gradient failures in practice. In this work, we demonstrate that the issue arises from the ill-conditioned density ratio in the surrogate objective that comes from importance sampling, which can take excessively large values during training. Perhaps surprisingly, while various policy optimization methods such as TRPO and PPO prevent excessively large policy updates, their optimization constraints on KL divergence and probability ratio cannot guarantee numerical stability. This also explains why gradient explosion often occurs during DRL training, even with code-level optimizations. We also discuss several potential approaches to ensure numerical stability and the challenges associated with them.
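
The ill-conditioned quantity in question is the importance-sampling ratio in the clipped surrogate. The sketch below shows why clipping the objective does not bound the ratio itself: `exp(logp_new - logp_old)` can still overflow when the policies diverge, which is the failure mode the paper analyzes:

```python
import torch

def ppo_clipped_surrogate(logp_new, logp_old, adv, clip_eps=0.2):
    ratio = torch.exp(logp_new - logp_old)   # unbounded density ratio
    clipped = torch.clamp(ratio, 1 - clip_eps, 1 + clip_eps)
    # The clip caps the *objective*, not the ratio itself or its gradients on
    # samples where the unclipped branch is active.
    return torch.min(ratio * adv, clipped * adv).mean()
```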

693Retrospective Learning from Interactions

[openreview] [pdf]

Abstract Multi-turn language interactions naturally include implicit feedback signals. For example, if a listener responds in an unexpected way to an instruction, the instructor may rephrase it, express frustration, or pivot to an alternative task. These signals are task-independent and occupy a relatively constrained subspace of language, allowing a language model to identify them even if it fails on the actual task. This holds the promise of continually learning and improving from interactions without additional annotations. We introduce ReSpect, a method to learn from signals in past interactions via retrospection. We deploy ReSpect in a new multimodal interaction scenario, where humans instruct an LLM to solve an abstract reasoning task with a combinatorial solution space. Through thousands of interactions with humans, we show how ReSpect gradually improves task completion rate from 31% to 82%, all without any external annotation.

694Policy Design in Long-run Welfare Dynamics

[openreview] [pdf]

Abstract We study a stochastic dynamic model of long-term welfare in a population. Individuals in our model have welfare that improves with intervention and deteriorates in the absence of treatment. The planner can treat one individual at each time step. We contrast two fundamental policies in our model. The utilitarian policy greedily maximizes welfare improvement at each step. The Rawlsian policy intervenes on the individual of lowest welfare. Although hugely influential as a normative proposal, Rawlsian policies have been criticized for failing to optimize social welfare. We prove that, surprisingly, in a meaningful range of parameters the Rawlsian policy has greater long-run utility than the utilitarian policy, even though it is inferior on short time horizons. Specifically, this is true provided that treatment effects satisfy a weak homogeneity assumption and the welfare dynamics satisfy a rich-get-richer and poor-get-poorer condition. We extend our results with a comprehensive comparison of different policies under different parameter regimes. Through semi-synthetic simulation studies, we evaluate various policies in cases where the assumptions of our theorems do not hold. Our results illustrate that comparing policies based on short-term evaluations can lead to misleading conclusions.

695Learning with Real-time Improving Predictions in Online MDPs

[openreview] [pdf]

Abstract In this paper, we introduce the Decoupling Optimistic Online Mirror Descent (DOOMD) algorithm, a novel online learning approach designed for episodic Markov Decision Processes with real-time improving predictions. Unlike conventional methods that employ a fixed policy throughout each episode, our approach allows for continuous updates of both predictions and policies within an episode. To achieve this, the DOOMD algorithm decomposes decision-making across states, enabling each state to execute an individual sub-algorithm that considers both immediate and long-term effects on future decisions. We theoretically establish a sub-linear regret bound for the algorithm, providing a guarantee on the worst-case performance.

696The Discretization Complexity Analysis of Consistency Models under Variance Exploding Forward Process

[openreview] [pdf]

Abstract Consistency models, a new class of one-step generative models, have shown state-of-the-art performance in one-step generation and achieve competitive performance compared to multi-step diffusion models. The most challenging part of consistency models is the training process, which discretizes the diffusion process and trains a consistency function to map any point at any discretized timepoint of the diffusion process to the data distribution. Despite the empirical success, only a few works focus on the discretization complexity of consistency models. However, the settings of those works are far from those of empirically successful consistency models, suffer from large discretization complexity, and fail to explain the empirical success of consistency models. To bridge the gap between theory and application, we analyze consistency models with two key properties: (1) a variance exploding forward process and (2) a gradually decaying discretization stepsize, both of which are widely used in empirical consistency models. Under this realistic setting, we take the first step toward explaining the empirical success of consistency models and achieve the state-of-the-art discretization complexity for consistency models, which is competitive with the results for diffusion models. After obtaining the results for the one-step sampling method of consistency models, we further analyze a multi-step consistency sampling algorithm proposed by \citet{song2023consistency} and show that this algorithm improves the discretization complexity compared with one-step generation, which matches the empirical observation.
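
The multi-step sampler analyzed here alternates denoising with re-noising, following the algorithm in \citet{song2023consistency}; a sketch, with `f(x, sigma)` denoting the learned consistency function:

```python
import torch

def multistep_consistency_sample(f, x_T, sigmas, sigma_min=0.002):
    # sigmas: decreasing noise levels, starting at the maximum sigma.
    x = f(x_T, sigmas[0])                     # one-step estimate from pure noise
    for sigma in sigmas[1:]:
        z = torch.randn_like(x)
        x_noisy = x + (sigma**2 - sigma_min**2) ** 0.5 * z   # re-noise
        x = f(x_noisy, sigma)                 # map back to the data manifold
    return x
```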

697Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback

[openreview] [pdf]

Abstract Learning from human feedback plays an important role in aligning generative models, such as large language models (LLMs). However, the effectiveness of this approach can be influenced by adversaries, who may intentionally provide misleading preferences to manipulate the output in an undesirable or harmful direction. To tackle this challenge, we study a specific model within this problem domain--contextual dueling bandits with adversarial feedback, where the true preference label can be flipped by an adversary. We propose an algorithm, robust contextual dueling bandits (\algo), which is based on uncertainty-weighted maximum likelihood estimation. Our algorithm achieves an $\tilde{O}(d\sqrt{T}+dC)$ regret bound, where $T$ is the number of rounds, $d$ is the dimension of the context, and $0 \le C \le T$ is the total number of adversarial feedback. We also prove a lower bound to show that our regret bound is nearly optimal, both in scenarios with and without ($C=0$) adversarial feedback. To the best of our knowledge, our work is the first to achieve nearly minimax optimal regret for dueling bandits in the presence of adversarial preference feedback. Additionally, we conduct experiments to evaluate our proposed algorithm against various types of adversarial feedback. Experimental results demonstrate its superiority over the state-of-the-art dueling bandit algorithms in the presence of adversarial feedback.

698Bidirectional Consistency Models

[openreview] [pdf]

Abstract Diffusion models (DMs) are capable of generating remarkably high-quality samples by iteratively denoising a random vector, a process that corresponds to moving along the probability flow ordinary differential equation (PF ODE). Interestingly, DMs can also invert an input image to noise by moving backward along the PF ODE, a key operation for downstream tasks such as interpolation and image editing. However, the iterative nature of this process restricts its speed, hindering its broader application. Recently, Consistency Models (CMs) have emerged to address this challenge by approximating the integral of the PF ODE, largely reducing the number of iterations. Yet, the absence of an explicit ODE solver complicates the inversion process. To resolve this, we introduce the Bidirectional Consistency Model (BCM), which learns a single neural network that enables both forward and backward traversal along the PF ODE, efficiently unifying generation and inversion tasks within one framework. We can train BCM from scratch or tune it using a pre-trained consistency model, which reduces the training cost and increases scalability. We demonstrate that BCM enables one-step generation and inversion while also allowing the use of additional steps to enhance generation quality or reduce reconstruction error. We further showcase BCM’s capability in downstream tasks, such as interpolation, inpainting, and blind restoration of compressed images. Notably, when the number of function evaluations (NFE) is constrained, BCM surpasses domain-specific restoration methods, such as I2SB and Palette, in a fully zero-shot manner, offering an efficient alternative for inversion problems.

699Feedback Favors the Generalization of Neural ODEs

[openreview] [pdf]

Abstract The well-known generalization problem hinders the application of artificial neural networks in continuous-time prediction tasks with varying latent dynamics. In sharp contrast, biological systems can neatly adapt to evolving environments, benefiting from real-time feedback mechanisms. Inspired by this feedback philosophy, we present feedback neural networks, showing that a feedback loop can flexibly correct the learned latent dynamics of neural ordinary differential equations (neural ODEs), leading to a prominent generalization improvement. The feedback neural network is a novel two-DOF neural network, which possesses robust performance in unseen scenarios with no loss of accuracy on previous tasks. A linear feedback form with a convergence guarantee is first presented to correct the learned latent dynamics; domain randomization is then utilized to learn a nonlinear neural feedback form. Finally, extensive tests, including trajectory prediction of a real irregular object and model predictive control of a quadrotor with various uncertainties, are implemented, indicating significant improvements over state-of-the-art model-based and learning-based methods.
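
The linear feedback form admits a compact sketch: the learned vector field is corrected toward the observed state with a gain matrix, and a suitable gain contracts the prediction error, which is the intuition behind the stated convergence guarantee. Names below are ours; the paper's nonlinear neural feedback replaces the fixed gain:

```python
def feedback_ode_rhs(f_learned, K, x, t, y_obs):
    # dx/dt = f(x, t) + K (y(t) - x): the feedback term pulls the state toward
    # the observation y(t), correcting imperfect learned dynamics f.
    return f_learned(x, t) + K @ (y_obs(t) - x)
```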

700Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer

[openreview] [pdf]

Abstract Decision Transformer (DT) plays a crucial role in modern reinforcement learning, leveraging offline datasets to achieve impressive results across various domains. However, DT requires high-quality, comprehensive data to perform optimally. In real-world applications, such ideal data is often lacking, with the underrepresentation of optimal behaviours posing a significant challenge. This limitation highlights the difficulty of relying on offline datasets for training, as suboptimal data can hinder performance. To address this, we propose the Counterfactual Reasoning Decision Transformer (CRDT), a novel framework inspired by counterfactual reasoning. CRDT enhances DT’s ability to reason beyond known data by generating and utilizing counterfactual experiences, enabling improved decision-making in out-of-distribution scenarios. Extensive experiments across continuous and discrete action spaces, including environments with limited data, demonstrate that CRDT consistently outperforms conventional DT approaches. Additionally, counterfactual reasoning equips the DT agent with stitching ability, allowing it to combine suboptimal trajectories. These results highlight the potential of counterfactual reasoning to enhance RL agents’ performance and generalization capabilities.

701Adversarial Policy Optimization for Preference-based Reinforcement Learning

[openreview] [pdf]

Abstract In this paper, we study offline preference-based reinforcement learning (PbRL), where learning is based on pre-collected preference feedback over pairs of trajectories. While offline PbRL has demonstrated remarkable empirical success, existing theoretical approaches face challenges in ensuring conservatism under uncertainty, requiring computationally intractable confidence set constructions. We address this limitation by proposing Adversarial Preference-based Policy Optimization (APPO), a computationally efficient algorithm for offline PbRL that guarantees sample complexity bounds without relying on explicit confidence sets. By framing PbRL as a two-player game between a policy and a model, our approach enforces conservatism in a tractable manner. Using standard assumptions on function approximation and bounded trajectory concentrability, we derive a sample complexity bound. To our knowledge, APPO is the first offline PbRL algorithm to offer both statistical efficiency and practical applicability. Experimental results on continuous control tasks demonstrate that APPO effectively learns from complex datasets, showing performance comparable to existing state-of-the-art methods.

702Qihoo-T2X: An Efficient Proxy-Tokenized Diffusion Transformer for Text-to-Any-Task

[openreview] [pdf]

Abstract The global self-attention mechanism in diffusion transformers involves redundant computation due to the sparse and redundant nature of visual information, and the attention maps of tokens within a spatial window show significant similarity. To address this redundancy, we propose the Proxy-Tokenized Diffusion Transformer (PT-DiT), which employs sparse representative token attention (where the number of representative tokens is much smaller than the total number of tokens) to model global visual information efficiently. Specifically, within each transformer block, we compute an average token from each spatial-temporal window to serve as a proxy token for that region. The global semantics are captured through the self-attention of these proxy tokens and then injected into all latent tokens via cross-attention. Simultaneously, we introduce window and shift-window attention to address the limitations in detail modeling caused by the sparse attention mechanism. Building on the well-designed PT-DiT, we further develop the Qihoo-T2X family, which includes a variety of models for T2I, T2V, and T2MV tasks. Experimental results show that PT-DiT achieves competitive performance while reducing the computational complexity in both image and video generation tasks (e.g., a 49% reduction compared to DiT and a 34% reduction compared to PixArt-α). The visual exhibition of Qihoo-T2X is available at https://qihoo-t2x.github.io/.

703Learning Utilities from Demonstrations in Markov Decision Processes

[openreview] [pdf]

Abstract Our goal is to extract useful knowledge from demonstrations of behavior in sequential decision-making problems. Although it is well-known that humans commonly engage in risk-sensitive behaviors in the presence of stochasticity, most Inverse Reinforcement Learning (IRL) models assume a risk-neutral agent. Beyond introducing model misspecification, these models do not directly capture the risk attitude of the observed agent, which can be crucial in many applications. In this paper, we propose a novel model of behavior in Markov Decision Processes (MDPs) that explicitly represents the agent’s risk attitude through a utility function. We then define the Utility Learning (UL) problem as the task of inferring the observed agent’s risk attitude, encoded via a utility function, from demonstrations in MDPs, and we analyze the partial identifiability of the agent’s utility. Furthermore, we devise two provably efficient algorithms for UL in a finite-data regime, and we analyze their sample complexity. We conclude with proof-of-concept experiments that empirically validate both our model and our algorithms.

704FOSP: Fine-tuning Offline Safe Policy through World Models

[openreview] [pdf]

Abstract Offline Safe Reinforcement Learning (RL) seeks to address safety constraints by learning from static datasets and restricting exploration. However, these approaches heavily rely on the dataset and struggle to generalize to unseen scenarios safely. In this paper, we aim to improve safety during the deployment of vision-based robotic tasks through online fine-tuning of an offline pretrained policy. To facilitate effective fine-tuning, we introduce model-based RL, which is known for its data efficiency. Specifically, our method employs in-sample optimization to improve offline training efficiency while incorporating reachability guidance to ensure safety. After obtaining an offline safe policy, a safe policy expansion approach is leveraged for online fine-tuning. The performance of our method is validated on simulation benchmarks with five vision-only tasks and through real-world robot deployment using limited data. It demonstrates that our approach significantly improves the generalization of offline policies to unseen safety-constrained scenarios. To the best of our knowledge, this is the first work to explore offline-to-online RL for safe generalization tasks. The videos are available at https://sites.google.com/view/safefinetune/home.

705Deconstructing Denoising Diffusion Models for Self-Supervised Learning

[openreview] [pdf]

Abstract In this study, we examine the representation learning abilities of Denoising Diffusion Models (DDM) that were originally purposed for image generation. Our philosophy is to deconstruct a DDM, gradually transforming it into a classical Denoising Autoencoder (DAE). This deconstructive process allows us to explore how various components of modern DDMs influence self-supervised representation learning. We observe that only a very few modern components are critical for learning good representations, while many others are nonessential. Our study ultimately arrives at an approach that is highly simplified and to a large extent resembles a classical DAE. We hope our study will rekindle interest in a family of classical methods within the realm of modern self-supervised learning.

706Efficient Diffusion Models for Symmetric Manifolds

[openreview] [pdf]

Abstract We present a framework for designing efficient diffusion models on symmetric Riemannian manifolds, which include the torus, sphere, special orthogonal group, and unitary group. While diffusion models on symmetric manifolds have gained significant attention, existing approaches often rely on the manifolds’ heat kernels, which lack closed-form expressions and result in exponential-in-dimension per-iteration runtimes during training. We introduce a new diffusion model for symmetric-space manifolds, leveraging a projection of Euclidean Brownian motion to bypass explicit heat kernel computations. Our training algorithm minimizes a novel objective function derived via Ito’s Lemma, with efficiently computable gradients, allowing each iteration to run in polynomial time for symmetric manifolds. Additionally, the symmetries of the manifold ensure the diffusion satisfies an “average-case” Lipschitz condition, enabling accurate and efficient sample generation. These improvements enhance both the training runtime and sample accuracy for key cases of symmetric manifolds, helping to bridge the gap between diffusion models on symmetric manifolds and Euclidean space.

707Pretraining Decision Transformers with Reward Prediction for In-Context Structured Bandit Learning

[openreview] [pdf]

Abstract In this paper, we study the multi-task structured bandit problem where the goal is to learn a near-optimal algorithm that minimizes cumulative regret. The tasks share a common structure, and the algorithm exploits the shared structure to minimize the cumulative regret for an unseen but related test task. We use a transformer as a decision-making algorithm to learn this shared structure so as to generalize to the test task. Prior work on pretrained decision transformers, such as \dpt, requires access to the optimal action during training, which may be hard to obtain in several scenarios. Diverging from these works, our learning algorithm does not need the knowledge of the optimal action per task during training but predicts a reward vector for each of the actions using only the observed offline data from the diverse training tasks. Finally, during inference time, it selects actions using the reward predictions, employing various exploration strategies in-context for an unseen test task. We show that our model outperforms other SOTA methods such as \dpt\ and Algorithmic Distillation (\ad) over a series of experiments on several structured bandit problems (linear, bilinear, latent, non-linear). Interestingly, we show that our algorithm, without the knowledge of the underlying problem structure, can learn a near-optimal policy in-context by leveraging the shared structure across diverse tasks. We further extend the field of pre-trained decision transformers by showing that they can leverage unseen tasks with new actions and still learn the underlying latent structure to derive a near-optimal policy. We validate this over several experiments to show that our proposed solution is very general and has wide applications to potentially emergent online and offline strategies at test time. Finally, we theoretically analyze the performance of our algorithm and obtain generalization bounds in the in-context multi-task learning setting.

708Is Memorization Actually Necessary for Generalization?

[openreview] [pdf]

Abstract Memorization is the ability of deep models to associate training data with seemingly random labels. Even though memorization may not align with a model’s ability to generalize, recent work by \citet{feldman2020longtail} has demonstrated that memorization is in fact necessary for generalization. However, upon closer inspection, we find that their methodology has three limitations. First, the definition of memorization is imprecise, leading to contradictory results. Second, their proposed algorithm for approximating the leave-one-out test (the gold standard for calculating memorization scores) suffers from a high approximation error. Third, the authors induce a distribution shift when calculating marginal utility, leading to flawed results. Having accounted for these errors, we re-evaluate the role of memorization in generalization. We show that most memorization thresholds (the value that dictates whether a point is memorized or not) do not have a statistically significant impact on model accuracy, contrary to what was previously reported. In light of these findings, future researchers are encouraged to design techniques that can accurately approximate memorization scores.

709Enhancing Graph Invariant Learning from a Negative Inference Perspective

[openreview] [pdf]

Abstract The out-of-distribution (OOD) generalization challenge is a longstanding problem in graph learning. Through studying the fundamental cause of data distribution shift, i.e., the changes of environments, significant progress has been achieved in addressing this issue. However, we observe that existing works still fail to effectively address complex environment shifts. Previous practices place excessive attention on extracting causal subgraphs, inevitably treating spurious subgraphs as environment variables. While spurious subgraphs are controlled by environments, the space of environment changes encompasses more than the scale of spurious subgraphs. Therefore, existing efforts have a limited inference space for environments, leading to failure under severe environment changes. To tackle this issue, we propose a negative inference graph OOD framework (NeGo) to broaden the inference space for environment factors. Inspired by the successful practice of prompt learning in capturing underlying semantics and causal associations in large language models, we design a negative prompt environment inference to extract underlying environment information. We further introduce the environment-enhanced invariant subgraph learning method to effectively exploit the inferred environment embedding, ensuring the robust extraction of causal subgraphs under environment shifts. Lastly, we conduct a comprehensive evaluation of NeGo on real-world datasets and synthetic datasets across domains. NeGo outperforms baselines on nearly all datasets, verifying the effectiveness of our framework. Our source code is available at \url{https://anonymous.4open.science/r/NeGo-E4C1}.

710Fixing Data Augmentations for Out-of-distribution Detection

[openreview] [pdf]

Abstract Out-of-distribution (OOD) detection methods, especially post-hoc methods, rely on off-the-shelf pre-trained models. Existing literature shows how OOD and ID performance are correlated, i.e. stronger models with better ID performance tend to perform better in OOD detection. However, significant performance discrepancies exist between model versions, sometimes exceeding the impact of the OOD detection methods themselves. In this study, we systematically investigated this issue and identified two main factors—label smoothing and mixup—that, while improving in-distribution accuracy, lead to a decline in OOD detection performance. We provide empirical and theoretical explanations for this phenomenon and propose a solution that enhances OOD Detection while maintaining strong in-distribution performance. Code will be released upon acceptance.

711Single-Step Diffusion Model-Based Generative Model Inversion Attacks

[openreview] [pdf]

Abstract Generative model inversion attacks (MIAs) have garnered increasing attention for their ability to reconstruct synthetic samples that closely resemble private training data, exposing significant privacy risks in machine learning models. The success of generative MIAs is primarily attributed to image priors learned by generative adversarial networks (GANs) on public auxiliary data, which help constrain the optimization space during the inversion process. However, GAN-based generative MIAs still face limitations, particularly regarding the instability during model inversion optimization and the fidelity of reconstructed samples, indicating substantial room for improvement. In this paper, we address these challenges by exploring generative MIAs based on diffusion models, which offer superior generative performance compared to GANs. Specifically, we replace the GAN generator in existing generative MIAs with a single-step generator distilled from pretrained diffusion models, constraining the search space to the manifold of the generator during the inversion process. In addition, we leverage generative model inversion techniques to investigate privacy leakage issues in widely used large-scale multimodal models, particularly CLIP, highlighting the inherent privacy risks in these models. Our extensive experiments demonstrate that single-step diffusion model-based MIAs significantly outperform their GAN-based counterparts, achieving substantial improvements in traditional metrics and greatly enhancing the visual fidelity of reconstructed samples. This research uncovers vulnerabilities in CLIP models and opens new research directions in generative MIAs.

712Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?

[openreview] [pdf]

Abstract Reward Models (RMs) are crucial for aligning language models with human preferences. Currently, the evaluation of RMs depends on measuring accuracy against a validation set of manually annotated preference data. Although this method is straightforward and widely adopted, the relationship between RM accuracy and downstream policy performance remains under-explored. In this work, we conduct experiments in a synthetic setting to investigate how differences in RM measured by accuracy translate into gaps in optimized policy performance. Our findings reveal that while there is a weak positive correlation between accuracy and downstream performance, policies optimized towards RMs with similar accuracy can exhibit quite different performance. Moreover, we discover that the way of measuring accuracy significantly impacts its ability to predict the final policy performance. Through the lens of Regressional Goodhart’s effect, we identify the existence of exogenous variables impacting the relationship between RM quality measured by accuracy and policy model capability. This underscores the inadequacy of relying solely on accuracy to reflect their impact on policy optimization.

713On the Relation Between Linear Diffusion and Power Iteration

[openreview] [pdf]

Abstract Recently, diffusion models have gained popularity due to their impressive generative abilities. These models learn the implicit distribution given by the training dataset, and sample new data by transforming random noise through the reverse process, which can be thought of as gradual denoising. In this work, we examine the generation process as a “correlation machine”, where random noise is repeatedly enhanced in correlation with the implicit given distribution. To this end, we explore the linear case, where the optimal denoiser is known to be the PCA projection. This enables us to connect the theory of diffusion models to the spiked covariance model, where the dependence of the denoiser on the noise level and the amount of training data can be expressed analytically, in the rank-1 case. In a series of numerical experiments, we extend this result to general low-rank data, and show that low frequencies emerge earlier in the generation process, where the denoising basis vectors are more aligned to the true data with a rate depending on their eigenvalues. This model allows us to show that the linear diffusion model converges in mean to the leading eigenvector of the underlying data, similarly to the prevalent Power Iteration method. Finally, we empirically demonstrate the applicability of our findings beyond the linear case, in the Jacobians of a deep, non-linear denoiser used in general image generation tasks.
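
For readers unfamiliar with the reference point, power iteration converges to the leading eigenvector by repeatedly applying the matrix, the same fixed point the linear diffusion model is shown to approach in mean:

```python
import numpy as np

def power_iteration(A, n_iter=100, seed=0):
    # Repeatedly apply A and renormalize; the iterate aligns with the
    # eigenvector of the largest-magnitude eigenvalue.
    v = np.random.default_rng(seed).standard_normal(A.shape[0])
    for _ in range(n_iter):
        v = A @ v
        v /= np.linalg.norm(v)
    return v
```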

714On Extending Direct Preference Optimization to Accommodate Ties

[openreview] [pdf]

Abstract We derive and investigate two DPO variants that explicitly model the possibility of declaring a tie in pair-wise comparisons. We replace the Bradley-Terry model in DPO with two well-known modeling extensions, by Rao and Kupper and by Davidson, that assign probability to ties as alternatives to clear preferences. Our experiments in neural machine translation and summarization show that explicitly labeled ties can be added to the datasets for these DPO variants without the degradation in task performance that is observed when the same tied pairs are presented to DPO. We find empirically that the inclusion of ties leads to stronger regularization with respect to the reference policy as measured by KL divergence, and we see this even for DPO in its original form. These findings motivate and enable the inclusion of tied pairs in preference optimization as opposed to simply discarding them.
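
For concreteness, the Rao-Kupper model (one of the two extensions used) assigns tie probability via a threshold parameter ν ≥ 1, with ν = 1 recovering Bradley-Terry; a small sketch of the three outcome probabilities from two reward scores:

```python
import numpy as np

def rao_kupper_probs(s_a, s_b, nu=1.5):
    # Strengths are exp(score); nu >= 1 widens the tie region (nu = 1 is BT).
    pa, pb = np.exp(s_a), np.exp(s_b)
    p_a_wins = pa / (pa + nu * pb)
    p_b_wins = pb / (pb + nu * pa)
    p_tie = 1.0 - p_a_wins - p_b_wins
    return p_a_wins, p_b_wins, p_tie
```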

715One to All: Individual Reweighting for User-Oriented Fairness in Recommender Systems

[openreview] [pdf]

Abstract Recommender systems often manifest biases toward a small user group, resulting in pronounced disparities in recommendation performance, i.e., the User-Oriented Fairness (UOF) issue. Existing research on UOF faces three major limitations, and no single approach effectively addresses all of them. Limitation 1: Post-processing methods fail to address the root cause of the UOF issue. Limitation 2: Some in-processing methods rely heavily on unstable user similarity calculations under severe data sparsity problems. Limitation 3: Other in-processing methods overlook the disparate treatment of individual users within user groups. In this paper, we propose a novel Individual Reweighting for User-Oriented Fairness framework, namely IR-UOF, to address all the aforementioned limitations. IR-UOF serves as a versatile solution applicable across various backbone recommendation models to achieve UOF. The motivation behind IR-UOF is to introduce an in-processing strategy that addresses the UOF issue at the individual level without the need to explore user similarities. We conduct extensive experiments on three real-world datasets using four backbone recommendation models to demonstrate the effectiveness of IR-UOF in mitigating UOF and improving recommendation fairness.

716A Causal Theoretical Framework for Open Set Domain Adaptation

[openreview] [pdf]

Abstract Open Set Domain Adaptation (OSDA) faces two critical challenges: the emergence of unknown classes in the target domain and changes in observed distributions across domains. Although numerous studies have proposed advanced algorithms, recent experimental results demonstrate that the classical Empirical Risk Minimization (ERM) approach still delivers state-of-the-art performance. However, few theories can effectively explain this disputed phenomenon. To address the theoretical gap, we focus on constructing a causal theoretical framework for OSDA. We formulate the novel concepts of the Fully Informative Causal Invariance Model (FICIM) and the Partially Informative Causal Invariance Model (PICIM). Subsequently, we derive an OSDA theoretical bound to prove that ERM performs well when the source domain follows FICIM, while it performs poorly when the source domain follows PICIM. The different results may be attributed to the varying amounts of available information when bounding the target domain’s stable expected risk. Finally, across different datasets, we conduct extensive experiments on the FICIM and PICIM source domains to validate the effectiveness of our theoretical results.

717Spatial-aware decision-making with ring attractors in Reinforcement Learning systems

[openreview] [pdf]

Abstract This paper explores the integration of ring attractors, a mathematical model inspired by neural circuit dynamics, into the reinforcement learning (RL) action selection process. Ring attractors, as specialized brain-inspired structures that encode spatial information and uncertainty, offer a biologically plausible mechanism to improve learning speed and predictive performance. They do so by explicitly encoding the action space, facilitating the organization of neural activity, and enabling the distribution of spatial representations across the neural network in the context of deep RL. The application of ring attractors in the RL action selection process involves mapping actions to specific locations on the ring and decoding the selected action based on neural activity. We investigate the application of ring attractors by both building them as exogenous models and integrating them as part of a Deep Learning policy algorithm. Our results show a significant improvement over state-of-the-art models on the Atari 100k benchmark. Notably, our integrated approach improves the performance of state-of-the-art models by roughly half, a 53% increase over selected baselines.
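
A hedged sketch of ring-attractor action selection as described: actions are mapped to equally spaced angles on a ring, each action's value injects a bump of activity at its angle, and the chosen action is decoded from the peak of the summed activity. The recurrent attractor dynamics are abstracted into a feed-forward readout here, and all names are illustrative:

```python
import numpy as np

def ring_attractor_select(q_values, kappa=3.0, resolution=360):
    q = np.asarray(q_values, dtype=float)
    angles = 2 * np.pi * np.arange(len(q)) / len(q)   # actions on the ring
    grid = np.linspace(0, 2 * np.pi, resolution, endpoint=False)
    bumps = np.exp(kappa * np.cos(grid[None, :] - angles[:, None]))
    activity = (q[:, None] * bumps).sum(axis=0)       # population activity
    peak = grid[np.argmax(activity)]                  # decoded direction
    circ_dist = np.abs(np.angle(np.exp(1j * (angles - peak))))
    return int(np.argmin(circ_dist))                  # nearest action
```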

718CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models

[openreview] [pdf]

Abstract Classifier-free guidance (CFG) is a fundamental tool in modern diffusion models for text-guided generation. Although effective, CFG has notable drawbacks. For instance, DDIM with CFG lacks invertibility, complicating image editing; furthermore, high guidance scales, essential for high-quality outputs, frequently result in issues like mode collapse. Contrary to the widespread belief that these are inherent limitations of diffusion models, this paper reveals that the problems actually stem from the off-manifold phenomenon associated with CFG, rather than from the diffusion models themselves. More specifically, inspired by the recent advancements of diffusion model-based inverse problem solvers (DIS), we reformulate text-guidance as an inverse problem with a text-conditioned score matching loss and develop CFG++, a novel approach that tackles the off-manifold challenges inherent in traditional CFG. CFG++ features a surprisingly simple fix to CFG, yet it offers significant improvements, including better sample quality for text-to-image generation, invertibility, smaller guidance scales, and more. Furthermore, CFG++ enables seamless interpolation between unconditional and conditional sampling at lower guidance scales, consistently outperforming traditional CFG at all scales. Moreover, CFG++ can be easily integrated into high-order diffusion solvers and naturally extends to distilled diffusion models. Experimental results confirm that our method significantly enhances performance in text-to-image generation, DDIM inversion, editing, and solving inverse problems, suggesting a wide-ranging impact and potential applications in various fields that utilize text guidance. Project Page: https://cfgpp-diffusion.github.io/anon

719Efficient Perplexity Bound and Ratio Matching in Discrete Diffusion Language Models

[openreview] [pdf]

Abstract While continuous diffusion models excel in modeling continuous distributions, their application to categorical data has been less effective. Recent work has shown that ratio-matching through score-entropy within a continuous-time discrete Markov chain (CTMC) framework serves as a competitive alternative to autoregressive models in language modeling. To enhance this framework, we first introduce three new theorems concerning the KL divergence between the data and learned distribution. Our results serve as the discrete counterpart to those established for continuous diffusion models and allow us to derive an improved upper bound on the perplexity. Second, we empirically show that ratio-matching performed by minimizing the denoising cross-entropy between the clean and corrupted data enables models to outperform those utilizing score-entropy with up to 10% lower perplexity/generative-perplexity, and 15% faster training steps. To further support our findings, we introduce and evaluate a novel CTMC transition-rate matrix that allows prediction refinement, and derive the analytic expression for its matrix exponential, which facilitates the computation of conditional ratios, thus enabling efficient training and generation.

720Elephant in the Room: Unveiling the Pitfalls of Human Proxies in Alignment

[openreview] [pdf]

Abstract The demand for regulating the behavior of large language models (LLMs) has ignited research on alignment algorithms, the essence of which is to align LLMs’ generations with human preferences. Due to the infeasibility of humans directly participating in the training or generation of LLMs, existing alignment algorithms choose to align with human preferences carried by proxies, i.e., preference data or reward models. However, whether these human proxies faithfully represent human preferences remains under-explored. We categorize human proxies into two levels based on the degree to which they directly embody human preferences: Level-1 Proxy (preference data) and Level-2 Proxy (reward models). We empirically examine the faithfulness of both levels of proxies and its impacts on alignment performance. We notice that current algorithms tend to overlook the faithfulness of these proxies in reflecting human preferences; many works even directly use reward models as their automatic evaluators without any correlation verification. Current literature on alignment overly focuses on optimizing algorithms, rendering the faithfulness of human proxies an “elephant in the room”—something extremely important yet largely overlooked. According to experimental results, we unveil potential risks of using inferior “human proxies”, aiming to draw attention to this huge “elephant” in alignment research. We summarize existing pitfalls from different angles and provide a re-labeled preference dataset and insights about reward model usage to facilitate the healthy development of alignment. (This work contains examples that potentially implicate stereotypes, associations, and other harms that could be offensive to individuals in certain social groups.)

721Looking into User’s Long-term Interests through the Lens of Conservative Evidential Learning

[openreview] [pdf]

Abstract Reinforcement learning (RL) has been increasingly employed in modern recommender systems to capture users’ evolving preferences, leading to continuously improved recommendations. In this paper, we propose a novel evidential conservative Q-learning framework (ECQL) that learns an effective and conservative recommendation policy by integrating evidence-based uncertainty and conservative learning. ECQL conducts evidence-aware explorations to discover items that are located beyond current observations but reflect users’ long-term interests. It offers an uncertainty-aware conservative view on policy evaluation to discourage deviating too much from users’ current interests. Two central components of ECQL include a uniquely designed sequential state encoder and a novel conservative evidential-actor-critic (CEAC) module. The former generates the current state of the environment by aggregating historical information and a sliding window that contains the current user interactions as well as newly recommended items from RL exploration that may represent future interests. The latter performs an evidence-based rating prediction by maximizing the conservative evidential Q-value and leverages an uncertainty-aware ranking score to explore the item space for a more diverse and valuable recommendation. Experiments on multiple real-world dynamic datasets demonstrate the state-of-the-art performance of ECQL and its capability to capture users’ long-term interests.

722Latent Trajectory: A New Framework for Actor-Critic Reinforcement Learning with Uncertainty Quantification

[openreview] [pdf]

Abstract Uncertainty quantification for deep neural networks is crucial for building reliable modern AI models. This challenge is particularly pronounced in deep reinforcement learning, where agents continuously learn from their interactions with stochastic environments, and the uncertainty of the value function is a key concern for ensuring reliable and robust RL applications. The complexity increases in actor-critic methods, as the training process alternates between optimizing the actor and critic networks, and this alternating optimization makes the uncertainty of the value function hard to quantify. To address this issue, we introduce a novel approach to RL training that conceptualizes transition trajectories as latent variables. Building on this framework, we propose an adaptive Stochastic Gradient Markov Chain Monte Carlo (SGMCMC) algorithm for training deep actor-critic models. This new training method allows for the implicit integration of latent transition trajectories, resulting in a trajectory-independent training process. We provide theoretical guarantees for the convergence of our algorithm and offer empirical evidence showing improvements in both performance and robustness of the deep actor-critic model under our Latent Trajectory Framework (LTF). Furthermore, this framework enables accurate uncertainty quantification for the value function of the RL system, paving the way for more reliable and robust RL applications.

723Diverse Preference Learning for Capabilities and Alignment

[openreview] [pdf]

Abstract As LLMs increasingly impact society, their ability to represent diverse perspectives is critical. However, recent studies reveal that alignment algorithms such as RLHF and DPO significantly reduce the diversity of LLM outputs. Not only do aligned LLMs generate text with repetitive structure and word choice, they also approach problems in more uniform ways, and their responses reflect a narrower range of societal perspectives. We attribute this problem to the KL divergence regularizer employed in preference learning algorithms. This causes the model to overweight majority opinions and sacrifice diversity in exchange for optimal reward. To address this, we propose Diverse Preference Learning, which decouples the entropy and cross-entropy terms in the KL penalty — allowing for fine-grained control over LLM generation diversity. From a capabilities perspective, LLMs trained using Diverse Preference Learning attain higher accuracy on difficult repeated sampling tasks and produce outputs with greater semantic and lexical diversity. From an alignment perspective, they are capable of representing a wider range of societal viewpoints and display improved logit calibration. Notably, Diverse Preference Learning resembles, but is a Pareto improvement over, standard temperature scaling.
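
The decoupling rests on the identity KL(π‖π_ref) = −H(π) + CE(π, π_ref), so weighting the two terms separately exposes an explicit diversity knob; equal weights recover the standard KL penalty. A sketch over one token distribution (coefficient names are ours, not the paper's notation):

```python
import torch
import torch.nn.functional as F

def decoupled_kl_penalty(logits, ref_logits, alpha=0.1, beta=0.1):
    logp = F.log_softmax(logits, dim=-1)
    ref_logp = F.log_softmax(ref_logits, dim=-1)
    p = logp.exp()
    entropy = -(p * logp).sum(-1)          # H(pi): raise alpha for diversity
    cross_ent = -(p * ref_logp).sum(-1)    # CE(pi, pi_ref): stay near reference
    return (-alpha * entropy + beta * cross_ent).mean()  # alpha == beta => beta * KL
```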

724Utilizing Explainable Reinforcement Learning to Improve Reinforcement Learning: A Theoretical and Systematic Framework

[openreview] [pdf]

Abstract Reinforcement learning (RL) faces two challenges: (1) The RL agent lacks explainability. (2) The trained RL agent is, in many cases, non-optimal and even far from optimal. To address the first challenge, explainable reinforcement learning (XRL) is proposed to explain the decision-making of the RL agent. In this paper, we demonstrate that XRL can also be used to address the second challenge, i.e., improve RL performance. Our method has two parts. The first part provides a two-level explanation for why the RL agent is not optimal by identifying the mistakes made by the RL agent. Since this explanation includes the mistakes of the RL agent, it has the potential to help correct the mistakes and thus improve RL performance. The second part formulates a constrained bi-level optimization problem to learn how to best utilize the two-level explanation to improve RL performance. Specifically, the upper level learns how to use the high-level explanation to shape the reward so that the corresponding policy can maximize the cumulative ground truth reward, and the lower level learns the corresponding policy by solving a constrained RL problem formulated using the low-level explanation. We propose a novel algorithm to solve this constrained bi-level optimization problem, and theoretically guarantee that the algorithm attains global optimality. We use MuJoCo experiments to show that our method outperforms state-of-the-art baselines.

725Thinking Forward and Backward: Effective Backward Planning with Large Language Models

[openreview] [pdf]

Abstract Large language models (LLMs) have exhibited remarkable reasoning and planning capabilities. Most prior work in this area has used LLMs to reason through steps from an initial to a goal state or criterion, thereby effectively reasoning in a forward direction. Nonetheless, many planning problems exhibit an inherent asymmetry such that planning backward from the goal is significantly easier --- for example, if there are bottlenecks close to the goal. We take inspiration from this observation and demonstrate that this bias holds for LLM planning as well: planning performance in one direction correlates with the planning complexity of the problem in that direction. However, our experiments also reveal systematic biases which lead to poor planning in the backward direction. With this knowledge, we propose a backward planning algorithm for LLMs that first flips the problem and then plans forward in the flipped problem. This helps avoid the backward bias, generate more diverse candidate plans, and exploit asymmetries between the forward and backward directions in planning problems --- we find that combining planning in both directions with self-verification improves the overall planning success rates by 4-24% in three planning domains.

726Task Characteristic and Contrastive Contexts for Improving Generalization in Offline Meta-Reinforcement Learning

[openreview] [pdf]

Abstract Context-based offline meta-reinforcement learning (meta-RL) methods typically extract contexts summarizing task information from historical trajectories to achieve adaptation to unseen target tasks. Nevertheless, previous methods may lack generalization and suffer from ineffective adaptation. Our key insight to counteract this issue is that they fail to capture both task characteristic and task contrastive information when generating contexts. In this work, we propose a framework called task characteristic and contrastive contexts for offline meta-RL (TCMRL), which consists of a task characteristic extractor and a task contrastive loss. More specifically, the task characteristic extractor aims at identifying transitions within a trajectory, that are characteristic of a task, when generating contexts. Meanwhile, the task contrastive loss favors the learning of task information that distinguishes tasks from one another by considering interrelations among transitions of trajectory subsequences. Contexts that include both task characteristic and task contrastive information provide a comprehensive understanding of the tasks themselves and implicit relationships among tasks. Experiments in meta-environments show the superiority of TCMRL over previous offline meta-RL methods in generating more generalizable contexts, and achieving efficient and effective adaptation to unseen target tasks.

727CADO: Cost-Aware Diffusion Models for Combinatorial Optimization via RL Fine-tuning

[openreview] [pdf]

Abstract Recent advancements in Machine Learning (ML) have demonstrated significant potential in addressing Combinatorial Optimization (CO) problems through data-driven approaches. Heatmap-based methods, which generate solution heatmaps in a single step and employ an additional decoder to derive solutions for CO tasks, have shown promise due to their scalability for large-scale problems. Traditionally, these complex models are trained using imitation learning with optimal solutions, often leveraging diffusion models. However, our research has identified several limitations inherent in these imitation learning approaches within the context of CO tasks. To overcome these challenges, we propose a 2-phase training framework for diffusion models in CO, incorporating Reinforcement Learning (RL) fine-tuning. Our methodology integrates cost information and the post-process decoder into the training process, thereby enhancing the solver’s capacity to generate effective solutions. We conducted extensive experiments on well-studied combinatorial optimization problems, specifically the Traveling Salesman Problem (TSP) and Maximal Independent Set (MIS), ranging from small-scale instances to large-scale scenarios. The results demonstrate the significant efficacy of our RL fine-tuning framework, surpassing previous state-of-the-art methods in performance.

728DRoP: Distributionally Robust Pruning

[openreview] [pdf]

Abstract In the era of exceptionally data-hungry models, careful selection of the training data is essential to mitigate the extensive costs of deep learning. Data pruning offers a solution by removing redundant or uninformative samples from the dataset, which yields faster convergence and improved neural scaling laws. However, little is known about its impact on classification bias of the trained models. We conduct the first systematic study of this effect and reveal that existing data pruning algorithms can produce highly biased classifiers. We present theoretical analysis of the classification risk in a mixture of Gaussians to argue that choosing appropriate class pruning ratios, coupled with random pruning within classes has potential to improve worst-class performance. We thus propose DRoP, a distributionally robust approach to pruning and empirically demonstrate its performance on standard computer vision benchmarks. In sharp contrast to existing algorithms, our proposed method continues improving distributional robustness at a tolerable drop of average performance as we prune more from the datasets.
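
A minimal sketch of the pruning scheme the abstract describes: random pruning within each class, with per-class keep fractions supplied as input. Choosing those fractions robustly is the core of DRoP, which this sketch deliberately leaves abstract:

```python
import numpy as np

def prune_per_class(labels: np.ndarray, class_keep_fraction: dict, seed: int = 0) -> np.ndarray:
    """Return indices to keep: prune randomly *within* each class, but at a
    class-specific rate (e.g., keep more of the hardest classes). How the
    fractions are chosen is the method's contribution; here they are given."""
    rng = np.random.default_rng(seed)
    keep = []
    for c, frac in class_keep_fraction.items():
        idx = np.where(labels == c)[0]
        k = int(round(frac * len(idx)))
        keep.append(rng.choice(idx, size=k, replace=False))
    return np.sort(np.concatenate(keep))

# Usage with illustrative fractions: keep 90% of a hard class, 40% of an easy one.
labels = np.array([0, 0, 0, 0, 0, 1, 1, 1, 1, 1])
print(prune_per_class(labels, {0: 0.9, 1: 0.4}))
```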

729Seeking Global Flat Minima in Federated Domain Generalization via Constrained Adversarial Augmentation

[openreview] [pdf]

Abstract Federated domain generalization (FedDG) aims at equipping the federally trained model with the domain generalization ability when the model meets new clients with domain shifts. Among factors that possibly indicate generalization, the loss landscape flatness of the trained model is an intuitive, viable, and widely studied one. However, pursuing the flatness of the global model in the FedDG setting is not trivial due to the restriction to preserve data privacy. To address this issue, we propose GFM, a novel algorithm designed to seek Global Flat Minima of the global model. Specifically, GFM leverages a global model-constrained adversarial data augmentation strategy, creating a surrogate for global data within each local client, which allows for split sharpness-aware minimization to approach global flat minima. GFM is compatible with federated learning without compromising data privacy restrictions, and theoretical analysis further supports its rationality by demonstrating that the objective of GFM serves as an upper bound on the robust risk of the global model on global data distribution. Extensive experiments on multiple FedDG benchmarks demonstrate that GFM consistently outperforms previous FedDG and federated learning approaches.

730The Utility and Complexity of In- and Out-of-Distribution Machine Unlearning

[openreview] [pdf]

Abstract Machine unlearning, the process of selectively removing data from trained models, is increasingly crucial for addressing privacy concerns and knowledge gaps post-deployment. Despite this importance, existing approaches are often heuristic and lack formal guarantees. In this paper, we analyze the fundamental utility, time, and space complexity trade-offs of approximate unlearning, providing rigorous certification analogous to differential privacy. For in-distribution data, we show that a surprisingly simple and general procedure—empirical risk minimization with output perturbation—achieves tight unlearning-utility-complexity trade-offs, addressing a previous theoretical gap on the separation from unlearning "for free" via differential privacy. However, such techniques fail out-of-distribution, where unlearning time complexity can exceed that of retraining, even for a single sample. To address this, we propose a new robust and noisy gradient descent variant that provably amortizes unlearning time complexity without compromising utility.
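
A minimal sketch of the in-distribution recipe named above, where `fit` is a hypothetical ERM routine and `sigma` is a noise scale that would be calibrated to the desired certification level (the calibration itself is the paper's analysis, not shown here):

```python
import numpy as np

def unlearn_by_output_perturbation(fit, dataset, forget_idx, sigma: float) -> np.ndarray:
    """Certified-unlearning sketch: run empirical risk minimization on the
    retained data and perturb the resulting parameters with Gaussian noise,
    so the output is statistically close to what retraining would produce.
    `fit` (hypothetical) maps a dataset to a parameter vector."""
    forget = set(forget_idx)
    retained = [z for i, z in enumerate(dataset) if i not in forget]
    theta = fit(retained)                                  # empirical risk minimizer
    return theta + sigma * np.random.randn(*theta.shape)  # output perturbation
```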

731DiverseAgentEntropy: Quantifying Black-Box LLM Uncertainty through Diverse Perspectives and Multi-Agent Interaction

[openreview] [pdf]

Abstract Quantifying the uncertainty in the factual parametric knowledge of Large Language Models (LLMs), especially in a black-box setting, poses a significant challenge. Existing methods, which gauge a model’s uncertainty through evaluating self-consistency in responses to the original query, do not always capture true uncertainty. Models might respond consistently to the original query with a wrong answer, yet respond correctly to varied questions from different perspectives about the same query, and vice versa. In this paper, we propose a novel method, DiverseAgentEntropy, for evaluating a model’s uncertainty using multi-agent interaction under the assumption that if a model is certain, it should consistently recall the answer to the original query across a diverse collection of questions about the same original query. We further implement an abstention policy to withhold responses when uncertainty is high. Our method offers a more accurate prediction of the model’s reliability by detecting hallucinations, improving upon self-consistency-based uncertainty methods by 2.5%. Additionally, it demonstrates that existing models often fail to consistently retrieve the correct answer to the same query under diverse variations of the question.

732Dynamic Multi-product Selection and Pricing under Preference Feedback

[openreview] [pdf]

Abstract In this study, we investigate the problem of dynamic multi-product selection and pricing by introducing a novel framework based on a censored multinomial logit (C-MNL) choice model. In this model, sellers present a set of products with prices, and buyers filter out products priced above their valuation, purchasing at most one product from the remaining options based on their preferences. The goal is to maximize seller revenue by dynamically adjusting product offerings and prices, while learning both product valuations and buyer preferences through purchase feedback. To achieve this, we propose a Lower Confidence Bound (LCB) pricing strategy. By combining this pricing strategy with either an Upper Confidence Bound (UCB) or Thompson Sampling (TS) product selection approach, our algorithms achieve regret bounds of \tilde{O}(d^{\frac{3}{2}}\sqrt{T}) and \tilde{O}(d^{2}\sqrt{T}), respectively. Finally, we validate the performance of our methods through simulations, demonstrating their effectiveness.

733Q-based Variational Inverse Reinforcement Learning

[openreview] [pdf]

Abstract The development of safe and beneficial AI requires that systems can learn and act in accordance with human preferences. However, explicitly specifying these preferences by hand is often infeasible. Inverse reinforcement learning (IRL) addresses this challenge by inferring preferences, represented as reward functions, from expert behavior. We introduce Q-based Variational IRL (QVIRL), a novel Bayesian IRL method that recovers a posterior distribution over rewards from expert demonstrations, primarily by learning a variational distribution over Q-values. Unlike previous approaches, QVIRL combines scalability with uncertainty quantification, important for safety-critical applications. We demonstrate QVIRL’s strong performance in apprenticeship learning across various tasks, including classical control problems and safe navigation in the Safety Gymnasium suite, where the method’s uncertainty quantification allows us to produce safer policies.

734Learning from Preferences and Mixed Demonstrations in General Settings

[openreview] [pdf]

Abstract Reinforcement learning is a general method for learning in sequential settings, but it can often be difficult to specify a good reward function when the task is complex. In these cases, preference feedback or expert demonstrations can be used instead. However, existing approaches utilising both together are either ad-hoc or rely on domain-specific properties. Building upon previous work, we develop a novel theoretical framework for learning from human data. Based on this we introduce LEOPARD: Learning Estimated Objectives from Preferences And Ranked Demonstrations. LEOPARD can simultaneously learn from a broad range of data, including negative/failed demonstrations, to effectively learn reward functions in general domains. We find that when a limited amount of human feedback is available, LEOPARD outperforms the current standard practice of pre-training on demonstrations and finetuning on preferences. Furthermore, we show that LEOPARD learns faster when given many types of feedback, rather than just a single one.

735Identifying and Addressing Delusions for Target-Directed Decision Making

[openreview] [pdf]

Abstract We are interested in target-directed agents, which produce targets during decision-time planning, to guide their behaviors and achieve better generalization during evaluation. Improper training of these agents can result in delusions: the agent may come to hold false beliefs about the targets, which cannot be properly rejected, leading to unwanted behaviors and damaging out-of-distribution generalization. We identify different types of delusions by using intuitive examples in carefully controlled environments, and investigate their causes. We demonstrate how delusions can be addressed for agents trained by hindsight relabeling, a mainstream approach for training target-directed RL agents. We empirically validate the effectiveness of the proposed solutions in correcting delusional behaviors and improving out-of-distribution generalization.

736Provable unlearning in topic modeling and downstream tasks

[openreview] [pdf]

Abstract Machine unlearning algorithms are increasingly important as legal concerns arise around the provenance of training data, but verifying the success of unlearning is often difficult. Provable guarantees for unlearning are often limited to supervised learning settings. In this paper, we provide the first theoretical guarantees for unlearning in the pre-training and fine-tuning paradigm by studying topic models, simple bag-of-words language models that can be adapted to solve downstream tasks like retrieval and classification. First, we design a provably effective unlearning algorithm for topic models that incurs a computational overhead independent of the size of the original dataset. Our analysis additionally quantifies the deletion capacity of the model -- i.e., the number of examples that can be unlearned without incurring a significant cost in model performance. Finally, we formally extend our analyses to account for adaptation to a given downstream task. In particular, we design an efficient algorithm to perform unlearning after fine-tuning the topic model via a linear head. Notably, we show that it is easier to unlearn pre-training data from models that have been fine-tuned to a particular task, and one can unlearn this data without modifying the base model.

737Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios

[openreview] [pdf]

Abstract Dataset distillation has demonstrated strong performance on simple datasets like CIFAR, MNIST, and TinyImageNet but struggles to achieve similar results in more complex scenarios. In this paper, we propose a novel approach that \textbf{e}mphasizes the \textbf{d}iscriminative \textbf{f}eatures (obtained by Grad-CAM) for dataset distillation, called \textbf{EDF}. Our approach is inspired by a key observation: in simple datasets, high-activation areas typically occupy most of the image, whereas in complex scenarios, the size of these areas is much smaller. Unlike previous methods that treat all pixels equally when synthesizing images, EDF uses Grad-CAM activation maps to enhance high-activation areas. From a supervision perspective, we downplay supervision signals that have lower losses, as they contain common patterns. Additionally, to help the DD community better explore complex scenarios, we build the Complex Dataset Distillation (Comp-DD) benchmark by meticulously selecting sixteen subsets, eight easy and eight hard, from ImageNet-1K. Notably, EDF consistently outperforms SOTA results in complex scenarios, such as ImageNet-1K subsets. Hopefully, more researchers will be inspired and encouraged to enhance the practicality and efficacy of DD. Our code and benchmark will be made public.

738InverseBench: Benchmarking Plug-and-Play Diffusion Models for Scientific Inverse Problems

[openreview] [pdf]

Abstract Plug-and-play diffusion models have emerged as a promising research direction for solving inverse problems. However, current studies primarily focus on natural image restoration, leaving the performance of these algorithms in scientific inverse problems largely unexplored. To address this gap, we introduce \textsc{InverseBench}, a unified framework that evaluates diffusion models across five distinct scientific inverse problems. These problems present unique structural challenges that differ from existing benchmarks, arising from critical scientific applications such as black hole imaging, seismology, optical tomography, medical imaging, and fluid dynamics. With \textsc{InverseBench}, we benchmark 15 inverse problem algorithms that use plug-and-play diffusion models against strong, domain-specific baselines, offering valuable new insights into the strengths and weaknesses of existing algorithms. We open-source the datasets, pre-trained models, and the codebase to facilitate future research and development.

739Novelty Unlocking with Multiobjective Generative Models: Batch Diversity of Human Motions

[openreview] [pdf]

Abstract Current generative models have shown promising performance in many tasks; they typically focus on generating samples that closely adhere to a given distribution, often overlooking the requirement to produce a diverse set of optimal solutions within a single batch. Recognizing that maintaining "diversity" has been a longstanding challenge in multiobjective optimization, we were inspired to introduce a multiobjective optimization approach to enhance diversity in a single pass. This paper utilizes the in-betweening human motion generation task as an example and introduces multiobjective generative models to demonstrate the effectiveness of the proposed method in producing diverse and smooth human motion sequences. The resulting method, termed the \textit{Multiobjective Generation Framework with In-Betweening Motion Model} (MGF-IMM), frames the human motion in-betweening task as a bi-objective optimization problem. The designed in-betweening motion model is then integrated into a nondominated sorting-based optimization framework to address this bi-objective optimization problem. Through comprehensive qualitative and quantitative experiments, MGF-IMM has demonstrated state-of-the-art performance, surpassing the latest methods and validating its superiority in generating diverse in-betweening human motions.

740A Versatile Influence Function for Data Attribution with Non-Decomposable Loss

[openreview] [pdf]

Abstract The influence function, a technique rooted in robust statistics, has been adapted in modern machine learning for a novel application: data attribution---quantifying how individual training data points affect a model’s predictions. However, the common derivation of influence functions in the data attribution literature is limited to loss functions that decompose into a sum of individual data point losses, with the most prominent examples known as M-estimators. This restricts the application of influence functions to more complex learning objectives, which we refer to as non-decomposable losses, such as contrastive or ranking losses, where a unit loss term depends on multiple data points and cannot be decomposed further. In this work, we bridge this gap by revisiting the general formulation of influence function from robust statistics, which extends beyond M-estimators. Based on this formulation, we propose a novel method, the Versatile Influence Function (VIF), that can be straightforwardly applied to machine learning models trained with any non-decomposable loss. In comparison to the classical approach in statistics, the proposed VIF is designed to fully leverage the power of auto-differentiation, thereby eliminating the need for case-specific derivations of each loss function. We demonstrate the effectiveness of VIF across three examples: Cox regression for survival analysis, node embedding for network analysis, and listwise learning-to-rank for information retrieval. In all cases, the influence estimated by VIF closely resembles the results obtained by brute-force leave-one-out retraining, while being up to 1000 times faster to compute. We believe VIF represents a significant advancement in data attribution, enabling efficient influence-function-based attribution across a wide range of machine learning paradigms, with broad potential for practical use cases.

741Compressed Decentralized Learning with Error-Feedback under Data Heterogeneity

[openreview] [pdf]

Abstract Decentralized learning distributes the training process across multiple nodes, enabling collaborative model training without relying on a central server. Each node performs local training using its own data, with model updates exchanged directly between connected nodes within a given network topology. Various algorithms have been developed within this decentralized learning framework and have been proven to converge under specific assumptions. However, two key challenges remain: 1) ensuring robust performance with both a high degree of gradient compression and data heterogeneity, and 2) providing a general convergence upper bound under commonly used assumptions. To address these challenges, we propose the Discounted Error-Feedback Decentralized Parallel Stochastic Gradient Descent (DEFD-PSGD) algorithm, which efficiently manages both high levels of gradient compression and data heterogeneity, without sacrificing communication efficiency. The core idea is to introduce controllable residual error feedback that effectively balances the impact of gradient compression and data heterogeneity. Additionally, we develop novel proof techniques to derive a convergence upper bound under relaxed assumptions. Finally, we present experimental results demonstrating that DEFD-PSGD outperforms other state-of-the-art decentralized learning algorithms, particularly in scenarios involving high compression and significant data heterogeneity.

742Empowering Teachers with Enhanced Knowledge via Variable Scale Distillation Framework

[openreview] [pdf]

Abstract Knowledge distillation, a widely used model compression technique, enables a smaller student network to replicate the performance of a larger teacher network by transferring knowledge, typically in the form of softened class probabilities or feature representations. However, current approaches often fail to maximize the teacher’s feature extraction capabilities, as they treat the semantic information transfer between teacher and student as equal. This paper addresses this limitation by enhancing the teacher’s learning process through a novel Variable Scale Distillation Framework. Central to our approach is the Rescale Block, which preserves scale consistency during hierarchical distillation, allowing the teacher to extract richer, more informative features. In extensive experiments on the CIFAR100 dataset, our method consistently outperforms state-of-the-art distillation techniques, achieving an average accuracy improvement of 2.12%. This demonstrates the effectiveness of our approach in fully leveraging the teacher’s capacity to guide the student, pushing the boundaries of knowledge distillation.

743PaI is getting competitive by training longer

[openreview] [pdf]

Abstract The success of iterative pruning methods in achieving state-of-the-art sparse networks has largely been attributed to improved mask identification and an implicit regularization induced by pruning. We challenge this hypothesis and instead posit that their increased training epochs enable improved optimization. To verify this, we show that pruning at initialization (PaI) is significantly boosted by increased training epochs with repeating (cyclic) learning rate schedules akin to iterative pruning, even outperforming standard iterative pruning methods. We conjecture that the dominant mechanism behind this improvement is a better exploration of the loss landscape, leading to a lower training loss. However, at high sparsity, increased training alone is not enough for competitive performance. A strong coupling between learnt parameter initialization and mask seems to be required. Standard methods obtain this coupling via expensive pruning-training iterations, starting from a dense network. To achieve this with sparse training instead, we propose SCULPT-ing, i.e., cyclic training of any sparse mask followed by a single pruning step to couple the parameters and the mask, which is able to match the performance of state-of-the-art iterative pruning methods in the high sparsity regime at reduced computational cost.

744LLMs Can Plan Only If We Tell Them

[openreview] [pdf]

Abstract Large language models (LLMs) have demonstrated significant capabilities in natural language processing and reasoning, yet their effectiveness in autonomous planning has been under debate. While existing studies have utilized LLMs with external feedback mechanisms or in controlled environments for planning, these approaches often involve substantial computational and development resources due to the requirement for careful design and iterative backprompting. Moreover, even the most advanced LLMs like GPT-4 struggle to match human performance on standard planning benchmarks, such as the Blocksworld, without additional support. This paper investigates whether LLMs can independently generate long-horizon plans that rival human baselines. Our novel enhancements help achieve state-of-the-art results in planning benchmarks, outcompeting prior methods and human baselines, all autonomously.

745Model Extrapolation Expedites Alignment

[openreview] [pdf]

Abstract As the alignment training of large language models (LLMs) usually requires expensive computational resources, exploring more efficient alignment methods to reduce training overhead has always been an important and compelling research challenge. Inspired by prior work on model interpolation, we present a simple method called ExPO (model extrapolation) to expedite the alignment of LLMs with human preferences. Based on our observation that interpolating the weights between existing DPO/RLHF models and their initial SFT checkpoints usually produces new models with intermediate performance, we propose to treat a partially-trained model \mathcal{M}_1 (corresponding to the intermediate-performing model) as the interpolated result between the initial SFT checkpoint \mathcal{M}_0 and a hypothetical better-aligned model \mathcal{M}_2. Thus, we can obtain the hypothetical \mathcal{M}_2 by simply extrapolating the model weights along the direction from \mathcal{M}_0 to \mathcal{M}_1, which consequently saves the additional training overhead for \mathcal{M}_1 to reach better alignment performance. We validate our hypothesis through controlled experiments, demonstrating that ExPO can boost a DPO model trained with only 20% steps to outperform the fully-trained one. Additionally, we show that ExPO can also notably improve existing open-source LLMs (ranging from 1.8B to 70B parameters), as evidenced by evaluations on the mainstream LLM benchmarks AlpacaEval 2.0 and MT-Bench, which further highlights ExPO’s utility and potential in enabling more efficient LLM alignment.
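
The extrapolation step itself reduces to simple checkpoint arithmetic. A minimal sketch, where `alpha` (the extrapolation strength) and the plain state-dict format are assumptions:

```python
def expo_extrapolate(sft_state: dict, aligned_state: dict, alpha: float = 0.3) -> dict:
    """Extrapolate from the SFT checkpoint (M0) through the partially aligned
    model (M1) toward a hypothetical better-aligned model (M2):
        theta_2 = theta_1 + alpha * (theta_1 - theta_0).
    Values may be torch tensors or numpy arrays; keys must match."""
    return {
        name: aligned_state[name] + alpha * (aligned_state[name] - sft_state[name])
        for name in aligned_state
    }

# Usage (hypothetical checkpoints):
# m2_state = expo_extrapolate(sft_model.state_dict(), dpo_model.state_dict(), alpha=0.3)
# dpo_model.load_state_dict(m2_state)
```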

746Diffusion models for Gaussian distributions: Exact solutions and Wasserstein errors

[openreview] [pdf]

Abstract Diffusion or score-based models recently showed high performance in image generation. They rely on a forward and a backward stochastic differential equation (SDE). The sampling of a data distribution is achieved by solving numerically the backward SDE or its associated flow ODE. Studying the convergence of these models necessitates controlling four different types of error: the initialization error, the truncation error, the discretization error, and the score approximation error. In this paper, we study theoretically the behavior of diffusion models and their numerical implementation when the data distribution is Gaussian. In this restricted framework where the score function is a linear operator, we derive the analytical solutions of the backward SDE and the probability flow ODE. We prove that these solutions and their discretizations are all Gaussian processes, which allows us to compute exact Wasserstein errors induced by each error type for any sampling scheme. Monitoring convergence directly in the data space instead of relying on Inception features, our experiments show that the recommended numerical schemes from the diffusion models literature are also the best sampling schemes for Gaussian distributions.
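
For intuition, a standard computation under the common variance-preserving forward SDE (an assumed choice here; the abstract does not pin down the SDE) shows why every marginal stays Gaussian and the score stays linear:

```latex
% Variance-preserving (Ornstein-Uhlenbeck) forward SDE -- an assumed standard form:
%   dX_t = -(1/2) beta(t) X_t dt + sqrt(beta(t)) dW_t,   X_0 ~ N(mu, Sigma).
% With alpha_t = exp(-(1/2) \int_0^t beta(s) ds), the marginals stay Gaussian:
\[
X_t \sim \mathcal{N}\!\big(\alpha_t \mu,\; \alpha_t^2 \Sigma + (1-\alpha_t^2) I\big),
\]
% so the score is the linear (affine) map the abstract refers to:
\[
\nabla_x \log p_t(x) = -\big(\alpha_t^2 \Sigma + (1-\alpha_t^2) I\big)^{-1}\,(x - \alpha_t \mu).
\]
```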

747Channel-wise Influence: Estimating Data Influence for Multivariate Time Series

[openreview] [pdf]

Abstract The influence function, a robust statistics technique, is an effective post-hoc method that measures the impact of modifying or removing training data on model parameters, offering valuable insights into model interpretability without requiring costly retraining. It also enables extensions such as improving model performance, strengthening model generalization, and offering interpretability. Recently, Multivariate Time Series (MTS) analysis has become an important yet challenging task, attracting significant attention. However, there is no preceding research on influence functions for MTS to shed light on the effects of modifying the channels of an MTS. Given that each channel in an MTS plays a crucial role in its analysis, it is essential to characterize the influence of different channels. To fill this gap, we propose a channel-wise influence function, which is the first method that can estimate the influence of different channels in MTS, utilizing a first-order gradient approximation. Additionally, we demonstrate how this influence function can be used to estimate the influence of a channel in MTS. Finally, we validate the accuracy and effectiveness of our influence estimation function in critical MTS analysis tasks, such as MTS anomaly detection and MTS forecasting. In extensive experiments on real-world datasets, the original influence function performs worse than our method and even fails for the channel pruning problem, which demonstrates the superiority and necessity of the channel-wise influence function in MTS analysis.
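
A hedged sketch of a first-order, channel-wise influence score: the inner product between the gradient of the full loss and the gradient of a single channel's loss term. Isolating a channel through a per-channel loss is an assumption for illustration, not necessarily the paper's exact estimator:

```python
import torch

def channel_influence(model, loss_fn, x, y, channel: int) -> float:
    """First-order sketch for a multivariate series x, y of shape (T, C):
    score = <grad of full loss, grad of the loss restricted to one channel>.
    A large positive value suggests the channel pushes the parameters in
    the same direction as the overall fit."""
    params = [p for p in model.parameters() if p.requires_grad]
    pred = model(x.unsqueeze(0)).squeeze(0)  # (T, C), batch dim assumed
    g_full = torch.autograd.grad(loss_fn(pred, y), params, retain_graph=True)
    g_chan = torch.autograd.grad(loss_fn(pred[:, channel], y[:, channel]), params)
    flat = lambda gs: torch.cat([g.reshape(-1) for g in gs])
    return torch.dot(flat(g_full), flat(g_chan)).item()
```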

748Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape View

[openreview] [pdf]

Abstract Training language models currently requires pre-determining a fixed compute budget because the typical cosine learning rate schedule depends on the total number of steps. In contrast, the Warmup-Stable-Decay (WSD) schedule uses a constant learning rate to produce a main branch of iterates that can in principle continue indefinitely without a pre-specified compute budget. Then, given any compute budget, one can branch out from the main branch at a proper time with a rapidly decaying learning rate to produce a strong model. Empirically, WSD generates an intriguing, non-traditional loss curve: the loss remains elevated during the stable phase but sharply declines during the decay phase. Towards explaining this phenomenon, we conjecture that pretraining loss exhibits a river valley landscape, which resembles a deep valley with a river at its bottom. Under this assumption, we show that during the stable phase, the iterate undergoes large oscillations due to the high learning rate, yet it progresses swiftly along the river. During the decay phase, the rapidly dropping learning rate minimizes the iterate’s oscillations, moving it closer to the river and revealing true optimization progress. Therefore, the sustained high learning rate phase and fast decaying phase are responsible for progress in the river and the mountain directions, respectively, and are both critical. Our analysis predicts phenomena consistent with empirical observations and shows that this landscape can naturally emerge from pretraining on a simple bi-gram dataset. Inspired by the theory, we introduce WSD-S, a variant of WSD that reuses previous checkpoints’ decay phases and keeps only one main branch, where we resume from a decayed checkpoint. WSD-S empirically outperforms WSD and Cyclic-Cosine in obtaining multiple pretrained language model checkpoints across various compute budgets in a single run, for parameter counts scaling from 0.1B to 1.2B.
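
A minimal WSD schedule sketch; the linear warmup/decay shapes and `min_lr` are assumptions, since the abstract only fixes the three-phase structure:

```python
def wsd_lr(step: int, peak_lr: float, warmup_steps: int,
           decay_start: int, decay_steps: int, min_lr: float = 0.0) -> float:
    """Warmup-Stable-Decay: linear warmup, a constant plateau that can run
    indefinitely (the 'main branch'), then a rapid decay once a compute
    budget is committed by choosing decay_start."""
    if step < warmup_steps:
        return peak_lr * step / max(1, warmup_steps)
    if step < decay_start:
        return peak_lr  # stable phase: no budget committed yet
    frac = min(1.0, (step - decay_start) / max(1, decay_steps))
    return peak_lr + frac * (min_lr - peak_lr)

# Usage: branch off at step 90k with a 10k-step decay.
print(wsd_lr(50_000, 3e-4, 2_000, 90_000, 10_000))   # stable: 3e-4
print(wsd_lr(95_000, 3e-4, 2_000, 90_000, 10_000))   # mid-decay: 1.5e-4
```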

749Addressing Label Shift in Distributed Learning via Entropy Regularization

[openreview] [pdf]

Abstract We address the challenge of minimizing true risk in multi-node distributed learning. These systems are frequently exposed to both inter-node and intra-node label shifts, which present a critical obstacle to effectively optimizing model performance while ensuring that data remains confined to each node. To tackle this, we propose the Versatile Robust Label Shift (VRLS) method, which enhances the maximum likelihood estimation of the test-to-train label density ratio. VRLS incorporates Shannon entropy-based regularization and adjusts the density ratio during training to better handle label shifts at the test time. In multi-node learning environments, VRLS further extends its capabilities by learning and adapting density ratios across nodes, effectively mitigating label shifts and improving overall model performance. Experiments conducted on MNIST, Fashion MNIST, and CIFAR-10 demonstrate the effectiveness of VRLS, outperforming baselines by up to 20% in imbalanced settings. These results highlight the significant improvements VRLS offers in addressing label shifts. Our theoretical analysis further supports this by establishing high-probability bounds on estimation errors.

750Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training

[openreview] [pdf]

Abstract Large language models (LLMs), optimized through human feedback, have rapidly emerged as a leading paradigm for developing intelligent conversational assistants. However, despite their strong performance across many benchmarks, LLM-based agents might still lack conversational skills such as disambiguation -- when they are faced with ambiguity, they often overhedge or implicitly guess users’ true intents rather than asking clarification questions. Under task-specific settings, high-quality conversation samples are often limited, constituting a bottleneck for LLMs’ ability to learn optimal dialogue action policies. We propose Action-Based Contrastive Self-Training (ACT), a quasi-online preference optimization algorithm based on Direct Preference Optimization (DPO), that enables data-efficient dialogue policy learning in multi-turn conversation modeling. We demonstrate ACT’s efficacy in data-efficient tuning scenarios, even when there is no action label available, using multiple real-world conversational tasks: tabular-grounded question-answering, machine reading comprehension, and AmbigSQL, a novel task for disambiguating information-seeking requests for complex SQL generation towards data analysis agents. Additionally, we propose evaluating LLMs’ ability to function as conversational agents by examining whether they can implicitly recognize and reason about ambiguity in conversation. ACT demonstrates substantial conversation modeling improvements over standard tuning approaches like supervised fine-tuning and DPO.

751Theory on Mixture-of-Experts in Continual Learning

[openreview] [pdf]

Abstract Continual learning (CL) has garnered significant attention because of its ability to adapt to new tasks that arrive over time. Catastrophic forgetting (of old tasks) has been identified as a major issue in CL, as the model adapts to new tasks. The Mixture-of-Experts (MoE) model has recently been shown to effectively mitigate catastrophic forgetting in CL, by employing a gating network to sparsify and distribute diverse tasks among multiple experts. However, there is a lack of theoretical analysis of MoE and its impact on the learning performance in CL. This paper provides the first theoretical results to characterize the impact of MoE in CL via the lens of overparameterized linear regression tasks. We establish the benefit of MoE over a single expert by proving that the MoE model can diversify its experts to specialize in different tasks, while its router learns to select the right expert for each task and balance the loads across all experts. Our study further suggests an intriguing fact that the MoE in CL needs to terminate the update of the gating network after sufficient training rounds to attain system convergence, which is not needed in the existing MoE studies that do not consider the continual task arrival. Furthermore, we provide explicit expressions for the expected forgetting and overall generalization error to characterize the benefit of MoE in the learning performance in CL. Interestingly, adding more experts requires additional rounds before convergence, which may not enhance the learning performance. Finally, we conduct experiments on both synthetic and real datasets to extend these insights from linear models to deep neural networks (DNNs), which also shed light on the practical algorithm design for MoE in CL.

752Learning in complex action spaces without policy gradients

[openreview] [pdf]

Abstract Conventional wisdom suggests that policy gradient methods are better suited to complex action spaces than action-value methods. However, foundational studies have shown equivalences between these paradigms in small and finite action spaces (O’Donoghue et al., 2017; Schulman et al., 2017a). This raises the question of why their computational applicability and performance diverge as the complexity of the action space increases. We hypothesize that the apparent superiority of policy gradients in such settings stems not from intrinsic qualities of the paradigm, but from universal principles that can also be applied to action-value methods to serve similar functionality. We identify three such principles and provide a framework for incorporating them into action-value methods. To support our hypothesis, we instantiate this framework in what we term QMLE, for Q-learning with maximum likelihood estimation. Our results show that QMLE can be applied to complex action spaces with a controllable computational cost that is comparable to that of policy gradient methods, all without using policy gradients. Furthermore, QMLE demonstrates strong performance on the DeepMind Control Suite, even when compared to the state-of-the-art methods such as DMPO and D4PG.

753CycleVTON: Improving Diffusion-Based Virtual Try-On with Cycle-Consistent Training

[openreview] [pdf]

Abstract We present CycleVTON, a cycle-consistent diffusion-based virtual try-on framework. Unlike existing methods that rely on a single try-on network, our model consists of two conjugated networks. In addition to the regular try-on network, we design a clothing extraction network that extracts the clothing worn by the person and standardizes it into a front-facing format. These two networks are symmetrical, enabling alignment between the generated dressed human and real images of dressed humans, as well as between the extracted clothing and its front-facing ground truth. This cycle-consistent optimization strategy allows for enhanced retention of clothing textures and structures, ensuring more realistic and accurate clothing generation in virtual try-on scenarios. Moreover, the conjugated network structure not only supports traditional virtual try-on but also allows flexible clothing extraction and clothing exchange between different individuals. The experiments on VITON-HD demonstrate the effectiveness of our approach.

754DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation

[openreview] [pdf]

Abstract We propose a novel offline reinforcement learning (offline RL) approach, introducing the Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation (DIAR) framework. We address two key challenges in offline RL: out-of-distribution samples and long-horizon problems. We leverage diffusion models to learn state-action sequence distributions and incorporate value functions for more balanced and adaptive decision-making. DIAR introduces an Adaptive Revaluation mechanism that dynamically adjusts decision lengths by comparing current and future state values, enabling flexible long-term decision-making. Furthermore, we address Q-value overestimation by combining Q-network learning with a value function guided by a diffusion model. The diffusion model generates diverse latent trajectories, enhancing policy robustness and generalization. As demonstrated in tasks like Maze2D, AntMaze, and Kitchen, DIAR consistently outperforms state-of-the-art algorithms in long-horizon, sparse-reward environments.

755Forgetting Order of Continual Learning: What is Learned First is Forgotten Last

[openreview] [pdf]

Abstract Catastrophic forgetting poses a significant challenge in continual learning, where models often forget previous tasks when trained on new data. Our empirical analysis reveals a strong correlation between catastrophic forgetting and the learning speed of examples: examples learned early are rarely forgotten, while those learned later are more susceptible to forgetting. We demonstrate that replay-based continual learning methods can leverage this phenomenon by focusing on mid-learned examples for rehearsal. We introduce Goldilocks, a novel replay buffer sampling method that filters out examples learned too quickly or too slowly, keeping those learned at an intermediate speed. Goldilocks improves existing continual learning algorithms, leading to state-of-the-art performance across several image classification tasks.
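
A minimal sketch of the buffer filter, assuming a per-example learning-speed statistic (here, the epoch at which an example was first consistently learned) has already been logged during training; the quantile cutoffs are illustrative:

```python
import numpy as np

def goldilocks_filter(first_learned_epoch: np.ndarray,
                      low_q: float = 0.25, high_q: float = 0.75) -> np.ndarray:
    """Return indices of examples learned at an intermediate speed:
    drop the fastest-learned (rarely forgotten anyway) and the
    slowest-learned (too fragile), keep the middle for the replay buffer."""
    lo, hi = np.quantile(first_learned_epoch, [low_q, high_q])
    return np.where((first_learned_epoch >= lo) & (first_learned_epoch <= hi))[0]

# Usage: examples first learned at these epochs; the middle band is kept.
speeds = np.array([1, 1, 3, 4, 5, 6, 9, 12])
print(goldilocks_filter(speeds))
```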

756SFW sampling for diffusion models via external conditioning

[openreview] [pdf]

Abstract Score-based generative models (SBMs), also known as diffusion models, are the de facto state of the art for image synthesis. Despite their unparalleled performance, SBMs have recently been in the spotlight for being tricked into creating not-safe-for-work (NSFW) content, such as violent images and non-consensual nudity. This article proposes a safe-for-work (SFW) sampler for SBMs implementing a Conditional Trajectory Correction step that guides the samples away from undesired regions in the ambient space using external multimodal models as the source of conditioning. Furthermore, using Contrastive Language Image Pre-training (CLIP), our method admits user-defined NSFW classes, which can vary in different settings. Our experiments on the text-to-image SBM Stable Diffusion validate that the proposed SFW sampler effectively reduces the generation of explicit content, as assessed via independent NSFW detectors. Moreover, the proposed correction comes at a minor cost in image quality and has an almost null effect on samples that do not need correction. Our study confirms the suitability of the SFW sampler towards aligned SBMs.

757Teaching Transformers Causal Reasoning through Axiomatic Training

[openreview] [pdf]

Abstract For text-based AI systems to interact in the real world, causal reasoning is an essential skill. Since active interventions are costly to execute, we study to what extent an agent can learn causal reasoning from symbolic demonstrations of causal axioms. Specifically, we consider an axiomatic training setup where an agent learns from multiple demonstrations of a causal axiom (or rule), rather than incorporating the axiom as an inductive bias or inferring it from data values. A key question is whether the agent would learn to generalize from the axiom demonstrations to new scenarios. For example, if a transformer model is trained on demonstrations of the causal transitivity axiom over small graphs, would it generalize to applying the transitivity axiom over large graphs? Our results, based on a novel axiomatic training scheme, indicate that such generalization is possible. For the transitivity axiom, we find that a 67 million parameter transformer model, when trained on linear causal chains (along with some noisy variations), can generalize well to new kinds of graphs, including longer causal chains, causal chains with reversed order, and graphs with branching, even when it is not explicitly trained for such settings. We extend axiomatic training to the harder task of inferring causation from correlation statements and find similar generalization. On both tasks, our model performs on par with (or even better than) many larger language models such as GPT-4, Gemini Pro, and Phi-3. Overall, our axiomatic training framework provides a new paradigm of learning causal reasoning in language models that can be extended to arbitrary axioms, as long as sufficient demonstrations can be generated.

758Lookahead Shielding for Regular Safety Properties in Reinforcement Learning

[openreview] [pdf]

Abstract To deploy reinforcement learning (RL) systems in real-world scenarios, we need to consider requirements such as safety and constraint compliance, rather than blindly maximizing for reward. In this paper, we develop a lookahead shielding framework for RL with regular safety properties, which, in contrast to prior shielding methodologies, requires minimal prior knowledge. At each environment step, our framework aims to satisfy the regular safety property for a bounded horizon with high probability; for the tabular setting, we provide provable guarantees. We compare our setup to some common algorithms developed for the constrained Markov decision process (CMDP), and we demonstrate the effectiveness and scalability of our framework by extensively evaluating it in both tabular and deep RL environments.

759Target-Oriented Soft-Robust Inverse Reinforcement Learning

[openreview] [pdf]

Abstract In imitation learning, when the learning agent is at a state that is outside the expert’s demonstration, it can be difficult for it to choose an action. To overcome this challenge, inverse reinforcement learning (IRL) learns a parameterized reward function based on which we can generalize the expert’s behavior to those states that are unseen in the demonstration. However, on the one hand, there could be multiple reward functions that can explain the expert’s behavior, leading to reward ambiguity in IRL. On the other hand, though we often consider the transition kernel of the expert to be known to the agent, sometimes the transition kernel of the agent is different from the expert’s and is unknown, leading to transition kernel ambiguity in IRL. Drawing on the notion of soft-robust optimization, we build a target-oriented soft-robust IRL (SRIRL) model where the performance of the output policy strikes a flexible balance between risk aversion and expected return maximization towards reward uncertainty in IRL. Moreover, by employing the robust satisficing framework, our SRIRL is also robust to transition kernel ambiguity in IRL. In our target-oriented SRIRL, we keep a target for the performance of the output policy that balances expected return and risk, and we minimize the constraint violation incurred by the difference between the ambiguous transition kernel and the empirical one. We derive a tractable reformulation for SRIRL, and we design tailored first-order methods for SRIRL. Numerical results showcase the soft robustness towards reward uncertainty and the robustness against transition kernel ambiguity of SRIRL, as well as the stronger scalability of our first-order methods compared to a state-of-the-art commercial solver.

760Uncertainty-Based Abstention in LLMs Improves Safety and Reduces Hallucinations

[openreview] [pdf]

Abstract A major barrier to the practical deployment of large language models (LLMs) is their lack of reliability. Three situations where this is particularly apparent are correctness, hallucinations when given unanswerable questions, and safety where responses are harmful or offensive. In all three cases, models should ideally abstain from responding---much like humans refrain from answering questions when uncertain. Inspired by analogous approaches in classification, this study explores the feasibility and efficacy of LLMs abstaining when uncertain in the domain of question-answering. We investigate two kinds of uncertainties: statistical uncertainty metrics and a distinct verbalized measure, termed In Dialogue Uncertainty (InDU), which measures hedge words such as 'I don't know' in responses. Using these uncertainty measures combined with models with and without reinforcement learning with human feedback (RLHF), we show that in all three situations, abstention based on the right kind of uncertainty measure can boost the reliability of LLMs. By abstaining for a few highly uncertain samples, we improve correctness by up to 8%, avoid 50% of hallucinations by correctly identifying unanswerable questions, and in particular increase safety by 70-99% with almost no additional computational overhead.
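
A minimal sketch combining the two uncertainty signals for abstention; the thresholds, hedge-word list, and answer-sampling protocol are all assumptions:

```python
import math
from collections import Counter

def should_abstain(sampled_answers: list[str],
                   entropy_threshold: float = 0.7,
                   hedge_words=("i don't know", "i'm not sure")) -> bool:
    """Abstain if either (1) responses verbalize uncertainty (an InDU-style
    signal) or (2) the spread of repeatedly sampled answers is high
    (a simple statistical uncertainty proxy via empirical entropy)."""
    if any(h in a.lower() for a in sampled_answers for h in hedge_words):
        return True  # verbalized uncertainty
    counts = Counter(sampled_answers)
    n = len(sampled_answers)
    entropy = -sum((c / n) * math.log(c / n) for c in counts.values())
    return entropy > entropy_threshold

print(should_abstain(["Paris", "Paris", "Paris"]))          # False: consistent
print(should_abstain(["Paris", "Lyon", "I'm not sure..."]))  # True
```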

761ON EXTRAPOLATION IN MATERIAL PROPERTY REGRESSION

[openreview] [pdf]

Abstract Deep learning methods have yielded exceptional performance in material property regression (MPR). However, most existing methods operate under the assumption that the training and test data are independent and identically distributed (i.i.d.). This overlooks the importance of extrapolation---predicting material properties beyond the range of training data---which is essential for advanced material discovery, as researchers strive to identify materials with exceptional properties that exceed current capabilities. In this paper, we address this gap by introducing a comprehensive benchmark comprising seven tasks specifically designed to evaluate extrapolation in MPR. We critically evaluate existing methods, including deep imbalanced regression (DIR) and regression data augmentation (DA) methods, and reveal their limitations in extrapolation tasks. To address these issues, we propose the Matching-based EXtrapolation (MEX) framework, which reframes MPR as a material-property matching problem to alleviate the inherent complexity of the direct material-to-label mapping paradigm for better extrapolation. Our experimental results show that MEX outperforms all existing methods on our benchmark and demonstrates exceptional capability in identifying promising materials, underscoring its potential for advancing material discovery.

762Combating the Generalization-Forgetting Trade-off in Continual Learning: A Cautious Passive Low-Rank Approach

[openreview] [pdf]

Abstract Large Language Models (LLMs) have shown remarkable capabilities through wide-scale pre-training on a wide range of domains. However, they often suffer from catastrophic forgetting when learning sequential tasks. In this paper, we propose a novel parameter-efficient approach for continual learning in LLMs, which empirically explores the role of different effective layerwise ranks, leveraging lower ranks to mitigate catastrophic forgetting of previous tasks and higher ranks to enhance generalization on new tasks. By employing a subspace similarity metric that evaluates the orthogonality of low-rank subspaces between tasks, we gradually increase the rank of layerwise matrices for each new task, minimizing interference with previously learned tasks while enhancing generalization. Experimental results on standard continual learning benchmarks and challenging math benchmarks demonstrate that our method outperforms existing state-of-the-art approaches, effectively mitigating forgetting, improving task performance, and maintaining strong generalization to unseen tasks in a memory-efficient manner.

763Mitigating Generative Privacy Risks of Diffusion Models via Mixed Self-Synthesized Data Fine-tuning

[openreview] [pdf]

Abstract Diffusion models (DMs) have demonstrated exceptional performance across various generative tasks, yet they also face significant security and privacy concerns, such as Membership Inference Attacks (MIAs), where adversaries attempt to determine whether specific images were part of the DM’s training set. These threats present serious risks, particularly as pre-trained DMs are increasingly accessible online. To address these privacy concerns, we begin by investigating how fine-tuning DMs on a manipulated self-synthesized dataset affects their generative privacy risks, and have the following observations: (1) DMs fine-tuned solely on self-synthesized clean images are more vulnerable to privacy attacks; (2) DMs fine-tuned on perturbed self-synthesized images become more robust against privacy attacks but exhibit degraded image generation quality. Based on the observations, we propose MixSyn, a simple and effective framework designed to mitigate privacy risks by fine-tuning DMs on a mixed self-synthesized dataset, which is a mixture of clean and perturbed synthetic images. Extensive experimental results demonstrate that our method significantly mitigates the generative privacy risks of DMs while preserving their original image generation quality.

764Generalization Gradient Descent

[openreview] [pdf]

Abstract We propose a new framework for evaluating the relationship between features and generalization via a theoretical analysis of the out-of-distribution (OOD) generalization problem, in which we simultaneously use two mathematical tools: a generalization ratio that quantitatively characterizes the degree of generalization, and a generalization decision process (GDP) that formalizes the relationship between the losses on seen and unseen domains. By combining the concepts of informativeness and variation in the generalization ratio, we intuitively connect them to OOD problems and derive the generalization inequality. We then introduce it into the GDP to select the best loss from the seen domains for gradient descent during backpropagation. In the case where the classifier is defined by a fully connected neural network, the entire system is trained with backpropagation. There is no need for any model selection criterion or for operating on gradients during training. Experiments demonstrate the potential of the framework through qualitative and quantitative evaluation of its generalization ability.

765Provable Post-Deployment Deterioration Monitoring

[openreview] [pdf]

Abstract Data distribution often changes when deploying a machine learning model into a new environment, but not all shifts degrade model performance, making interventions like retraining unnecessary. This paper addresses model post-deployment deterioration (PDD) monitoring in the context of unlabeled deployment distributions. We formalize unsupervised PDD monitoring within the model disagreement framework, where deterioration is detected if an auxiliary model, performing well on training data, shows significant prediction disagreement with the deployed model on test data. We propose D-PDDM, a principled monitoring algorithm achieving low false positive rates under non-deteriorating shifts, and provide sample complexity bounds for high true positive rates under deteriorating shifts. Empirical results on both a standard benchmark and a real-world large-scale healthcare dataset demonstrate the effectiveness of the framework, in addition to its viability as an alert mechanism for existing high-stakes ML pipelines.

766Weighted-Rank Contrastive Regression for Robust Learning on Imbalance Social Media Popularity Prediction

[openreview] [pdf]

Abstract Social Media Popularity Prediction (SMPP) is the task of forecasting the level of engagement a social media post will receive. It is crucial for understanding audience engagement and enabling targeted marketing strategies. However, the inherent imbalance in real-world social media data, where certain popularity levels are underrepresented, poses a significant challenge. In this study, we leveraged the recent success of contrastive learning and its growing integration into regression tasks by introducing a Weighted-Rank CR loss to address the data imbalance challenges. Experiments on the Social Media Prediction Dataset demonstrated that our method outperformed the vanilla approach and the current state-of-the-art contrastive regression approach, Rank-N-Contrast.

767Endless Jailbreaks with Bijection Learning

[openreview] [pdf]

Abstract Despite extensive safety training, LLMs are vulnerable to adversarial inputs. In this work, we introduce a simple but powerful attack paradigm, bijection learning, that yields a practically endless set of jailbreak prompts. We exploit language models’ advanced reasoning capabilities to teach them invertible languages (bijections) in context, pass encoded queries to the model to bypass built-in safety mechanisms, and finally decode responses back into English, yielding helpful replies to harmful requests. Our approach proves effective on a wide range of frontier language models and harm categories. Bijection learning is an automated and universal attack that grows stronger with scale: larger models with more advanced reasoning capabilities are more susceptible to bijection learning jailbreaks despite stronger safety mechanisms.
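
As a toy illustration of an invertible "language" (the paper teaches bijections in context and uses far richer mappings; this fixed letter permutation is only a stand-in):

```python
import random
import string

def make_bijection(seed: int = 0) -> tuple[dict, dict]:
    """A toy invertible encoding: a random permutation of the alphabet.
    Returns the encode map and its inverse."""
    rng = random.Random(seed)
    letters = list(string.ascii_lowercase)
    shuffled = letters[:]
    rng.shuffle(shuffled)
    enc = dict(zip(letters, shuffled))
    dec = {v: k for k, v in enc.items()}
    return enc, dec

def apply_map(text: str, mapping: dict) -> str:
    """Apply the letter map; characters outside the map pass through."""
    return "".join(mapping.get(c, c) for c in text.lower())

enc, dec = make_bijection()
encoded_query = apply_map("describe the method", enc)          # sent to the model
assert apply_map(encoded_query, dec) == "describe the method"  # decode replies
```

The attack's key property is exactly this invertibility: the query travels through the model in encoded form, bypassing refusal triggers, and the response is mechanically decodable afterward.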

768Global Convergence of Policy Gradient in Average Reward MDPs

[openreview] [pdf]

Abstract We present the first comprehensive finite-time global convergence analysis of policy gradient for infinite horizon average reward Markov decision processes (MDPs). Specifically, we focus on ergodic tabular MDPs with finite state and action spaces. Our analysis shows that the policy gradient iterates converge to the optimal policy at a sublinear rate of O(\frac{1}{T}), which translates to O(\log(T)) regret, where T represents the number of iterations. Performance bounds for discounted reward MDPs cannot be easily extended to average reward MDPs, as the bounds grow proportionally to the fifth power of the effective horizon. Recent work on such extensions makes a smoothness assumption that has not been verified. Thus, our primary contribution is in providing the first complete proof that the policy gradient algorithm converges globally for average-reward MDPs, without such an assumption. We also obtain the corresponding finite-time performance guarantees. In contrast to the existing discounted reward performance bounds, our performance bounds have an explicit dependence on constants that capture the complexity of the underlying MDP. Motivated by this observation, we reexamine and improve the existing performance bounds for discounted reward MDPs. We also present simulations which empirically validate the result.
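
For reference, the average-reward objective and the plain policy gradient iteration that the analysis studies, written out in standard notation (the step size \eta and reward notation are the usual conventions, not taken from the abstract):

```latex
% Average-reward objective for an ergodic MDP, and the policy gradient iteration:
\[
J(\theta) = \lim_{T \to \infty} \frac{1}{T}\,
\mathbb{E}_{\pi_\theta}\!\Big[\textstyle\sum_{t=0}^{T-1} r(s_t, a_t)\Big],
\qquad
\theta_{k+1} = \theta_k + \eta\, \nabla_\theta J(\theta_k).
\]
% The stated guarantee: J(\theta^*) - J(\theta_T) = O(1/T),
% which corresponds to O(\log T) regret over T iterations.
```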

769Entropy-Based Aggregation for Fair and Effective Federated Learning

[openreview] [pdf]

Abstract Federated Learning (FL) enables collaborative model training across distributed devices while preserving data privacy. Nonetheless, the heterogeneity of edge devices often leads to inconsistent performance of the globally trained models, resulting in unfair outcomes among users. Existing federated fairness algorithms strive to enhance fairness but often fall short in maintaining the overall performance of the global model, typically measured by the average accuracy across all clients. To address this issue, we propose a novel algorithm that leverages entropy-based aggregation combined with model and gradient alignments to simultaneously optimize fairness and global model performance. Our method employs a bi-level optimization framework, where we derive an analytic solution to the aggregation probability in the inner loop, making the optimization process computationally efficient. Additionally, we introduce an innovative alignment update and an adaptive strategy in the outer loop to further balance global model’s performance and fairness. Theoretical analysis indicates that our approach guarantees convergence even in non-convex FL settings and demonstrates significant fairness improvements in generalized regression and strongly convex models. Empirically, our approach surpasses state-of-the-art federated fairness algorithms, ensuring consistent performance among clients while improving the overall performance of the global model.

770FedMAP: Unlocking Potential in Personalized Federated Learning through Bi-Level MAP Optimization

[openreview] [pdf]

Abstract Federated Learning (FL) enables collaborative training of machine learning (ML) models on decentralized data while preserving data privacy. However, data across clients often differs significantly due to class imbalance, feature distribution skew, sample size imbalance, and other phenomena. Learning from these non-identically distributed (non-IID) datasets poses challenges during training. Existing FL methods based on a single global model cannot effectively capture client data variations, resulting in suboptimal performance. Personalized FL (PFL) techniques were introduced to adapt to the local data distribution of each client and utilize the data from other clients. They have shown promising results in addressing these challenges. We propose FedMAP, a novel Bayesian PFL framework which applies Maximum A Posteriori (MAP) estimation to effectively mitigate various non-IID data issues, by means of a parametric prior distribution, which is updated during aggregation. We provide a theoretical foundation illustrating FedMAP’s convergence properties. In particular, we prove that the prior updates in FedMAP correspond to gradient descent iterations for a linear combination of envelope functions associated with the local losses. This differs from previous FL approaches, which aim at minimizing a weighted average of local loss functions and often face challenges with heterogeneous data distributions, resulting in reduced client performance and slower convergence in non-IID settings. Finally, we show, through evaluations of synthetic and real-world datasets, that FedMAP achieves better performance than existing methods. Moreover, we offer a robust, ready-to-use framework to facilitate practical deployment and further research.

771Direct Preference Optimization With Unobserved Preference Heterogeneity

[openreview] [pdf]

Abstract RLHF has emerged as a pivotal step in aligning language models with human objectives and values. It typically involves learning a reward model from human preference data and then using reinforcement learning to update the generative model accordingly. In contrast, Direct Preference Optimization (DPO) directly optimizes the generative model with preference data, skipping reinforcement learning. However, both RLHF and DPO assume uniform preferences, overlooking the reality of diverse human annotators. This paper presents a new method to align generative models with varied human preferences. We propose an Expectation-Maximization adaptation to DPO, generating a mixture of models based on latent preference types of the annotators. We then introduce a min-max regret ensemble learning model to produce a single generative model that minimizes worst-case regret among annotator subgroups with similar latent factors. Our algorithms leverage the simplicity of DPO while accommodating diverse preferences. Experimental results validate the effectiveness of our approach in producing equitable generative policies.

772STABLE DIFFUSION MODELS ARE SECRETLY GOOD AT VISUAL IN-CONTEXT LEARNING

[openreview] [pdf]

Abstract Large language models (LLM) in natural language processing (NLP) have demonstrated great potential for in-context learning (ICL) -- the ability to leverage a small set of example prompts to adapt to various tasks without having to explicitly update model weights. ICL has recently been explored for the visual domain with promising early outcomes. These approaches involve specialized training and/or additional data, which complicates the process and limits its generalizability. In this work, we show that off-the-shelf Stable Diffusion models can be re-purposed for visual in-context learning (V-ICL). Specifically, we formulate an in-place attention re-computation within the self-attention layers of the Stable Diffusion architecture that explicitly incorporates context between the query and example prompts. Without any additional fine-tuning, we show that this re-purposed Stable Diffusion model is able to adapt to six different tasks: foreground segmentation, single object detection, semantic segmentation, keypoint detection, edge detection, and colorization. For example, the proposed approach improves the mean intersection over union (mIoU) for the foreground segmentation task on the Pascal-5i dataset by 8.9% and 3.2% over recent methods such as Visual Prompting and IMProv, respectively. Additionally, we show that the proposed method is able to effectively leverage multiple prompts through ensembling to infer the task better and further improve the performance across all tasks.

773Jacobian Descent for Multi-Objective Optimization

[openreview] [pdf]

Abstract Many optimization problems require balancing multiple conflicting objectives. As gradient descent is limited to single-objective optimization, we introduce its direct generalization: Jacobian descent (JD). This algorithm iteratively updates parameters using the Jacobian matrix of a vector-valued objective function, in which each row is the gradient of an individual objective. While several methods to combine gradients already exist in the literature, they are generally hindered when the objectives conflict. In contrast, we propose projecting gradients to fully resolve conflict while ensuring that they preserve an influence proportional to their norm. We prove significantly stronger convergence guarantees with this approach, supported by our empirical results. Our method also enables instance-wise risk minimization (IWRM), a novel learning paradigm in which the loss of each training example is considered a separate objective. Applied to simple image classification tasks, IWRM exhibits promising results compared to the direct minimization of the average loss. Additionally, we outline an efficient implementation of JD using the Gramian of the Jacobian matrix to reduce time and memory requirements.
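
A hedged sketch of the kind of gradient combination Jacobian descent performs; the paper's exact projection operator is not reproduced here, so a PCGrad-style pairwise projection stands in to illustrate resolving conflicts among the Jacobian's rows.

```python
# Hedged sketch of one Jacobian descent step. The paper's exact aggregator
# is not reproduced; a PCGrad-style pairwise projection illustrates
# combining the rows of the Jacobian into a single update.
import numpy as np

def aggregate_jacobian(jac: np.ndarray) -> np.ndarray:
    """jac has shape (m, d): one gradient row per objective."""
    rows = jac.copy()
    for i in range(len(rows)):
        for j in range(len(rows)):
            if i == j:
                continue
            dot = rows[i] @ jac[j]
            if dot < 0:  # conflict: remove the component opposing objective j
                rows[i] -= dot / (jac[j] @ jac[j] + 1e-12) * jac[j]
    return rows.mean(axis=0)

# Two conflicting objectives in 2D:
jac = np.array([[1.0, 1.0],
                [-1.0, 0.5]])
update = aggregate_jacobian(jac)
# Stepping along -update does not increase any objective to first order:
assert (jac @ update >= -1e-9).all()
```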

774Towards Off-Road Autonomous Driving via Planner Guided Policy Optimization

[openreview] [pdf]

Abstract Off-road autonomous driving poses significant challenges such as navigating diverse terrains, avoiding obstacles, and maneuvering through ditches. Addressing these challenges requires effective planning and adaptability, making it a long-horizon planning and control problem. Traditional model-based control techniques like Model Predictive Path Integral (MPPI) require dense sampling and accurate modeling of the vehicle-terrain interaction, both of which are computationally expensive, making effective long-horizon planning in real-time intractable. Reinforcement learning (RL) methods operate without this limitation and are computationally cheaper at deployment. However, exploration in obstacle-dense and challenging terrains is difficult, and typical RL techniques struggle to navigate in these terrains. To alleviate the limitations of MPPI, we propose a hierarchical autonomy pipeline with a low-frequency high-level MPPI planner and a high-frequency low-level RL controller. To tackle RL’s exploration challenge, we propose a teacher-student paradigm to learn an end-to-end RL policy, capable of real-time execution and traversal through challenging terrains. The teacher policy is trained using dense planning information from an MPPI planner while the student policy learns to navigate using visual inputs and sparse planning information. In this framework, we introduce a new policy gradient formulation that extends Proximal Policy Optimization (PPO), leveraging off-policy trajectories for teacher guidance and on-policy trajectories for student exploration. We demonstrate our performance in a realistic off-road simulator against various RL and imitation learning methods.

775WMAdapter: Adding WaterMark Control to Latent Diffusion Models

[openreview] [pdf]

Abstract Watermarking is essential for protecting the copyright of AI-generated images. We propose WMAdapter, a diffusion model watermark plugin that embeds user-specified watermark information seamlessly during the diffusion generation process. Unlike previous methods that modify diffusion modules to incorporate watermarks, WMAdapter is designed to keep all diffusion components intact, resulting in sharp, artifact-free images. To achieve this, we introduce two key innovations: (1) We develop a contextual adapter that conditions on the content of the cover image to generate adaptive watermark embeddings. (2) We implement an additional finetuning step and a hybrid finetuning strategy that suppresses noticeable artifacts while preserving the integrity of the diffusion components. Empirical results show that WMAdapter provides strong flexibility, superior image quality, and competitive watermark robustness.

776Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription

[openreview] [pdf]

Abstract Based on the success of large image diffusion models, multi-view diffusion models have demonstrated remarkable zero-shot capability in novel view synthesis (NVS). However, the pioneering work Zero123 struggles to maintain consistency across multiple generated views. While recent modifications in model and training design have improved multi-view consistency, they often introduce new limitations, such as restricted fixed view generation or reliance on additional conditions. These constraints hinder the broader application of multi-view diffusion models in downstream tasks like 3D reconstruction. We identify the root cause of inconsistency as the excessive diversity inherent in generative models utilized for the NVS task. To address this, we aim to utilize stronger supervision to better align generated views with ground-truth images and thereby constrain this diversity, and propose Ctrl123, a closed-loop transcription-based multi-view diffusion method that enforces alignment in the CLIP patch feature space. Extensive experiments demonstrate that Ctrl123 excels in arbitrary novel view generation, significantly improving multi-view consistency compared to existing methods.

777xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing

[openreview] [pdf]

Abstract Reusing pre-collected data from different domains is an appealing solution for decision-making tasks that have insufficient data in the target domain but are relatively abundant in other related domains. Existing cross-domain policy transfer methods mostly aim at learning domain correspondences or corrections to facilitate policy learning, such as learning domain/task-specific discriminators, representations, or policies. This design philosophy often results in heavy model architectures or task/domain-specific modeling, lacking flexibility. This reality makes us wonder: can we directly bridge the domain gaps universally at the data level, instead of relying on complex downstream cross-domain policy transfer models? In this study, we propose the Cross-Domain Trajectory EDiting (xTED) framework that employs a specially designed diffusion model for cross-domain trajectory adaptation. Our proposed model architecture effectively captures the intricate dependencies among states, actions, and rewards, as well as the dynamics patterns within target data. By utilizing the pre-trained diffusion as a prior, source domain trajectories can be transformed to match target domain properties while preserving original semantic information. This process implicitly corrects underlying domain gaps, enhancing state realism and dynamics reliability in the source data, and allowing flexible incorporation with various downstream policy learning methods. Despite its simplicity, xTED demonstrates superior performance in extensive simulation and real-robot experiments.

778Enhancing Logits Distillation with Plug&Play Kendall’s τ Ranking Loss

[openreview] [pdf]

Abstract Knowledge distillation typically employs the Kullback-Leibler (KL) divergence to constrain the output of the student model to precisely match the soft labels provided by the teacher model. However, the optimization process of KL divergence is challenging for the student and prone to suboptimal points. Also, we demonstrate that the gradients provided by KL divergence depend on channel scale and thus tend to overlook low-probability channels. The mismatch in low-probability channels also results in the neglect of inter-class relationship information, making it difficult for the student to further enhance performance. To address this issue, we propose an auxiliary ranking loss based on Kendall’s τ Coefficient, which can be plugged into any logit-based distillation method, providing inter-class relationship information and balancing the attention to low-probability channels. We show that the proposed ranking loss is less affected by channel scale, and its optimization objective is consistent with that of KL divergence. Extensive experiments on CIFAR-100, ImageNet, and COCO datasets, as well as various CNN and ViT teacher-student architecture combinations, demonstrate that the proposed ranking loss can be plugged into various baselines and enhance their performance.
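
A minimal sketch of a differentiable Kendall's τ surrogate of the sort described; the paper's exact loss may differ, and the tanh-based pairwise agreement below is an assumption chosen to keep the example self-contained.

```python
# Hedged sketch of a differentiable Kendall's tau surrogate between student
# and teacher logits. The paper's exact formulation may differ; pairwise
# channel orderings are compared here with a smooth tanh agreement score.
import torch

def soft_kendall_tau_loss(student: torch.Tensor, teacher: torch.Tensor,
                          temperature: float = 1.0) -> torch.Tensor:
    """student, teacher: (batch, num_classes) logits."""
    s_diff = student.unsqueeze(2) - student.unsqueeze(1)  # (B, C, C) pair gaps
    t_diff = teacher.unsqueeze(2) - teacher.unsqueeze(1)
    # Concordant channel pairs push the product toward +1, discordant to -1;
    # unlike KL, the score depends on orderings rather than channel scale.
    agreement = torch.tanh(s_diff / temperature) * torch.tanh(t_diff / temperature)
    return 1.0 - agreement.mean()

student = torch.randn(8, 100, requires_grad=True)
teacher = torch.randn(8, 100)
# Plug-and-play: add to an existing distillation loss, e.g.
# loss = kl_loss + rank_weight * soft_kendall_tau_loss(student, teacher)
soft_kendall_tau_loss(student, teacher).backward()
```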

779Diff-BBO: Diffusion-Based Inverse Modeling for Black-Box Optimization

[openreview] [pdf]

Abstract Black-box optimization (BBO) aims to optimize an objective function by iteratively querying a black-box oracle in a sample-efficient way. While prior studies focus on forward approaches to learn surrogates for the unknown objective function, they struggle with steering clear of out-of-distribution and invalid inputs. Recently, inverse modeling approaches that map the objective space to the design space with conditional diffusion models have demonstrated impressive capability in learning the data manifold. They have shown promising performance in offline BBO tasks. However, these approaches require a pre-collected dataset. How to design the acquisition function for inverse modeling to actively query new data remains an open question. In this work, we propose diffusion-based inverse modeling for black-box optimization (Diff-BBO), an inverse approach leveraging diffusion models for the online BBO problem. Instead of proposing candidates in the design space, Diff-BBO employs a novel acquisition function, Uncertainty-aware Exploration (UaE), to propose objective function values. Subsequently, we employ a conditional diffusion model to generate samples based on these proposed values within the design space. We demonstrate that using UaE results in optimal optimization outcomes, supported by both theoretical and empirical evidence.

780How does Your RL Agent Explore? An Optimal Transport Analysis of Occupancy Measure Trajectories

[openreview] [pdf]

Abstract The rising successes of RL are propelled by combining smart algorithmic strategies and deep architectures to optimize the distribution of returns and visitations over the state-action space. A quantitative framework to compare the learning processes of these eclectic RL algorithms is currently absent but desired in practice. We address this gap by representing the learning process of an RL algorithm as a sequence of policies generated during training, and then studying the policy trajectory induced in the manifold of occupancy measures. Using an optimal transport-based metric, we measure the length of the paths induced by the policy sequence yielded by an RL algorithm between an initial policy and a final optimal policy. Hence, we first define the Effort of Sequential Learning (ESL). ESL quantifies the relative distance that an RL algorithm travels compared to the shortest path from the initial to the optimal policy. Further, we connect the dynamics of policies in the occupancy measure space and regret, another metric to understand the suboptimality of an RL algorithm, by defining the Optimal Movement Ratio (OMR). OMR assesses the fraction of movements in the occupancy measure space that effectively reduce an analogue of regret. Finally, we derive approximation guarantees to estimate ESL and OMR with a finite number of samples and without access to an optimal policy. Through empirical analyses across various environments and algorithms, we demonstrate that ESL and OMR provide insights into the exploration processes of RL algorithms and the hardness of different tasks in discrete and continuous MDPs.

781NoisyTraj: Robust Trajectory Prediction with Noisy Observations

[openreview] [pdf]

Abstract Trajectory prediction aims to forecast an agent’s future trajectories based on its historical observed trajectories, which is a critical task for various applications such as autonomous driving, robotics, and surveillance systems. Most existing trajectory prediction methods assume that the observed trajectories collected for forecasting are clean. However, in real-world scenarios, noise is inevitably introduced into the observations due to errors from sensors, detection, and tracking processes, resulting in the collapse of existing approaches. Therefore, it is essential to perform robust trajectory prediction based on noisy observations, which is a more practical scenario. In this paper, we propose NoisyTraj, a noise-agnostic approach capable of tackling the problem of trajectory prediction with arbitrary types of noisy observations. Specifically, we put forward a mutual information-based mechanism to denoise the original noisy observations. This mechanism optimizes the produced trajectories to exhibit a pattern that closely resembles the clean trajectory pattern while deviating from the noisy one. Considering that the trajectory structure may be destroyed when optimizing mutual information alone, we introduce an additional reconstruction loss to preserve the structure information of the produced observed trajectories. Moreover, we further propose a ranking loss based on the intuitive idea that prediction performance using denoised trajectories should surpass that using the original noisy observations, thereby further enhancing performance. Because NoisyTraj does not rely on any specific module tailored to particular noise distributions, it can handle arbitrary types of noise in principle. Additionally, our proposed NoisyTraj can be easily integrated into existing trajectory prediction models. Extensive experiments conducted on the ETH/UCY and Stanford Drone (SDD) datasets demonstrate that NoisyTraj significantly improves the accuracy of trajectory prediction with noisy observations, compared to the baselines.

782Active Fine-Tuning of Generalist Policies

[openreview] [pdf]

Abstract Pre-trained generalist policies are rapidly gaining relevance in robot learning due to their promise of fast adaptation to novel, in-domain tasks. This adaptation often relies on collecting new demonstrations for a specific task of interest and applying imitation learning algorithms, such as behavioral cloning. However, as soon as several tasks need to be learned, we must decide: which tasks should be demonstrated, and how often? We study this multi-task problem and explore an interactive framework in which the agent adaptively selects the tasks to be demonstrated. We propose AMF (Active Multi-task Fine-tuning), an algorithm to maximize multi-task policy performance under a limited demonstration budget by collecting demonstrations yielding the largest information gain on the expert policy. We derive performance guarantees for AMF under regularity assumptions and demonstrate its empirical effectiveness to efficiently fine-tune neural policies in complex and high-dimensional environments.

783Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning

[openreview] [pdf]

Abstract Offline meta reinforcement learning (OMRL) has emerged as a promising approach for interaction avoidance and strong generalization performance by leveraging pre-collected data and meta-learning techniques. Previous context-based approaches predominantly rely on the intuition that alternating optimization between the context encoder and the policy can lead to performance improvements, as long as the context encoder follows the principle of maximizing the mutual information between the task variable $M$ and its latent representation $Z$ ($I(Z;M)$) while the policy adopts the standard offline reinforcement learning (RL) algorithms conditioning on the learned task representation. Despite promising results, the theoretical justification of performance improvements for such intuition remains underexplored. Inspired by the return discrepancy scheme in the model-based RL field, we find that the previous optimization framework can be linked with the general RL objective of maximizing the expected return, thereby explaining performance improvements. Furthermore, after scrutinizing this optimization framework, we find it ignores the variation of the task representation in the alternating optimization process, which weakens the condition necessary for monotonic performance improvements, and may therefore violate the monotonicity. We name this issue task representation shift and theoretically prove that the monotonic performance improvements can be guaranteed with appropriate context encoder updates. We use different settings to rein in the task representation shift on three widely adopted training objectives concerning maximizing $I(Z;M)$ across different data qualities. Empirical results show that reining in the task representation shift can indeed improve performance. Our work opens up a new avenue for OMRL, leading to a better understanding between the task representation and performance improvements.

784A Dual-Fusion Cognitive Diagnosis Framework for Open Student Learning Environments

[openreview] [pdf]

Abstract Cognitive diagnosis model (CDM) is a fundamental and upstream component in intelligent education. It aims to infer students’ mastery levels based on historical response logs. However, existing CDMs usually follow the ID-based embedding paradigm, which could often diminish the effectiveness of CDMs in open student learning environments. This is mainly because they can hardly directly infer new students’ mastery levels or utilize new exercises or knowledge without retraining. Textual semantic information, due to its unified feature space and easy accessibility, can help alleviate this issue. Unfortunately, directly incorporating semantic information may not benefit CDMs, since it does not capture response-relevant features and thus discards the individual characteristics of each student. To this end, this paper proposes a dual-fusion cognitive diagnosis framework (DFCD) to address the challenge of aligning two different modalities, i.e., textual semantic features and response-relevant features. Specifically, in DFCD, we first propose the exercise-refiner and concept-refiner to make the exercises and knowledge concepts more coherent and reasonable via large language models. Then, DFCD encodes the refined features using text embedding models to obtain the semantic information. For response-related features, we propose a novel response matrix to fully incorporate the information within the response logs. Finally, DFCD designs a dual-fusion module to merge the two modal features. The ultimate representations possess the capability of inference in open student learning environments and can also be plugged into existing CDMs. Extensive experiments across real-world datasets show that DFCD achieves superior performance by integrating different modalities and strong adaptability in open student learning environments.

785Almost sure convergence of stochastic Hamiltonian descent methods

[openreview] [pdf]

Abstract Gradient normalization and soft clipping are two popular techniques for tackling instability issues and improving convergence of stochastic gradient descent (SGD) with momentum. In this article, we study these types of methods through the lens of dissipative Hamiltonian systems. Gradient normalization and certain types of soft clipping algorithms can be seen as (stochastic) implicit-explicit Euler discretizations of dissipative Hamiltonian systems, where the kinetic energy function determines the type of clipping that is applied. We make use of dynamical systems theory to show in a unified way that all of these schemes converge to stationary points of the objective function, almost surely, in several different settings: (a) for $L$-smooth objective functions, when the variance of the stochastic gradients is possibly infinite; (b) under the $(L_0,L_1)$-smoothness assumption, for heavy-tailed noise with bounded variance; and (c) for $(L_0,L_1)$-smooth functions in the empirical risk minimization setting, when the variance is possibly infinite but the expectation is finite.
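
To illustrate the implicit-explicit Euler view, here is a hedged sketch in which the kinetic energy T(p) = sqrt(|p|^2 + eps) turns the parameter step into a normalized-momentum update; the objective and step sizes are illustrative assumptions, not the paper's experimental setup.

```python
# Hedged sketch of momentum SGD as an implicit-explicit Euler step of a
# dissipative Hamiltonian system. Choosing the kinetic energy
# T(p) = sqrt(|p|^2 + eps) makes the parameter step a normalized-momentum
# update, one of the scheme types the analysis covers.
import numpy as np

def hamiltonian_step(x, p, grad_fn, lr=0.1, mu=0.9, eps=1e-8):
    p = mu * p - grad_fn(x)                   # momentum update (explicit part)
    kinetic_grad = p / np.sqrt(p @ p + eps)   # grad T(p): normalized momentum
    x = x + lr * kinetic_grad                 # parameter step along grad T(p)
    return x, p

grad = lambda x: 2.0 * x                      # gradient of f(x) = |x|^2
x, p = np.array([3.0, -2.0]), np.zeros(2)
for _ in range(200):
    x, p = hamiltonian_step(x, p, grad)
print(x)  # ends near the stationary point; the fixed step bounds final accuracy
```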

786Leveraging Variable Sparsity to Refine Pareto Stationarity in Multi-Objective Optimization

[openreview] [pdf]

Abstract Gradient-based multi-objective optimization (MOO) is essential in modern machine learning, with applications in e.g., multi-task learning, federated learning, algorithmic fairness and reinforcement learning. In this work, we first reveal some limitations of Pareto stationarity, a widely accepted first-order condition for Pareto optimality, in the presence of sparse function-variable structures. Next, to account for such sparsity, we propose a novel solution concept termed Refined Pareto Stationarity (RPS), which we prove is always sandwiched between Pareto optimality and Pareto stationarity. We give an efficient partitioning algorithm to automatically mine the function-variable dependency and substantially trim non-optimal Pareto stationary solutions. Then, we show that gradient-based descent algorithms in MOO can be enhanced with our refined partitioning. In particular, we propose Multiple Gradient Descent Algorithm with Refined Partition (RP-MGDA) as an example method that converges to RPS, while still enjoying a similar per-step complexity and convergence rate. Lastly, we validate our approach through experiments on both synthetic examples and realistic application scenarios where distinct function-variable dependency structures appear. Our results highlight the importance of exploiting function-variable structure in gradient-based MOO, and provide a seamless enhancement to existing approaches.

787Open-Set Domain Adaptation Under Background Distribution Shift: Challenges and A Provably Efficient Solution

[openreview] [pdf]

Abstract In Open-Set Domain Adaptation (OSDA) we wish to perform classification in a target domain which contains a novel class along with $k$ non-novel classes. This work formally studies OSDA under the assumption that classes are separable, and the supports of source and target domains coincide, while other aspects of the distribution may change. We develop a simple and scalable method that attains robustness to distribution shift and is guaranteed to solve the problem, while showing that it cannot be solved under weaker conditions that have been studied for OSDA in the past, particularly in the presence of covariate shift. We formally define the realistic assumptions within the scope of the OSDA problem that the previous literature has either overlooked or not explicitly addressed. In a thorough empirical evaluation on both image and text data, we observe that existing OSDA methods are not robust to the distribution shifts we consider. The results demonstrate the efficacy of joint representation learning for classification of known classes and detection of novel ones using principled methods. We find that optimizing these two objectives in unison leads to mutual improvements in task performance, contrary to what might be expected when the objectives are considered independently. Our rigorous empirical study also examines how OSDA performance under distribution shift is affected by parameters of the problem such as the size of the novel class. Taken together, our observations emphasize the importance of formalizing the assumptions under which OSDA methods operate and of developing appropriate methodologies capable of scaling with large datasets and models for different scenarios of OSDA.

788Transformers versus LSTMs for electronic trading

[openreview] [pdf]

Abstract The rapid advancement of artificial intelligence has seen widespread application of long short-term memory (LSTM), a type of recurrent neural network (RNN), in time series forecasting. Despite the success of Transformers in natural language processing (NLP), which prompted interest in their efficacy for time series prediction, their application in financial time series forecasting is less explored compared to the dominant LSTM models. This study investigates whether Transformer-based models can outperform LSTMs in financial time series forecasting. It involves a comparative analysis of various LSTM-based and Transformer-based models on multiple financial prediction tasks using high-frequency limit order book data. A novel LSTM-based model named DLSTM is introduced alongside a newly designed Transformer-based model tailored for financial predictions. The findings indicate that Transformer-based models exhibit only a marginal advantage in predicting absolute price sequences, whereas LSTM-based models demonstrate superior and more consistent performance in predicting differential sequences such as price differences and movements.

789Unified Framework for Causal Discovery and Long-term Forecasting in Non-stationary Environments

[openreview] [pdf]

Abstract Non-stationary data is prevalent in various real-world domains such as climate science, economics, and neuroscience, presenting significant challenges for tasks like forecasting and causal discovery from observational data. Existing approaches often operate under the assumption that the data is stationary. In this work, we introduce a unified framework that combines long-term forecasting and causal discovery with non-linear relations in a non-stationary setting. Specifically, we assume that the nonlinear causal relations in the observed space can be transformed into linear relations in the latent space via projections. In addition, we model the non-stationarity in the system as arising from time-varying causal relations. The proposed model demonstrates that adopting a causal perspective for long-term forecasting not only addresses the limitations of each task but also makes the causal process identifiable, enhances interpretability, and provides more reliable predictions. Moreover, our approach reformulates causal discovery into a scalable, non-parametric deep learning problem. Through experiments on both synthetic and real-world datasets, we show that our framework outperforms baseline methods in both forecasting and causal discovery, underscoring the benefits of this integrated approach.

790TLXML: Task-Level Explanation of Meta-Learning via Influence Functions

[openreview] [pdf]

Abstract The scheme of adaptation via meta-learning is seen as an ingredient for solving the problem of data shortage or distribution shift in real-world applications, but it also brings the new risk of inappropriate updates of the model in the user environment, which increases the demand for explainability. Among the various types of XAI methods, establishing a method of explanation based on past experience in meta-learning requires special consideration due to its bi-level structure of training, which has been left unexplored. In this work, we propose influence functions for explaining meta-learning that measure the sensitivities of training tasks to adaptation and inference. We also argue that the approximation of the Hessian using the Gauss-Newton matrix resolves computational barriers peculiar to meta-learning. We demonstrate the adequacy of the method through experiments on task distinction and task distribution distinction using image classification tasks with MAML and Prototypical Network.

791No Preference Left Behind: Group Distributional Preference Optimization

[openreview] [pdf]

Abstract Preferences within a group of people are not uniform but follow a distribution. While existing alignment methods like Direct Preference Optimization (DPO) attempt to steer models to reflect human preferences, they struggle to capture the distributional pluralistic preferences within a group. These methods often skew toward dominant preferences, overlooking the diversity of opinions, especially when conflicting preferences arise. To address this issue, we propose Group Distribution Preference Optimization (GDPO), a novel framework that aligns language models with the distribution of preferences within a group by incorporating the concept of beliefs that shape individual preferences. GDPO calibrates a language model using statistical estimation of the group’s belief distribution and aligns the model with belief-conditioned preferences, offering a more inclusive alignment framework than traditional methods. In experiments using both synthetic controllable opinion generation and real-world movie review datasets, we show that DPO fails to align with the targeted belief distributions, while GDPO consistently reduces this alignment gap during training. Additionally, our evaluation metrics demonstrate that GDPO outperforms existing approaches in aligning with group distributional preferences, marking a significant advance in pluralistic alignment.

792Cluster-Segregate-Perturb (CSP): A Model-agnostic Explainability Pipeline for Spatiotemporal Land Surface Forecasting Models

[openreview] [pdf]

Abstract Satellite images are increasingly valuable for modeling regional climate change. Earth surface forecasting is one task that combines satellite imagery and meteorological data to understand how climate evolves over time. However, understanding the complex relationship between meteorological variables and land surface changes remains a challenge. Our paper introduces a pipeline that integrates principles from perturbation-based techniques like LIME and global explainability methods like PDP, addressing the limitations of these techniques in high-dimensional spatiotemporal models. This pipeline facilitates analyses such as marginal sensitivity, correlation, and lag analysis for complex land surface forecasting models. Using ConvLSTM for surface forecasting, we analyzed the influence of variables like temperature, pressure, and precipitation on the NDVI of the surface predictions. Our study on the EarthNet2021 dataset (which primarily consists of samples from the European Alps region, collected during the spring to fall seasons) revealed that precipitation had the greatest impact, followed by temperature, while pressure had little to no direct effect on NDVI. Additionally, interesting nonlinear correlations between meteorological variables and NDVI were uncovered.

793HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models

[openreview] [pdf]

Abstract Safety guard models that detect malicious queries aimed at large language models (LLMs) are essential for ensuring the secure and responsible deployment of LLMs in real-world applications. However, deploying existing safety guard models with billions of parameters alongside LLMs on mobile devices is impractical due to substantial memory requirements and latency. To reduce this cost, we distill a large teacher safety guard model into a smaller one using a labeled dataset of instruction-response pairs with binary harmfulness labels. Due to the limited diversity of harmful instructions in the existing labeled dataset, naively distilled models tend to underperform compared to larger models. To bridge the gap between small and large models, we propose HarmAug, a simple yet effective data augmentation method that involves jailbreaking an LLM and prompting it to generate harmful instructions. Given a prompt such as, “Make a single harmful instruction prompt that would elicit offensive content”, we add an affirmative prefix (e.g., “I have an idea for a prompt:”) to the LLM’s response. This encourages the LLM to continue generating the rest of the response, leading to sampling harmful instructions. Another LLM generates a response to the harmful instruction, and the teacher model labels the instruction-response pair. We empirically show that our HarmAug outperforms other relevant baselines. Moreover, a 435-million-parameter safety guard model trained with HarmAug achieves an F1 score comparable to larger models with over 7 billion parameters, and even outperforms them in AUPRC, while operating at less than 25% of their computational cost. Our code, safety guard model, and synthetic dataset are publicly available.
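
The affirmative-prefix trick is easy to express as a sketch; `chat` below is a hypothetical stand-in for an LLM client, and only the quoted prompt wording and the prefix idea come from the abstract.

```python
# Hedged sketch of the HarmAug augmentation loop described above. `chat` is
# a hypothetical callable wrapping an LLM API; replace with a real client.
from typing import Callable

def sample_harmful_instruction(chat: Callable[[str], str]) -> str:
    prompt = (
        "Make a single harmful instruction prompt that would elicit "
        "offensive content.\n"
        "I have an idea for a prompt:"  # affirmative prefix continuation
    )
    return chat(prompt)

def make_labeled_pair(chat: Callable[[str], str],
                      teacher_score: Callable[[str, str], float]):
    instruction = sample_harmful_instruction(chat)
    response = chat(instruction)  # a second LLM answers the instruction
    # The teacher safety guard model labels the resulting pair.
    return instruction, response, teacher_score(instruction, response)
```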

794Preference Elicitation for Offline Reinforcement Learning

[openreview] [pdf]

Abstract Applying reinforcement learning (RL) to real-world problems is often made challenging by the inability to interact with the environment and the difficulty of designing reward functions. Offline RL addresses the first challenge by considering access to an offline dataset of environment interactions labeled by the reward function. In contrast, Preference-based RL does not assume access to the reward function and learns it from preferences, but typically requires online interaction with the environment. We bridge the gap between these frameworks by exploring efficient methods for acquiring preference feedback in a fully offline setup. We propose Sim-OPRL, an offline preference-based reinforcement learning algorithm, which leverages a learned environment model to elicit preference feedback on simulated rollouts. Drawing on insights from both the offline RL and the preference-based RL literature, our algorithm employs a pessimistic approach for out-of-distribution data, and an optimistic approach for acquiring informative preferences about the optimal policy. We provide theoretical guarantees regarding the sample complexity of our approach, dependent on how well the offline data covers the optimal policy. Finally, we demonstrate the empirical performance of Sim-OPRL in various environments.

795Flashback: Understanding and Mitigating Forgetting in Federated Learning

[openreview] [pdf]

Abstract In Federated Learning (FL), forgetting, or the loss of knowledge across rounds, hampers algorithm convergence, especially in the presence of severe data heterogeneity among clients. This study explores the nuances of this issue, emphasizing the critical role of forgetting in FL’s inefficient learning within heterogeneous data contexts. Knowledge loss occurs in both client-local updates and server-side aggregation steps; addressing one without the other fails to mitigate forgetting. We introduce a metric to measure forgetting granularly, ensuring distinct recognition amid new knowledge acquisition. Based on this, we propose Flashback, a novel FL algorithm with a dynamic distillation approach that regularizes the local models and effectively aggregates their knowledge. The results from extensive experimentation across different benchmarks show that Flashback mitigates forgetting and outperforms other state-of-the-art methods, reaching target accuracy in 6 to 16 rounds and converging up to 27× faster.
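
A hedged sketch of measuring forgetting granularly in the spirit described above (the paper's exact metric is not reproduced): knowledge lost in a round is tracked separately from knowledge newly acquired, which a net accuracy change would conflate.

```python
# Hedged sketch: per-example forgetting vs. learning between two FL rounds.
# The paper's exact metric may differ; this separates the two effects that a
# single accuracy delta would hide.
import numpy as np

def forgetting_and_learning(correct_prev: np.ndarray,
                            correct_curr: np.ndarray) -> tuple[float, float]:
    """Boolean per-example correctness before/after a round."""
    forgotten = np.mean(correct_prev & ~correct_curr)  # known -> lost
    learned = np.mean(~correct_prev & correct_curr)    # unknown -> gained
    return float(forgotten), float(learned)

prev = np.array([1, 1, 1, 0, 0], dtype=bool)
curr = np.array([1, 0, 1, 1, 0], dtype=bool)
f, l = forgetting_and_learning(prev, curr)
print(f, l)  # 0.2 forgotten, 0.2 learned: net accuracy alone hides both effects
```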

796Auction-Based Regulation for Artificial Intelligence

[openreview] [pdf]

Abstract In an era of “moving fast and breaking things”, regulators have moved slowly to pick up the safety, bias, and legal pieces left in the wake of broken Artificial Intelligence (AI) deployment. Since AI models, such as large language models, are able to push misinformation and stoke division within our society, it is imperative for regulators to employ a framework that mitigates these dangers and ensures user safety. While there is much-warranted discussion about how to address the safety, bias, and legal woes of state-of-the-art AI models, rigorous and realistic mathematical frameworks to regulate AI safety are lacking. We take on this challenge, proposing an auction-based regulatory mechanism that provably incentivizes model-building agents (i) to deploy safer models and (ii) to participate in the regulation process. We provably guarantee, via derived Nash Equilibria, that each participating agent’s best strategy is to submit a model safer than a prescribed minimum-safety threshold. Empirical results show that our regulatory auction boosts safety and participation rates by 20% and 15% respectively, outperforming simple regulatory frameworks that merely enforce minimum safety standards.

797Fair Anomaly Detection For Imbalanced Groups

[openreview] [pdf]

Abstract Anomaly detection (AD) has been widely studied for decades in many real-world applications, including fraud detection in finance and intrusion detection in cybersecurity. Due to the imbalanced nature between protected and unprotected groups and the imbalanced distributions of normal examples and anomalies, the learning objectives of most existing anomaly detection methods tend to solely concentrate on the dominating unprotected group. Thus, many researchers have recognized the significance of ensuring model fairness in anomaly detection. However, the existing fair anomaly detection methods tend to erroneously label most normal examples from the protected group as anomalies in the imbalanced scenario where the unprotected group is more abundant than the protected group. This phenomenon is caused by the improper design of learning objectives, which statistically focus on learning the frequent patterns (i.e., the unprotected group) while overlooking the under-represented patterns (i.e., the protected group). To address these issues, we propose FADIG, a fairness-aware anomaly detection method targeting the imbalanced scenario. It consists of a fairness-aware contrastive learning module and a rebalancing autoencoder module to ensure fairness and handle the imbalanced data issue, respectively. Moreover, we provide theoretical analysis that shows our proposed contrastive learning regularization guarantees group fairness. Empirical studies demonstrate the effectiveness and efficiency of FADIG across multiple real-world datasets.

798Understanding and Mitigating Distribution Shifts for Machine Learning Force Fields

[openreview] [pdf]

Abstract Machine Learning Force Fields (MLFFs) are a promising alternative to expensive ab initio quantum mechanical molecular simulations. Given the diversity of chemical spaces that are of interest and the cost of generating new data, it is important to understand how MLFFs generalize beyond their training distributions. Our diagnostic experiments on real-world datasets reveal common distribution shifts that pose significant challenges, including for large foundation models trained on extensive datasets. Based on these observations, we hypothesize that current supervised training methods inadequately regularize MLFFs, resulting in overfitting and learning poor representations of out-of-distribution systems. We therefore propose two new methods as initial steps for mitigating distribution shifts for MLFFs. Our methods focus on test-time refinement strategies that incur minimal computational cost. The first strategy, based on spectral graph theory, modifies the edges of test graphs to align with graph structures seen during training. It can be applied to any existing pre-trained model to mitigate connectivity distribution shifts. Our second strategy improves representations for out-of-distribution systems at test-time by taking gradient steps using an auxiliary objective. We demonstrate that our test-time refinement strategies can reduce force errors by an order of magnitude on out-of-distribution systems, suggesting that MLFFs are capable of modeling diverse chemical spaces, but are not being effectively trained to do so. Our experiments establish clear benchmarks for evaluating the generalization capabilities of the next generation of MLFFs.

799Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps

[openreview] [pdf]

Abstract Humor is a culturally nuanced aspect of human language that presents challenges for understanding and generation, requiring participants to possess good creativity and strong associative thinking. Similar to reasoning tasks like solving math problems, humor generation requires continuous reflection and revision to foster creative thinking, rather than relying on a sudden flash of inspiration like the Creative Leap-of-Thought (CLoT) paradigm. Although CLoT can realize the ability of remote association generation, this paradigm fails to emphasize the importance of rationales between those seemingly unrelated concepts. Therefore, in this paper, we propose a systematic way of thinking about generating humor and, based on it, build the Creative Leap of Structured Thought (CLoST) framework. First, a reward model is necessary to enable error correction, since there is currently no expert model of humor and no usable rule to determine whether a piece of content is humorous. Judgement-oriented instructions are designed to improve the capability of the model, and we also propose an open-domain instruction evolutionary method to fully unleash its potential. Then, through reinforcement learning, the model learns to hone the rationales of its thought chain and refine the strategies it uses. Thus, it learns to recognize and correct its mistakes, and finally generate the most humorous and creative answer. These findings deepen our understanding of the creative capabilities of LLMs and provide ways to enhance LLMs’ creative abilities for cross-domain innovative applications.

800Revisiting Large-Scale Non-convex Distributionally Robust Optimization

[openreview] [pdf]

Abstract Distributionally robust optimization (DRO) is a powerful technique to train robust machine learning models that perform well under distribution shifts. Compared with empirical risk minimization (ERM), DRO optimizes the expected loss under the worst-case distribution in an uncertainty set of distributions. This paper revisits the important problem of DRO with non-convex smooth loss functions. For this problem, Jin et al. (2021) showed that its dual problem satisfies the generalized $(L_0, L_1)$-smoothness condition and that the gradient noise satisfies the affine variance condition, designed an algorithm of mini-batch normalized gradient descent with momentum, and proved its convergence and complexity. In this paper, we show that the dual problem and the gradient noise satisfy simpler yet more precise partially generalized smoothness and partially affine variance conditions by studying the optimization variable and dual variable separately, which further yields much simpler algorithm design and convergence analysis. We develop a double stochastic gradient descent with clipping (D-SGD-C) algorithm that converges to an $\epsilon$-stationary point with $\mathcal{O}(\epsilon^{-4})$ gradient complexity, which matches the results in Jin et al. (2021). Our algorithm does not need to use momentum, and the proof is much simpler, thanks to the more precise characterization of partially generalized smoothness and partially affine variance noise. We further design a variance-reduced method that achieves a lower gradient complexity of $\mathcal{O}(\epsilon^{-3})$. Our theoretical results and insights are further verified numerically on a number of tasks, and our algorithms outperform the existing DRO method (Jin et al., 2021).
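
A minimal, hedged sketch of one D-SGD-C-style step: both the model parameters and the dual variable take clipped stochastic gradient steps. The actual dual objective, step sizes, clipping thresholds, and the positivity floor below are illustrative assumptions, not the paper's exact algorithm.

```python
# Hedged sketch of a double-SGD-with-clipping step for a DRO dual problem.
import numpy as np

def clip(g: np.ndarray, max_norm: float) -> np.ndarray:
    """Rescale g so its norm never exceeds max_norm."""
    norm = np.linalg.norm(g)
    return g if norm <= max_norm else g * (max_norm / norm)

def dsgd_c_step(w, lam, grad_w_fn, grad_lam_fn,
                lr_w=1e-2, lr_lam=1e-2, c=1.0):
    w = w - lr_w * clip(grad_w_fn(w, lam), c)              # primal (model) step
    lam_step = float(np.clip(grad_lam_fn(w, lam), -c, c))  # scalar dual gradient
    lam = max(1e-6, lam - lr_lam * lam_step)               # keep dual feasible
    return w, lam
```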

801Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration

[openreview] [pdf]

Abstract Studying how to fine-tune policies pre-trained with offline reinforcement learning (RL) is profoundly significant for enhancing the sample efficiency of RL algorithms. However, directly fine-tuning pre-trained policies often results in sub-optimal performance. This is primarily due to the distribution shift between the offline pre-training and online fine-tuning stages. Specifically, the distribution shift limits the acquisition of effective online samples, ultimately impacting the online fine-tuning performance. To narrow down the distribution shift between the offline and online stages, we propose Q-conditioned state entropy (QCSE) as an intrinsic reward. Specifically, QCSE maximizes the state entropy of all samples individually, considering their respective Q values. This approach encourages exploration of low-frequency samples while penalizing high-frequency ones, and implicitly achieves State Marginal Matching (SMM), thereby ensuring optimal performance and resolving the asymptotic sub-optimality of constraint-based approaches. Additionally, QCSE can seamlessly integrate into various RL algorithms, enhancing online fine-tuning performance. To validate our claim, we conduct extensive experiments and observe significant improvements with QCSE (about 10.9% for CQL and 8% for Cal-QL). Furthermore, we extend our experiments to other algorithms, confirming the generality of QCSE.
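
A hedged sketch of a Q-conditioned state-entropy bonus: a particle-based k-NN estimate stands in for state entropy (a common estimator, though not necessarily the paper's), and the exponential Q-weighting is an illustrative assumption rather than the exact QCSE formulation.

```python
# Hedged sketch of a Q-conditioned state-entropy intrinsic reward.
import numpy as np

def qcse_bonus(states: np.ndarray, q_values: np.ndarray, k: int = 5) -> np.ndarray:
    """states: (n, d) visited states; q_values: (n,) critic estimates."""
    dists = np.linalg.norm(states[:, None, :] - states[None, :, :], axis=-1)
    knn = np.sort(dists, axis=1)[:, k]   # distance to each point's k-th neighbor
    entropy_proxy = np.log(1.0 + knn)    # large in sparsely visited regions
    weights = np.exp(q_values - q_values.max())  # favor high-value rare states
    return entropy_proxy * weights

states = np.random.randn(256, 4)
bonus = qcse_bonus(states, q_values=np.random.randn(256))
# Added to the environment reward during fine-tuning: r_total = r + beta * bonus
```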

802Out-of-distribution Generalization for Total Variation based Invariant Risk Minimization

[openreview] [pdf]

Abstract Invariant risk minimization is an important general machine learning framework that has recently been interpreted as a total variation model (IRM-TV). However, how to improve out-of-distribution (OOD) generalization in the IRM-TV setting remains unsolved. In this paper, we propose a novel OOD generalization approach for IRM-TV, named OOD-TV-IRM, based on its theoretical analysis. The key idea is to deploy an autonomous TV penalty that depends on the invariant feature extractor. We construct the autonomous TV penalty using a neural network with another set of parameters, which can be learned via an adversarial scheme against the parameters of the invariant feature extractor. Experimental results show that OOD-TV-IRM outperforms IRM-TV in most situations.

803Prompt Optimization with Human Feedback

[openreview] [pdf]

Abstract Large language models (LLMs) have demonstrated remarkable performance in various tasks. However, the performance of LLMs heavily depends on the input prompt. This has given rise to a number of recent works on prompt optimization. However, the previous works often require the availability of a numeric score to assess the quality of every prompt. Unfortunately, when a human user interacts with a black-box LLM, it is often infeasible and unreliable to attain such a score. Instead, it is usually significantly easier and more reliable to obtain preference feedback from a human user, i.e., showing the user the responses generated from a pair of prompts and asking the user which one is preferred. Therefore, in this paper, we study the problem of prompt optimization with human feedback (POHF), in which we aim to optimize the prompt for a black-box LLM using only human preference feedback. By drawing inspiration from dueling bandits, we design a theoretically principled strategy to select a pair of prompts to query for preference feedback in every iteration, and hence introduce our algorithm named automated POHF (APOHF). We apply our APOHF algorithm to a variety of tasks, including optimizing user instructions, prompt optimization for text-to-image generative models, and response optimization with human feedback (i.e., further refining the response using a variant of our APOHF). The results demonstrate that our APOHF can efficiently find a good prompt using a small number of preference feedback instances.

804Dreamguider: Improved Training free Diffusion-based Conditional Generation

[openreview] [pdf]

Abstract Diffusion models have emerged as a formidable tool for training-free conditional generation. However, a key hurdle in inference-time guidance techniques is the need for compute-heavy backpropagation through the diffusion network for estimating the guidance direction. Moreover, these techniques often require handcrafted parameter tuning on a case-by-case basis. Although some recent works have introduced minimal compute methods for linear inverse problems, a generic lightweight guidance solution to both linear and non-linear guidance problems is still missing. To this end, we propose Dreamguider, a method that enables inference-time guidance without compute-heavy backpropagation through the diffusion network. The key idea is to regulate the gradient flow through a time-varying factor. Moreover, we propose an empirical guidance scale that works for a wide variety of tasks, hence removing the need for handcrafted parameter tuning. We further introduce an effective lightweight augmentation strategy that significantly boosts the performance during inference-time guidance. We present experiments using Dreamguider on multiple tasks across multiple datasets and models to show the effectiveness of the proposed modules. To facilitate further research, we will make the code public after the review process.

805Mirror Descent Actor Critic via Bounded Advantage Learning

[openreview] [pdf]

Abstract Regularization is a core component of recent Reinforcement Learning (RL) algorithms. Mirror Descent Value Iteration (MDVI) uses both Kullback-Leibler divergence and entropy as regularizers in its value and policy updates. Despite its empirical success in discrete action domains and strong theoretical guarantees, the performance improvement of a MDVI-based method over entropy-only-regularized RL is limited in continuous action domains. In this study, we propose Mirror Descent Actor Critic (MDAC) as an actor-critic style instantiation of MDVI for continuous action domains, and show that its empirical performance is significantly boosted by bounding the values of the actor’s log-density terms in the critic’s loss function. Further, we relate MDAC to Advantage Learning by recalling that the actor’s log-probability is equal to the regularized advantage function in tabular cases, and theoretically show that the error of optimal policy misspecification is decreased by bounding the advantage terms.
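
A hedged sketch of the bounding idea: the actor's log-density terms are clamped where they enter the critic's loss. The target below is illustrative only and not the exact MDAC objective; the coefficients and names are assumptions.

```python
# Hedged sketch: clamping the actor's log-density terms in a critic target.
import torch

def bounded_critic_target(reward, next_q, log_pi, log_pi_prev,
                          gamma=0.99, alpha=0.2, bound=1.0):
    # In tabular cases the actor's log-probability equals the regularized
    # advantage, so clamping it bounds the advantage-like terms.
    log_pi = torch.clamp(log_pi, -bound, bound)
    log_pi_prev = torch.clamp(log_pi_prev, -bound, bound)
    # Entropy- and KL-style regularizers enter through the bounded terms.
    return reward + gamma * (next_q - alpha * log_pi) + alpha * log_pi_prev
```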

[openreview] [pdf]

Abstract Transformer models have achieved remarkable results in the field of Natural Language Processing (NLP) with the introduction of breakthrough large language models like GPT and LLaMA recently. Motivated by their ability to capture long-range dependencies, researchers have successfully adapted these models to the task of time series forecasting. However, despite their potential, the effectiveness of applying these pre-trained time series transformer models in the target domain is limited due to the need for hyper-parameter optimisation to match the characteristics of the target domain. This paper presents a novel algorithm that uses parameter-efficient fine-tuning such as Low Rank Adaptation (LoRA) coupled with Limited Discrepancy Search (LDS) to efficiently auto fine-tune pre-trained time series transformers for a given target domain. Our approach helps in making informed design choices involving LoRA tunable hyper-parameters with strong performance-cost trade-offs that are highly transferable across different target domains. Our experiments demonstrate that autotune efficiently identifies the optimal configuration of LoRA hyper-parameters, achieving an average MASE improvement of 5.21% across all datasets and 4.76% for out-of-domain datasets compared to zero-shot pre-trained models, with improvements as high as 20.59% for one of the out-of-domain datasets.

807Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing

[openreview] [pdf]

Abstract Structured State Space Models (SSMs) have emerged as alternatives to transformers, addressing the challenges of processing long sequences. While SSMs are often regarded as effective in capturing long-term dependencies, we theoretically demonstrate that they suffer from a strong recency bias. Our empirical findings reveal that this bias impairs the models’ ability to recall distant information and introduces robustness issues. We conducted scaling experiments and discovered that deeper structures in SSMs facilitate the learning of long contexts. However, our theoretical analysis reveals that as SSMs increase in depth, they exhibit a tendency toward over-smoothing, resulting in token representations becoming increasingly indistinguishable. This over-smoothing phenomenon ultimately constrains the scalability of SSMs to achieve improved performance. Collectively, these findings highlight important limitations of SSMs and underscore the need for further research to address these challenges in long-range sequence modeling.

808Evaluating and Explaining the Severity of Distribution Shifts: Illustration with Tabular Text Classification

[openreview] [pdf]

Abstract After deploying a machine learning model, distribution shifts may emerge in real-world data. When dealing with unlabeled data, it can be challenging to accurately assess the impact of these drifts on the model’s performance, for any type and intensity of shift. In that case, decisions such as updating the model for every benign shift would not be cost-efficient. In this paper, we introduce the Error Classifier, an error assessment method that addresses two tasks: unsupervised performance estimation and error detection on out-of-distribution data. The Error Classifier computes the probability that the model will fail based on detected fault patterns. Further, we employ a sampling-based approximation of Shapley values, with the Error Classifier as the value function, in order to explain why a shift is predicted as severe, in terms of feature values. As explanation methods can sometimes disagree, we suggest evaluating the consistency of explanations produced by our technique and by other methods. We focus on classification and illustrate the relevance of our method in a bimodal context, on tabular datasets with text fields. We measure our method against a selection of 15 baselines from various domains, on 7 datasets with a variety of shifts, and 2 multimodal fusion strategies for the classification models. Lastly, we show the usefulness of our explanation algorithm on instances affected by various types of shifts.

809Incorporating Visual Correspondence into Diffusion Model for Visual Try-On

[openreview] [pdf]

Abstract Diffusion models have shown preliminary success in the virtual try-on (VTON) task. The typical dual-branch architecture comprises two UNets for implicit garment deformation and synthesized image generation respectively, and has emerged as the recipe for the VTON task. Nevertheless, it remains challenging to preserve the shape and every detail of the given garment due to the intrinsic stochasticity of the diffusion model. To alleviate this issue, we propose to explicitly capitalize on visual correspondence as the prior to tame the diffusion process, instead of simply feeding the whole garment into the UNet as the appearance reference. Specifically, we interpret the fine-grained appearance and texture details as a set of structured semantic points, and match the semantic points rooted in the garment to the ones over the target person through local flow warping. Such 2D points are then augmented into 3D-aware cues with the depth/normal map of the target person. The correspondence mimics the way of putting clothing on a human body, and the 3D-aware cues act as semantic point matching to supervise diffusion model training. A point-focused diffusion loss is further devised to take full advantage of semantic point matching. Extensive experiments demonstrate strong garment detail preservation of our approach, evidenced by state-of-the-art VTON performance on both the VITON-HD and DressCode datasets.

810Semantic-Aware Diffusion Model for Sequential Recommendation

[openreview] [pdf]

Abstract Sequential recommendation aims to predict the next click for a particular user based on their historical interacted item sequences. Recently, diffusion-based methods have achieved state-of-the-art performance in sequential recommendation. However, they fail to effectively utilize the rich semantic information embedded in items during the diffusion process to accurately guide the generation, leading to sub-optimal results. To address this limitation, we designed SDRec, a Semantic-aware Diffusion model for sequential Recommendation. Our model introduces a novel architecture, the Semantic Fusion Layer, which leverages the embedding table from the encoder to incorporate item semantics into the diffusion process through an attention mechanism. Together with the well-designed contrastive and generative losses, SDRec effectively utilizes the item semantics in the diffusion model, unleashing the potential of sequential recommendation. Our experiments show that SDRec has over 10% relative gain with superior efficiency compared with existing methods.

811Differentiable Solver Search for fast diffusion sampling

[openreview] [pdf]

Abstract Diffusion-based models have demonstrated remarkable generation quality but at the cost of numerous function evaluations. Recently, advanced ODE-based solvers have been developed to mitigate the substantial computational demands of reverse-diffusion solving under limited sampling steps. However, these solvers, heavily inspired by Adams-like multistep methods, rely solely on t-related Lagrange interpolation. We show that t-related Lagrange interpolation is suboptimal and identify a compact search space comprising timesteps and solver coefficients. Building on our analysis, we propose a novel differentiable solver search algorithm to identify the optimal solver. Equipped with the searched solver, our rectified flow models, SiT-XL/2 and FlowDCN-XL/2, achieve FID scores of 2.40 and 2.35, respectively, on ImageNet $256\times256$ with only 10 steps. Meanwhile, our DDPM model, DiT-XL/2, reaches an FID score of 2.33 with only 10 steps. Notably, our searched solver outperforms traditional solvers by a significant margin. Moreover, our searched solver demonstrates its generality across various model architectures, resolutions, and model sizes.
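
A hedged sketch of what a differentiable solver search could look like: learnable timesteps and per-step combination coefficients over cached model outputs, optimized by gradient descent against a reference sampler. The exact parameterization and training objective in the paper may differ; `model` is a placeholder velocity/score network.

```python
import torch

steps = 10
# Learnable interior timesteps and per-step combination coefficients.
raw_t = torch.nn.Parameter(torch.linspace(1.0, 0.0, steps + 1)[1:-1].clone())
coeffs = torch.nn.Parameter(torch.zeros(steps, steps))  # coeffs[i, :i+1] used

def searched_sampler(model, noise):
    ts = torch.cat([torch.ones(1),
                    raw_t.clamp(0, 1).sort(descending=True).values,
                    torch.zeros(1)])
    x, history = noise, []
    for i in range(steps):
        history.append(model(x, ts[i]))
        # Learned linear combination of all cached outputs (Adams-like,
        # but with free coefficients instead of Lagrange interpolation).
        w = torch.softmax(coeffs[i, : i + 1], dim=0)
        direction = sum(wj * h for wj, h in zip(w, history))
        x = x + (ts[i + 1] - ts[i]) * direction
    return x

# Demo with a stand-in velocity field; real training would minimize the gap
# to a many-step reference sampler, e.g.:
#   opt = torch.optim.Adam([raw_t, coeffs], lr=1e-2)
#   loss = (searched_sampler(model, z) - reference_sample(z)).pow(2).mean()
dummy = lambda x, t: -x
print(searched_sampler(dummy, torch.randn(2, 3)).shape)
```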

812Distilling Reinforcement Learning into Single-Batch Datasets

[openreview] [pdf]

Abstract Dataset distillation compresses a large dataset into a small, often one-batch, synthetic dataset such that learning on the synthetic dataset approximates learning on the large dataset. Training on the distilled dataset can be performed in as little as one step of gradient descent. We demonstrate that distillation is generalizable to different tasks by distilling reinforcement learning environments into one-batch supervised learning datasets. This demonstrates not only distillation’s ability to compress a reinforcement learning task but also its ability to transform one learning modality (reinforcement learning) into another (supervised learning). We present a novel extension of proximal policy optimization for meta-learning and use it in distillation of both a multi-dimensional extension of the classic cart-pole problem and several Atari games. We demonstrate distillation’s ability to compress complex RL environments into one-step supervised learning, explore RL distillation’s generalizability across learner architectures, and demonstrate distilling an environment into the smallest-possible synthetic dataset.
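
To make the one-step idea concrete, here is a minimal sketch of one-batch distillation: a synthetic batch is optimized so that a single inner gradient step on it fits a real task. In the paper the outer signal comes from an RL objective via a PPO extension; a toy supervised regression task stands in for it here.

```python
import torch

torch.manual_seed(0)
X_real = torch.randn(256, 4)
y_real = X_real @ torch.tensor([1.0, -2.0, 0.5, 0.0]) + 0.1 * torch.randn(256)

X_syn = torch.randn(8, 4, requires_grad=True)   # the distilled "dataset"
y_syn = torch.randn(8, requires_grad=True)
inner_lr = 0.5

opt = torch.optim.Adam([X_syn, y_syn], lr=1e-2)
for _ in range(2000):
    w0 = torch.zeros(4, requires_grad=True)      # fresh learner (zero-init here)
    inner_loss = ((X_syn @ w0 - y_syn) ** 2).mean()
    (g,) = torch.autograd.grad(inner_loss, w0, create_graph=True)
    w1 = w0 - inner_lr * g                       # one step of "training"
    outer_loss = ((X_real @ w1 - y_real) ** 2).mean()
    opt.zero_grad()
    outer_loss.backward()                        # meta-gradient into the batch
    opt.step()
print(outer_loss.item())
```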

813Multi-Session Budget Optimization for Forward Auction-based Federated Learning

[openreview] [pdf]

Abstract Auction-based Federated Learning (AFL) has emerged as an important research field in recent years. The prevailing strategies for FL data consumers (DCs) assume that the entire team of the required data owners (DOs) for an FL task must be assembled before training can commence. In practice, a DC can trigger the FL training process multiple times. DOs can thus be gradually recruited over multiple FL model training sessions. Existing bidding strategies for AFL DCs are not designed to handle such scenarios. Therefore, the problem of multi-session AFL remains open. To address this problem, we propose the Multi-session Budget Optimization Strategy for forward Auction-based Federated Learning (MultiBOS-AFL). Based on hierarchical reinforcement learning, MultiBOS-AFL jointly optimizes inter-session budget pacing and intra-session bidding for AFL DCs, with the objective of maximizing the total utility. Extensive experiments on six benchmark datasets show that it significantly outperforms seven state-of-the-art approaches. On average, MultiBOS-AFL achieves 12.28% higher utility, 14.52% more data acquired through auctions for a given budget, and 1.23% higher test accuracy achieved by the resulting FL model compared to the best baseline. To the best of our knowledge, it is the first budget optimization decision support method with budget pacing capability designed for DCs in multi-session forward auction-based FL.

814What’s New in My Data? Novelty Exploration via Contrastive Generation

[openreview] [pdf]

Abstract Fine-tuning is widely used to adapt language models for specific goals, often leveraging real-world data such as patient records, customer-service interactions, or web content in languages not covered in pre-training. These datasets are typically massive, noisy, and often confidential, making their direct inspection challenging. However, understanding them is essential for guiding model deployment and informing decisions about data cleaning or suppressing any harmful behaviors learned during fine-tuning. In this study, we introduce the task of novelty discovery through generation, which aims to identify novel properties of a fine-tuning dataset by generating examples that illustrate these properties. Our approach - Contrastive Generative Exploration (CGE) - assumes no direct access to the data but instead relies on a pre-trained model and the same model after fine-tuning. By contrasting the predictions of these two models, CGE can generate examples that highlight novel characteristics of the fine-tuning data. However, this simple approach may produce examples that are too similar to one another, failing to capture the full range of novel phenomena present in the dataset. We address this by introducing an iterative version of CGE, where the previously generated examples are used to update the pre-trained model, and this updated model is then contrasted with the fully fine-tuned model to generate the next example, promoting diversity in the generated outputs. Our experiments demonstrate the effectiveness of CGE in detecting novel content, such as toxic language, as well as new natural and programming languages. Furthermore, we show that CGE remains effective even when models are fine-tuned using differential privacy techniques.
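
A minimal sketch of the contrastive decoding step behind CGE, assuming two Hugging Face-style causal LMs that expose `.logits`; the paper's exact sampling scheme and the iterative update of the pre-trained model are omitted.

```python
import torch

@torch.no_grad()
def cge_generate(ft_model, pre_model, ids, steps=50, temperature=1.0):
    """Sample tokens scored by how much more likely the fine-tuned model
    finds them than the pre-trained model does, surfacing content that is
    novel to the fine-tuning data."""
    for _ in range(steps):
        lp_ft = ft_model(ids).logits[:, -1].log_softmax(-1)
        lp_pre = pre_model(ids).logits[:, -1].log_softmax(-1)
        score = (lp_ft - lp_pre) / temperature   # contrastive novelty score
        nxt = torch.distributions.Categorical(logits=score).sample()
        ids = torch.cat([ids, nxt[:, None]], dim=-1)
    return ids
```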

815Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow

[openreview] [pdf]

Abstract Diffusion models have greatly improved visual generation but are hindered by slow generation speed due to the computationally intensive nature of solving generative ODEs. Rectified flow, a widely recognized solution, improves generation speed by straightening the ODE path. Its key components include: 1) using the diffusion form of flow-matching, 2) employing $\boldsymbol{v}$-prediction, and 3) performing rectification (a.k.a. reflow). In this paper, we argue that the success of rectification primarily lies in using a pretrained diffusion model to obtain matched pairs of noise and samples, followed by retraining with these matched noise-sample pairs. Based on this, components 1) and 2) are unnecessary. Furthermore, we highlight that straightness is not an essential training target for rectification; rather, it is a specific case of flow-matching models. The more critical training target is to achieve a first-order approximate ODE path, which is inherently curved for models like DDPM and Sub-VP. Building on this insight, we propose Rectified Diffusion, which generalizes the design space and application scope of rectification to encompass the broader category of diffusion models, rather than being restricted to flow-matching models. We validate our method on Stable Diffusion v1-5 and Stable Diffusion XL. Our method not only greatly simplifies the training procedure of previous rectified-flow-based works (e.g., InstaFlow) but also achieves superior performance at even lower training cost.

816Exploiting Hidden Symmetry to Improve Objective Perturbation for DP linear learners with a nonsmooth ℓ1-norm

[openreview] [pdf]

Abstract Objective Perturbation (OP) is a classic approach to differentially private (DP) convex optimization with smooth loss functions but is less understood for nonsmooth cases. In this work, we study how to apply OP to DP linear learners under loss functions with an implicit $\ell_1$-norm structure, such as $\max(0,x)$ as a motivating example. We propose to first smooth out the hidden $\ell_1$-norm by convolution, and then invoke standard OP. Convolution has many advantages that distinguish it from the Moreau envelope, such as approximating from above and offering more degrees of freedom in hyperparameters. These advantages, in conjunction with the symmetry of the $\ell_1$-norm, result in tighter pointwise approximation, which further facilitates tighter analysis of generalization risks by using pointwise bounds. Under mild assumptions on ground-truth distributions, the proposed OP-based algorithm is found to be rate-optimal, and can achieve the excess generalization risk $\mathcal{O}\big(\frac{1}{\sqrt{n}}+\frac{\sqrt{d\ln(1/\delta)}}{n\varepsilon}\big)$. Experiments demonstrate the competitive performance of the proposed method compared to Noisy-SGD.
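
A small numeric check of the convolution-smoothing construction on the motivating example: convolving $\max(0,x)$ with a uniform kernel on $[-\beta,\beta]$ yields a smooth approximation from above with a simple closed form. This illustrates only the construction; the paper's kernel choice and DP analysis are more involved.

```python
import numpy as np

beta = 0.5

def smoothed_relu(x, beta=beta):
    # Closed form of (max(0, .) * uniform kernel)(x):
    # 0 for x <= -beta, x for x >= beta, (x + beta)^2 / (4*beta) in between.
    return np.where(x >= beta, x,
           np.where(x <= -beta, 0.0, (x + beta) ** 2 / (4 * beta)))

x = np.linspace(-2, 2, 9)
exact = np.maximum(0.0, x)
# Monte-Carlo convolution: E_t[max(0, x - t)], t ~ Uniform(-beta, beta)
t = np.random.default_rng(0).uniform(-beta, beta, size=100_000)
mc = np.maximum(0.0, x[:, None] - t[None, :]).mean(axis=1)
print(np.max(np.abs(mc - smoothed_relu(x))))   # small discretization error
print(np.all(smoothed_relu(x) >= exact))       # approximates from above
```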

817Scaling Laws for Diffusion Transformers

[openreview] [pdf]

Abstract Diffusion transformers (DiT) have already achieved appealing synthesis and scaling properties in content recreation, e.g., image and video generation. However, scaling laws of DiT are less explored, which usually offer precise predictions regarding optimal model size and data requirements given a specific compute budget. Therefore, experiments across a broad range of compute budgets, from 1e17 to 6e18 FLOPs, are conducted to confirm the existence of scaling laws in DiT for the first time. Concretely, the loss of pretraining DiT also follows a power-law relationship with the involved compute. Based on the scaling law, we can not only determine the optimal model size and required data but also accurately predict the text-to-image generation loss given a model with 1B parameters and a compute budget of 1e21 FLOPs. Additionally, we also demonstrate that the trend of pretraining loss matches the generation performance (e.g., FID), even across various datasets, which complements the mapping from compute to synthesis quality and thus provides a predictable benchmark that assesses model performance and data quality at a reduced cost.
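
To illustrate the kind of fit involved, here is a sketch of fitting a power law $L(C)=a\,C^{-b}+c$ to loss/compute pairs with SciPy. The functional form follows the abstract; the loss values are made up for illustration and are not taken from the paper.

```python
import numpy as np
from scipy.optimize import curve_fit

def power_law(C, a, b, c):
    # Compute normalized to 1e17 FLOPs for numerical stability.
    return a * (C / 1e17) ** (-b) + c

C = np.array([1e17, 3e17, 1e18, 3e18, 6e18])
L = np.array([0.62, 0.55, 0.49, 0.45, 0.43])        # hypothetical losses

(a, b, c), _ = curve_fit(power_law, C, L, p0=(0.3, 0.3, 0.3))
print(f"L(C) = {a:.3g} * (C/1e17)^(-{b:.3g}) + {c:.3g}")
print("extrapolated loss at 1e21 FLOPs:", power_law(1e21, a, b, c))
```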

818Diffusion Transformers for Tabular Data Time Series Generation

[openreview] [pdf]

Abstract Tabular data generation has recently attracted growing interest due to its different application scenarios. However, generating time series of tabular data, where each element of the series depends on the others, remains a largely unexplored domain. This gap is probably due to the difficulty of jointly solving different problems, chief among them the heterogeneity of tabular data (a problem common to non-time-dependent approaches) and the variable length of a time series. In this paper, we propose a Diffusion Transformer (DiT)-based approach for tabular data series generation. Inspired by the recent success of DiTs in image and video generation, we extend this framework to deal with heterogeneous data and variable-length sequences. Using extensive experiments on six datasets, we show that the proposed approach outperforms previous work by a large margin. Our code will be made public after this article is accepted.

819Spatiotemporal Backward Inconsistency Learning Gives STGNNs Icing on the Cake

[openreview] [pdf]

Abstract Spatiotemporal prediction models facilitate smart-city applications across various domains, such as traffic and climate. While current advancements in these models emphasize leveraging cutting-edge technologies to enhance spatiotemporal learning, they often operate under the implicit assumption of spatiotemporal feature consistency between inputs and labels, overlooking the critical issue of input-label inconsistency. In this study, we introduce a universal spatiotemporal backward inconsistency learning module capable of seamless integration into a variety of models, offering a notable performance boost by explicitly modeling label features to address input-label inconsistency. Our approach includes the development of a spatiotemporal residual theory, advocating for holistic spatiotemporal learning that encompasses both forward spatiotemporal learning, which captures the input data’s spatiotemporal features to generate base predictions, akin to existing STNNs, and a backward process to learn residuals that rectify input-label inconsistency, thereby refining the base predictions. Based on this theory, we design the Spatio-Temporal Backward Inconsistency Learning Module (STBIM) for this backward correction process, comprising a residual learning module for decoupling inconsistency information from input representations and label representations, and a residual propagation module for smoothing residual terms to facilitate stable learning. The generated prediction correction term is used to enhance the prediction accuracy. Experimental results on 11 datasets from the traffic and atmospheric domains, combined with 15 spatiotemporal prediction models, demonstrate the broad positive impact of the proposed STBIM. The code is available at https://anonymous.4open.science/r/ICLR2025-2598.

820AutoRegressive Knowledge Base Completion

[openreview] [pdf]

Abstract Despite their large sizes, many Knowledge Graphs (KGs) remain highly incomplete. This problem has motivated numerous approaches to complete the KGs by embedding them in a latent space to find the missing links. Although these methods show promising performance, a general limitation is that the scores given to possible links are uncalibrated and cannot be interpreted across different queries. Hence, we say they are local, as they relate to a specific context. This limitation makes it non-trivial to deduce the truth value of the links and to answer complex queries. Another limitation is that their learning depends on negative sampling, which is challenging due to the Open World Assumption (OWA). To solve this problem, we propose a novel auto-regressive generative model that learns a joint distribution of the entities and relations of the KG without resorting to negative sampling. This distribution can be used to infer the probability that a link is sampled from the KG, which allows us to return a global score that is interpretable in different contexts. Moreover, our method has the additional advantage that it offers probabilistic semantics for complex reasoning and knowledge base completion, achieving state-of-the-art performance on link prediction with consistent scores across the entire KG.
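
A minimal sketch of the autoregressive factorization over triples, $p(h,r,t)=p(h)\,p(r\mid h)\,p(t\mid h,r)$, with a tiny GRU standing in for the paper's architecture; training would maximize this joint over observed triples only (no negative sampling), so the resulting scores are globally comparable.

```python
import torch

n_entities, n_relations, dim = 100, 10, 32
vocab = n_entities + n_relations                 # entities and relations share a vocab
emb = torch.nn.Embedding(vocab, dim)
rnn = torch.nn.GRU(dim, dim, batch_first=True)
head = torch.nn.Linear(dim, vocab)
h0_logits = torch.nn.Parameter(torch.zeros(vocab))   # prior over the first token

def triple_log_prob(h, r, t):
    seq = torch.tensor([[h, n_entities + r, t]])
    out, _ = rnn(emb(seq))
    logp = h0_logits.log_softmax(-1)[seq[0, 0]]          # log p(h)
    logp += head(out[0, 0]).log_softmax(-1)[seq[0, 1]]   # log p(r | h)
    logp += head(out[0, 1]).log_softmax(-1)[seq[0, 2]]   # log p(t | h, r)
    return logp

print(triple_log_prob(3, 2, 57))
```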

821Text-to-Model: Text-Conditioned Neural Network Diffusion for Train-Once-for-All Personalization

[openreview] [pdf]

Abstract Generative artificial intelligence (GenAI) has made significant progress in understanding world knowledge and generating content from human languages across various modalities, like text-to-text large language models, text-to-image stable diffusion, and text-to-video Sora. In this paper, we investigate the capability of GenAI for text-to-model generation, to see whether GenAI can comprehend hyper-level knowledge embedded within AI models’ own parameters. Specifically, we study a practical scenario termed train-once-for-all personalization, aiming to generate personalized models for diverse end-users and tasks using text prompts. Inspired by the recent emergence of neural network diffusion, we present Tina, a text-conditioned neural network diffusion for train-once-for-all personalization. Tina leverages a diffusion transformer model conditioned on task descriptions embedded using a CLIP model. Despite the astronomical number of potential personalized tasks (e.g., $1.73\times10^{13}$), by our design, Tina demonstrates remarkable in-distribution and out-of-distribution generalization even when trained on small datasets ($\sim 1000$ samples). We further verify whether and how Tina understands world knowledge by analyzing its capabilities under zero-shot/few-shot image prompts, different numbers of personalized classes, prompts of natural language descriptions, and predicting unseen entities.

822State Space Models are Provably Comparable to Transformers in Dynamic Token Selection

[openreview] [pdf]

Abstract Deep neural networks based on state space models (SSMs) are attracting significant attention in sequence modeling since their computational cost is significantly smaller than that of Transformers. While the capabilities of SSMs have been demonstrated through experiments in various tasks, theoretical understanding of SSMs is still limited. In particular, most theoretical studies discuss the capabilities of SSM layers without nonlinear layers, and there is a lack of discussion on their combination with nonlinear layers. In this paper, we explore the capabilities of SSMs combined with fully connected neural networks, and show that they are comparable to Transformers in extracting the essential tokens depending on the input. As concrete examples, we consider two synthetic tasks, which are challenging for a single SSM layer, and demonstrate that SSMs combined with nonlinear layers can efficiently solve these tasks. Furthermore, we study the nonparametric regression task, and prove that the ability of SSMs is equivalent to that of Transformers in estimating functions belonging to a certain class.

823Improving Diffusion-based Data Augmentation with Inversion Circle Interpolation

[openreview] [pdf]

Abstract Data Augmentation (DA), i.e., synthesizing faithful and diverse samples to expand the original training set, is a prevalent and effective strategy to improve various visual recognition tasks. With the powerful image generation ability, diffusion-based DA has shown strong performance gains on different benchmarks. In this paper, we analyze today’s diffusion-based DA methods, and argue that they cannot take into account both faithfulness and diversity, which are two critical keys for generating high-quality samples and boosting final classification performance. To this end, we propose a novel Diffusion-based Inversion Interpolation DA method: Diff-II. Specifically, Diff-II consists of three main steps: 1) Category concepts learning: Learning concept embeddings for each category. 2) Inversion interpolation: Calculating the inversion for each image, and conducting random circle interpolation for two randomly sampled inversions from the same category. 3) Two-stage denoising: Using different prompts to generate synthesized images in a coarse-to-fine manner. Extensive experiments on multiple image classification tasks (e.g., few-shot, long-tailed, and out-of-distribution classification) have demonstrated its effectiveness over state-of-the-art diffusion-based DA methods.

824Single Teacher, Multiple Perspectives: Teacher Knowledge Augmentation for Enhanced Knowledge Distillation

[openreview] [pdf]

Abstract Do diverse perspectives help students learn better? Multi-teacher knowledge distillation, which is a more effective technique than traditional single-teacher methods, supervises the student from different perspectives (i.e., teachers). While effective, multi-teacher, teacher-ensemble, or teaching-assistant-based approaches are computationally expensive and resource-intensive, as they require training multiple teacher networks. These concerns raise a question: can we supervise the student with diverse perspectives using only a single teacher? We pioneer TeKAP (Teacher Knowledge Augmentation via Perturbation), a novel technique that generates multiple synthetic teacher knowledge signals by perturbing the knowledge of a single pretrained teacher at both the feature and logit levels. Together, these augmented teachers simulate an ensemble of models. The student model is trained on both the actual and augmented teacher knowledge, benefiting from the diversity of an ensemble without the need to train multiple teachers. TeKAP significantly reduces training time and computational resources, making it feasible for large-scale applications and easy to manage. Experimental results demonstrate that our proposed method helps existing state-of-the-art knowledge distillation techniques achieve better performance, highlighting its potential as a cost-effective alternative. The source code can be found in the supplementary.
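
A minimal sketch of the logit-level variant: synthetic teachers are created by Gaussian perturbation of one teacher's logits, and a standard temperature-scaled KD loss is averaged over the real and perturbed views. The noise scale here is an illustrative assumption, and the feature-level counterpart is omitted.

```python
import torch
import torch.nn.functional as F

def tekap_kd_loss(student_logits, teacher_logits, n_aug=3, sigma=0.5, T=4.0):
    # Real teacher plus n_aug perturbed "synthetic teachers".
    views = [teacher_logits] + [
        teacher_logits + sigma * torch.randn_like(teacher_logits)
        for _ in range(n_aug)
    ]
    loss = 0.0
    for v in views:
        loss += F.kl_div(
            F.log_softmax(student_logits / T, dim=-1),
            F.softmax(v / T, dim=-1),
            reduction="batchmean",
        ) * (T * T)
    return loss / len(views)

s, t = torch.randn(8, 10, requires_grad=True), torch.randn(8, 10)
tekap_kd_loss(s, t).backward()
```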

825Reset Method based on the Theory of Manifold Optimization on Real Manifolds

[openreview] [pdf]

Abstract Manifold optimization is prominent in the fields of applied mathematics, statistics, machine learning, and, in particular, deep learning. By leveraging the intrinsic geometric properties of manifolds, constrained optimization problems can be transformed into unconstrained optimization problems on certain manifolds. We introduce an innovative method, the Reset Method, which combines manifold optimization with standard optimizers (SGD, Adam, and AdamW), aiming to improve precision. The efficacy of our proposed method is corroborated by extensive deep learning experiments, which yield visibly higher precision.

826Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapolation

[openreview] [pdf]

Abstract Out-of-distribution (OOD) generalization is a favorable yet challenging property for deep neural networks. The core challenges lie in the limited availability of source domains that help models learn an invariant representation from the spurious features. Various domain augmentation methods have been proposed, but they largely rely on interpolating existing domains and frequently face difficulties in creating truly “novel” domains. Humans, on the other hand, can easily extrapolate novel domains; thus, an intriguing question arises: How can neural networks extrapolate like humans and achieve OOD generalization? We introduce a novel approach to domain extrapolation that leverages reasoning ability and the extensive knowledge encapsulated within large language models (LLMs) to synthesize entirely new domains. Starting with the class of interest, we query the LLMs to extract relevant knowledge for these novel domains. We then bridge the gap between the text-centric knowledge derived from LLMs and the pixel input space of the model using text-to-image generation techniques. By augmenting the training set of domain generalization datasets with high-fidelity, photo-realistic images of these new domains, we achieve significant improvements over all existing methods, as demonstrated in both single and multi-domain generalization across various benchmarks. With the ability to extrapolate any domain for any class, our method has the potential to learn a generalized model for any task without any data. To illustrate, we put forth a much more difficult setting, termed data-free domain generalization, that aims to learn a generalized model in the absence of any collected data. Our empirical findings support the above argument, and our method exhibits commendable performance in this setting, even surpassing the supervised setting by approximately 1-2% on datasets such as VLCS.

827Direct Advantage Estimation in Partially Observable Environments

[openreview] [pdf]

Abstract Direct Advantage Estimation (DAE) was recently shown to improve the sample-efficiency of deep reinforcement learning algorithms. However, DAE assumes full observability of the environment, which may be restrictive in realistic settings. In the present work, we first show that DAE can be extended to partially observable domains with minor modifications. Secondly, we address the increased computational cost due to the need to approximate the transition probabilities through the use of discrete latent dynamics models. Finally, we empirically evaluate the proposed method using the Arcade Learning Environment, and show that it is scalable and sample-efficient.

828Client2Vec: Improving Federated Learning by Distribution Shifts Aware Client Indexing

[openreview] [pdf]

Abstract Federated Learning (FL) is a privacy-preserving distributed machine learning paradigm. Nonetheless, the substantial distribution shifts among clients pose a considerable challenge to the performance of current FL algorithms. To mitigate this challenge, various methods have been proposed to enhance the FL training process. This paper endeavors to tackle the issue of data heterogeneity from another perspective---by improving FL algorithms prior to the actual training stage. Specifically, we introduce the Client2Vec mechanism, which generates a unique client index for each client before the commencement of FL training. Subsequently, we leverage the generated client index to enhance the subsequent FL training process. To demonstrate the effectiveness of the proposed Client2Vec method, we conduct three case studies that assess the impact of the client index on the FL training process. These case studies encompass enhanced client sampling, model aggregation, and local training. Extensive experiments conducted on diverse datasets and model architectures show the efficacy of Client2Vec across all three case studies. Our code will be publicly available.

829Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds

[openreview] [pdf]

Abstract Learning a transition model via Maximum Likelihood Estimation (MLE) followed by planning inside the learned model is perhaps the most standard and simplest Model-based Reinforcement Learning (RL) framework. In this work, we show that such a simple Model-based RL scheme, when equipped with optimistic and pessimistic planning procedures, achieves strong regret and sample complexity bounds in online and offline RL settings. Particularly, we demonstrate that under the conditions where the trajectory-wise reward is normalized between zero and one and the transition is time-homogeneous, it achieves nearly horizon-free and second-order bounds. Nearly horizon-free means that our bounds have no polynomial dependence on the horizon of the Markov Decision Process. A second-order bound is a type of instance-dependent bound that scales with respect to the variances of the returns of the policies which can be small when the system is nearly deterministic and (or) the optimal policy has small values. We highlight that our algorithms are simple, fairly standard, and indeed have been extensively studied in the RL literature: they learn a model via MLE, build a version space around the MLE solution, and perform optimistic or pessimistic planning depending on whether operating in the online or offline mode. These algorithms do not rely on additional specialized algorithmic designs such as learning variances and performing variance-weighted learning and thus can easily leverage non-linear function approximations. The simplicity of the algorithms also implies that our horizon-free and second-order regret analysis is actually standard and mainly follows the general framework of optimism/pessimism in the face of uncertainty.

830A Hypothesis on Black Swan in Unchanging Environments

[openreview] [pdf]

Abstract Black swan events are statistically rare occurrences that carry extremely high risks. The typical view holds that black swan events originate from unpredictable, time-varying environments; however, the community lacks a comprehensive definition of black swan events. To this end, this paper argues that the standard view is incomplete and claims that high-risk, statistically rare events can also occur in unchanging environments due to human misperception of their value and likelihood, which we call spatial black swan events. We first carefully categorize black swan events, focusing on spatial black swan events, and mathematically formalize the definition of black swan events. We hope these definitions can pave the way for the development of algorithms to prevent such events by rationally correcting human perception.

831Exploratory Preference Optimization: Provably Sample-Efficient Exploration in RLHF with General Function Approximation

[openreview] [pdf]

Abstract This paper investigates a basic question in reinforcement learning from human feedback (RLHF) from a theoretical perspective: how to efficiently explore in an online manner under preference feedback and general function approximation. We take the initial step towards a theoretical understanding of this problem by proposing a novel algorithm, Exploratory Preference Optimization (XPO). This algorithm is elegantly simple---requiring only a one-line modification to (online) Direct Preference Optimization (DPO; Rafailov et al., 2023)---yet provides the strongest known provable guarantees. XPO augments the DPO objective with a novel and principled exploration bonus, enabling the algorithm to strategically explore beyond the support of the initial model and preference feedback data. We prove that XPO is provably sample-efficient and converges to a near-optimal policy under natural exploration conditions, regardless of the initial model’s coverage. Our analysis builds on the observation that DPO implicitly performs a form of Bellman error minimization. It synthesizes previously disparate techniques from language modeling and theoretical reinforcement learning in a serendipitous fashion through the lens of KL-regularized Markov decision processes.
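
A schematic of the objective shape: the standard DPO loss plus an optimism bonus on exploratory responses. The DPO term below is standard; the bonus is a simplified stand-in for the paper's exact one-line modification, with hypothetical coefficient names.

```python
import torch
import torch.nn.functional as F

def xpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, logp_expl,
             beta=0.1, alpha=1e-3):
    # logp_* are summed log-probs of chosen (w) / rejected (l) responses
    # under the policy; ref_logp_* under the reference model; logp_expl is
    # the policy log-prob of a freshly sampled exploratory response.
    margin = beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l))
    dpo = -F.logsigmoid(margin).mean()           # standard DPO term
    bonus = alpha * logp_expl.mean()             # optimism toward novel responses
    return dpo - bonus

lw, ll = torch.randn(4, requires_grad=True), torch.randn(4)
print(xpo_loss(lw, ll, torch.randn(4), torch.randn(4), torch.randn(4)))
```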

832Transformers Learn Bayesian Networks Autoregressively In-Context

[openreview] [pdf]

Abstract Transformers have achieved tremendous successes in various fields, notably excelling in tasks involving sequential data like natural language processing. Despite their achievements, there is limited understanding of the theoretical capabilities of transformers. In this paper, we theoretically investigate the capability of transformers to autoregressively learn Bayesian networks in-context. Specifically, we consider a setting where a set of independent samples generated from a Bayesian network are observed and form a context. We show that there exists a simple transformer model that can (i) estimate the conditional probabilities of the Bayesian network according to the context, and (ii) autoregressively generate a new sample according to the Bayesian network with estimated conditional probabilities. We further demonstrate in extensive experiments that such a transformer does not only exist in theory, but can also be effectively obtained through training. Our analysis showcases the potential of transformers to effectively learn complicated probabilistic models, and contributes to a better understanding of the success of large language models.

833Evaluating Ranking Loss Functions in Performance Predictor for NAS

[openreview] [pdf]

Abstract Performance evaluation is a critical but compute-intensive procedure in neural architecture search (NAS). To alleviate evaluation costs, performance predictors have been widely adopted to predict architecture performance directly. Recent studies have introduced ranking loss functions into predictors to focus on architecture rankings instead of absolute accuracy, thus enhancing the ranking ability of performance predictors. Despite the successful application of ranking loss functions, the lack of comprehensive evaluation metrics and differing experimental configurations make a fair comparison among these loss functions a huge challenge. Additionally, some well-known ranking loss functions have not been thoroughly examined in the context of performance predictors. In this paper, we conduct the first study of 11 ranking loss functions, covering both existing and novel ones, by comparing their effectiveness in performance predictors under various settings. We find that: (i) the choice of ranking loss function has a major influence on the performance of predictors; (ii) the quality of the architectures searched by predictor-based NAS methods is closely correlated with the predictor’s performance on top-centered rank metrics, rather than traditional metrics like Kendall Tau. We believe these results and insights can serve as recommendations for the optimal loss function to employ in predictors across various search spaces and experimental conditions.

834Distributional Associations vs In-Context Reasoning: A Study of Feed-forward and Attention Layers

[openreview] [pdf]

Abstract Large language models have been successful at tasks involving basic forms of in-context reasoning, such as generating coherent language, as well as storing vast amounts of knowledge. At the core of the Transformer architecture behind such models are feed-forward and attention layers, which are often associated with knowledge and reasoning, respectively. In this paper, we study this distinction empirically and theoretically in a controlled synthetic setting where certain next-token predictions involve both distributional and in-context information. We find that feed-forward layers tend to learn simple distributional associations such as bigrams, while attention layers focus on in-context reasoning. Our theoretical analysis identifies gradient noise as a key factor behind this discrepancy. Finally, we illustrate how similar disparities emerge in pre-trained models through ablations on the Pythia model family on simple reasoning tasks.

835Learning Randomized Algorithms with Transformers

[openreview] [pdf]

Abstract Randomization is a powerful tool that endows algorithms with remarkable properties. For instance, randomized algorithms excel in adversarial settings, often surpassing the worst-case performance of deterministic algorithms with large margins. Furthermore, their success probability can be amplified by simple strategies such as repetition and majority voting. In this paper, we enhance deep neural networks, in particular transformer models, with randomization. We demonstrate for the first time that randomized algorithms can be instilled in transformers through learning, in a purely data- and objective-driven manner. First, we analyze known adversarial objectives for which randomized algorithms offer a distinct advantage over deterministic ones. We then show that common optimization techniques, such as gradient descent or evolutionary strategies, can effectively learn transformer parameters that make use of the randomness provided to the model. To illustrate the broad applicability of randomization in empowering neural networks, we study three conceptual tasks: associative recall, graph coloring, and agents that explore grid worlds. In addition to demonstrating increased robustness against oblivious adversaries through learned randomization, our experiments reveal remarkable performance improvements due to the inherently random nature of the neural networks’ computation and predictions.

836Overcoming label shift in targeted federated learning

[openreview] [pdf]

Abstract Federated learning enables multiple actors to collaboratively train models without sharing private data. This unlocks the potential for scaling machine learning to diverse applications. Existing algorithms for this task are well-justified when clients and the intended target domain share the same distribution of features and labels, but this assumption is often violated in real-world scenarios. One common violation is label shift, where the label distributions differ across clients or between clients and the target domain, which can significantly degrade model performance. To address this problem, we propose FedPALS, a novel model aggregation scheme that adapts to label shifts by leveraging knowledge of the target label distribution at the central server. Our approach ensures unbiased updates under stochastic gradient descent, ensuring robust generalization across clients with diverse, label-shifted data. Extensive experiments on image classification demonstrate that FedPALS consistently outperforms standard baselines by aligning model aggregation with the target domain. Our findings reveal that conventional federated learning methods suffer severely in cases of extreme client sparsity, highlighting the critical need for target-aware aggregation. FedPALS offers a principled and practical solution to mitigate label distribution mismatch, ensuring models trained in federated settings can generalize effectively to label-shifted target domains.
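
One way to realize target-aware aggregation, sketched under the assumption that the server knows the client and target label distributions: fit simplex weights so the mixture of client label distributions matches the target, then aggregate client updates with those weights. FedPALS involves further considerations beyond this plain projected-gradient fit.

```python
import numpy as np

def project_simplex(v):
    """Euclidean projection onto the probability simplex."""
    u = np.sort(v)[::-1]
    css = np.cumsum(u) - 1
    rho = np.nonzero(u - css / (np.arange(len(v)) + 1) > 0)[0][-1]
    return np.maximum(v - css[rho] / (rho + 1), 0)

def label_shift_weights(client_label_dists, target_dist, iters=500, lr=0.1):
    P = np.asarray(client_label_dists)          # (n_clients, n_classes)
    lam = np.full(len(P), 1.0 / len(P))
    for _ in range(iters):
        grad = 2 * P @ (lam @ P - target_dist)  # d/d(lam) ||P^T lam - t||^2
        lam = project_simplex(lam - lr * grad)
    return lam

P = [[0.8, 0.2], [0.1, 0.9], [0.5, 0.5]]
lam = label_shift_weights(P, np.array([0.3, 0.7]))
print(lam, lam @ np.asarray(P))                 # mixture approx equals target
# Server aggregate: sum_i lam[i] * client_update[i]
```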

837Differentiable Reasoning about Knowledge Graphs with Reshuffled Embeddings

[openreview] [pdf]

Abstract Knowledge graph (KG) embedding methods learn geometric representations of entities and relations to predict plausible missing knowledge. These representations are typically assumed to capture rule-like inference patterns. However, our theoretical understanding of the kinds of inference patterns that can be captured in this way remains limited. Ideally, KG embedding methods should be expressive enough such that for any set of rules, there exists an embedding that exactly captures these rules. This principle has been studied within the framework of region-based embeddings, but existing models are severely limited in the kinds of rule bases that can be captured. We argue that this stems from the use of representations that correspond to the Cartesian product of two-dimensional regions. As an alternative, we propose RESHUFFLE, a simple model based on ordering constraints that can faithfully capture a much larger class of rule bases than existing approaches. Moreover, the embeddings in our framework can be learned by a Graph Neural Network (GNN), which effectively acts as a differentiable rule base. This has some practical advantages, e.g. ensuring that embeddings can be easily updated as new knowledge is added to the KG. At the same time, since the resulting representations can be used similarly to standard KG embeddings, our approach is significantly more efficient than existing approaches to differentiable reasoning. The GNN-based formulation also allows us to study how bounded inference can be captured. We show in particular that bounded reasoning with arbitrary sets of closed path rules can be captured in this way.

838Beyond Auto-Regression: Fast LLMs via Self-Distillation Through Time

[openreview] [pdf]

Abstract Autoregressive (AR) Large Language Models (LLMs) have demonstrated significant success across numerous tasks. However, the AR modeling paradigm presents certain limitations; for instance, contemporary autoregressive LLMs are trained to generate one token at a time, which can result in noticeable latency. Recent advancements have indicated that search and repeated sampling can enhance performance in various applications, such as theorem proving, code generation, and alignment, by utilizing greater computational resources during inference. In this study, we demonstrate that diffusion language models are capable of generating at least 32 tokens simultaneously, while exceeding the performance of AR models in text quality and on the LAMBADA natural language understanding benchmark. This outcome is achieved through a novel distillation method for discrete diffusion models, which reduces the number of inference steps by a factor of 32-64. Practically, our models, even without caching, can generate tokens at a rate that is up to 8 times faster than AR models employing KV-caching, and we anticipate further improvements with the inclusion of caching. Moreover, we demonstrate the efficacy of our approach for diffusion language models with up to 860M parameters.

839Scalable Decentralized Learning with Teleportation

[openreview] [pdf]

Abstract Decentralized SGD can run with low communication costs, but its sparse communication characteristics deteriorate the convergence rate, especially when the number of nodes is large. In decentralized learning settings, communication is assumed to occur on only a given topology, while in many practical cases, the topology merely represents a preferred communication pattern, and connecting to arbitrary nodes is still possible. Previous studies have tried to alleviate the convergence rate degradation in these cases by designing topologies with large spectral gaps. However, the degradation is still significant when the number of nodes is substantial. In this work, we propose TELEPORTATION. TELEPORTATION activates only a subset of nodes, and the active nodes fetch the parameters from previous active nodes. Then, the active nodes update their parameters by SGD and perform gossip averaging on a relatively small topology comprising only the active nodes. We show that by activating only a proper number of nodes, TELEPORTATION can completely alleviate the convergence rate degradation. Furthermore, we propose an efficient hyperparameter-tuning method to search for the appropriate number of nodes to be activated. Experimentally, we show that TELEPORTATION can train neural networks more stably and achieve higher accuracy than Decentralized SGD.

840Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning

[openreview] [pdf]

Abstract Meta-reinforcement learning requires utilizing prior task distribution information obtained during exploration to rapidly adapt to unknown tasks. The efficiency of an agent’s exploration hinges on accurately identifying the current task. Recent Bayes-Adaptive Deep RL approaches often rely on reconstructing the environment’s reward signal, which is challenging in sparse reward settings, leading to suboptimal exploitation. Inspired by bisimulation metrics, which robustly extract behavioral similarity in continuous MDPs, we propose SimBelief, a novel meta-RL framework that measures task belief similarity in the Bayes-Adaptive MDP (BAMDP). SimBelief effectively extracts common features of similar task distributions, enabling efficient task identification and exploration in sparse reward environments. We introduce a latent task belief metric to learn the common structure of similar tasks and incorporate it into the real task belief. By learning the latent dynamics across task distributions, we connect shared latent task belief features with specific task features, facilitating rapid task identification and adaptation. Our method outperforms state-of-the-art baselines on sparse reward MuJoCo and panda-gym tasks.

841Leveraging Additional Information in POMDPs with Guided Policy Optimization

[openreview] [pdf]

Abstract Reinforcement Learning (RL) in partially observable environments poses significant challenges due to the complexity of learning under uncertainty. While additional information, such as that available in simulations, can enhance training, effectively leveraging it remains an open problem. To address this, we introduce Guided Policy Optimization (GPO), a framework that co-trains a guider and a learner. The guider takes advantage of supplementary information while ensuring alignment with the learner’s policy, which is primarily trained via Imitation Learning (IL). We theoretically demonstrate that this learning scheme achieves optimality comparable to direct RL, thereby overcoming key limitations inherent in IL approaches. Our approach includes two practical variants, GPO-penalty and GPO-clip, and empirical evaluations show strong performance across various tasks, including continuous control with partial observability and noise, and memory-based challenges, significantly outperforming existing methods.

842Causal-aware Graph Neural Architecture Search under Distribution Shifts

[openreview] [pdf]

Abstract Graph neural architecture search (Graph NAS) has emerged as a promising approach for autonomously designing graph neural network architectures by leveraging the correlations between graphs and architectures. However, the existing methods fail to generalize under distribution shifts that are ubiquitous in real-world graph scenarios, mainly because the graph-architecture correlations they exploit might be spurious and varying across distributions. In this paper, we propose to handle the distribution shifts in the graph architecture search process by discovering and exploiting the causal relationship between graphs and architectures to search for the optimal architectures that can generalize under distribution shifts. The problem remains unexplored with the following critical challenges: 1) how to discover the causal graph-architecture relationship that has stable predictive abilities across distributions, 2) how to handle distribution shifts with the discovered causal graph-architecture relationship to search the generalized graph architectures. To address these challenges, we propose a novel approach, Causal-aware Graph Neural Architecture Search (CARNAS), which is able to capture the causal graph-architecture relationship during the architecture search process and discover the generalized graph architecture under distribution shifts. Specifically, we propose Disentangled Causal Subgraph Identification to capture the causal subgraphs that have stable prediction abilities across distributions. Then, we propose Graph Embedding Intervention to intervene on causal subgraphs within the latent space, ensuring that these subgraphs encapsulate essential features for prediction while excluding non-causal elements. Additionally, we propose Invariant Architecture Customization to reinforce the causal invariant nature of the causal subgraphs, which are utilized to tailor generalized graph architectures. Extensive experiments on synthetic and real-world datasets demonstrate that our proposed CARNAS achieves advanced out-of-distribution generalization ability by discovering the causal relationship between graphs and architectures during the search process.

843Rapidly Adapting Policies to the Real-World via Simulation-Guided Fine-Tuning

[openreview] [pdf]

Abstract Robot learning requires a considerable amount of data to realize the promise of generalization. However, it can be challenging to actually collect the magnitude of high-quality data necessary for generalization entirely in the real world. Simulation can serve as a source of plentiful data, wherein techniques such as reinforcement learning can obtain broad coverage over states and actions. However, high-fidelity physics simulators are fundamentally misspecified approximations to reality, making direct zero-shot transfer challenging, especially in tasks where precise and forceful manipulation is necessary. This makes real-world fine-tuning of policies pretrained in simulation an attractive approach to robot learning. However, exploring the real-world dynamics with standard RL fine-tuning techniques is too inefficient for many real-world applications. This paper introduces Simulation-Guided Fine-Tuning, a general framework which leverages the structure of the simulator to guide exploration, substantially accelerating adaptation to the real world. We demonstrate our approach across several manipulation tasks in the real world, learning successful policies for problems that are challenging to learn using purely real-world data. We further provide theoretical backing for the paradigm.

844HelpSteer2-Preference: Complementing Ratings with Preferences

[openreview] [pdf]

Abstract Reward models are critical for aligning models to follow instructions, and are typically trained following one of two popular paradigms: Bradley-Terry style or Regression style. However, there is a lack of evidence that either approach is better than the other, when adequately matched for data. This is primarily because these approaches require data collected in different (but incompatible) formats, meaning that adequately matched data is not available in existing public datasets. To tackle this problem, we release preference annotations (designed for Bradley-Terry training) to complement existing ratings (designed for Regression style training) in the HelpSteer2 dataset. To improve data interpretability, preference annotations are accompanied with human-written justifications. Using this data, we conduct the first head-to-head comparison of Bradley-Terry and Regression models when adequately matched for data. Based on insights derived from such a comparison, we propose a novel approach to combine Bradley-Terry and Regression reward modeling. A Llama-3.1-70B-Instruct model tuned with this approach scores 94.1 on RewardBench, emerging top of more than 140 reward models as of 1 Oct 2024. We also demonstrate the effectiveness of this reward model at aligning models to follow instructions in RLHF. We open-source this dataset (CC-BY-4.0 license) and openly release the trained Reward Model.
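
For concreteness, the two paradigms being compared reduce to two losses over scalar rewards from the same backbone: Bradley-Terry trains on preference pairs, regression trains on absolute helpfulness ratings. A minimal sketch (the paper's combined approach is not reproduced here):

```python
import torch
import torch.nn.functional as F

def bradley_terry_loss(r_chosen, r_rejected):
    # Maximize the log-likelihood that the chosen response wins the pair.
    return -F.logsigmoid(r_chosen - r_rejected).mean()

def regression_loss(r_pred, rating):
    # Fit the scalar reward directly to a human rating.
    return F.mse_loss(r_pred, rating)

rc, rr = torch.randn(8, requires_grad=True), torch.randn(8)
print(bradley_terry_loss(rc, rr))
print(regression_loss(rc, torch.rand(8) * 4))   # e.g., a 0-4 helpfulness scale
```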

845DiNO-Diffusion: Scaling Medical Diffusion Models via Self-Supervised Pre-Training

[openreview] [pdf]

Abstract Diffusion models (DMs) require large annotated datasets for training, limiting their applicability in medical imaging where datasets are typically smaller and sparsely annotated. We introduce DiNO-Diffusion, a self-supervised method for training DMs that conditions the generation process on image embeddings extracted from DiNO, a pretrained vision transformer. By not relying on annotations, our training leverages over 868k unlabelled images from public chest X-Ray (CXR) datasets. DiNO-Diffusion shows comprehensive manifold coverage, with FID scores as low as 4.7, and emerging properties when evaluated in downstream tasks, allowing to generate semantically-diverse synthetic datasets even from small data pools, demonstrating up to 20% AUC increase in classification performance when used for data augmentation. Results suggest that DiNO-Diffusion could facilitate the creation of large datasets for flexible training of downstream AI models from limited amount of real data, while also holding potential for privacy preservation. Additionally, DiNO-Diffusion demonstrates zero-shot segmentation performance of up to 84.4% Dice score when evaluating lung lobe segmentation, evidencing good CXR image-anatomy alignment akin to textual descriptors on vanilla DMs. Finally, DiNO-Diffusion can be easily adapted to other medical imaging modalities or state-of-the-art diffusion models, allowing large-scale, multi-domain image generation pipelines for medical imaging.

846Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost

[openreview] [pdf]

Abstract In this paper, we consider model-free federated reinforcement learning for tabular episodic Markov decision processes. Under the coordination of a central server, multiple agents collaboratively explore the environment and learn an optimal policy without sharing their raw data. Despite recent advances in federated $Q$-learning algorithms achieving near-linear regret speedup with low communication cost, existing algorithms only attain suboptimal regrets compared to the information bound. We propose a novel model-free federated $Q$-learning algorithm, termed FedQ-Advantage. Our algorithm leverages reference-advantage decomposition for variance reduction and adopts three novel designs: separate event-triggered communication and policy switching, heterogeneous communication triggering conditions, and optional forced synchronization. We prove that our algorithm not only requires a lower logarithmic communication cost but also achieves an almost optimal regret, reaching the information bound up to a logarithmic factor and near-linear regret speedup compared to its single-agent counterpart when the time horizon is sufficiently large.

847Learn With Imagination: Safe Set Guided State-wise Constrained Policy Optimization

[openreview] [pdf]

Abstract Deep reinforcement learning (RL) excels in various control tasks, yet the absence of safety guarantees hampers its real-world applicability. In particular, explorations during learning usually result in safety violations, while the RL agent learns from those mistakes. On the other hand, safe control techniques ensure persistent safety satisfaction but demand strong priors on system dynamics, which is usually hard to obtain in practice. To address these problems, we present Safe Set Guided State-wise Constrained Policy Optimization (S-3PO), a pioneering algorithm generating state-wise safe optimal policies with zero training violations, i.e., learning without mistakes. S-3PO first employs a safety-oriented monitor with black-box dynamics to ensure safe exploration. It then enforces a unique cost for the RL agent to converge to optimal behaviors within safety constraints. S-3PO outperforms existing methods in high-dimensional robotics tasks, managing state-wise constraints with zero training violation. This innovation marks a significant stride towards real-world safe RL deployment.

848Universal generalization guarantees for Wasserstein distributionally robust models

[openreview] [pdf]

Abstract Distributionally robust optimization has emerged as an attractive way to train robust machine learning models, capturing data uncertainty and distribution shifts. Recent statistical analyses have proved that robust models based on the Wasserstein distance have generalization guarantees that do not suffer from the curse of dimensionality. However, these results are either approximate, obtained in specific cases, or based on assumptions difficult to verify in practice. In contrast, we establish exact generalization guarantees that cover a wide range of cases, with arbitrary transport costs and parametric loss functions, including deep learning objectives with nonsmooth activations. We complete our analysis with an excess bound on the robust objective and an extension to Wasserstein robust models with entropic regularizations.

849Bayesian Binary Search

[openreview] [pdf]

Abstract We present Bayesian Binary Search (BBS), a novel probabilistic variant of the classical binary search/bisection algorithm. BBS leverages machine learning/statistical techniques to estimate the probability density of the search space and modifies the bisection step to split based on probability density rather than the traditional midpoint, allowing for the learned distribution of the search space to guide the search algorithm. Search space density estimation can flexibly be performed using supervised probabilistic machine learning techniques (e.g., Gaussian process regression, Bayesian neural networks, quantile regression) or unsupervised learning algorithms (e.g., Gaussian mixture models, kernel density estimation (KDE), maximum likelihood estimation (MLE)). We demonstrate significant efficiency gains of using BBS on both simulated data across a variety of distributions and in a real-world binary search use case of probing channel balances in the Bitcoin Lightning Network, for which we have deployed the BBS algorithm in a production setting.
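
A minimal sketch of the core BBS step on a sorted array with a prior over the target's location: split at the median of the remaining probability mass (an equal-mass split) instead of the index midpoint. The Gaussian prior below is a hypothetical stand-in for whatever density estimator (GP regression, KDE, GMM, etc.) supplies the distribution.

```python
import numpy as np

def bayesian_binary_search(arr, target, density):
    """Binary search on a sorted `arr`, splitting at the probability midpoint
    of `density` over the remaining interval rather than the index midpoint."""
    lo, hi = 0, len(arr) - 1
    p = np.asarray(density, dtype=float)
    while lo <= hi:
        w = p[lo:hi + 1]
        cdf = np.cumsum(w / w.sum())
        mid = lo + int(np.searchsorted(cdf, 0.5))   # equal-mass split point
        if arr[mid] == target:
            return mid
        if arr[mid] < target:
            lo = mid + 1
        else:
            hi = mid - 1
    return -1

arr = np.arange(0, 1000, 7)
prior = np.exp(-0.5 * ((np.arange(len(arr)) - 40) / 10.0) ** 2)  # stand-in density
print(bayesian_binary_search(arr, arr[42], prior))
```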

850Temporal Source Recovery for Time-Series Source-Free Unsupervised Domain Adaptation

[openreview] [pdf]

Abstract Source-Free Unsupervised Domain Adaptation (SFUDA) has gained popularity for its ability to adapt pretrained models to target domains without accessing source domains, ensuring source data privacy. While SFUDA is well-developed in visual tasks, its application to Time-Series SFUDA (TS-SFUDA) remains limited due to the challenge of transferring crucial temporal dependencies across domains. Although a few researchers have begun to explore this area, they rely on specific source domain designs, which are impractical as source data owners cannot be expected to follow particular pretraining protocols. To solve this, we propose Temporal Source Recovery (TemSR), a framework that transfers temporal dependencies for effective TS-SFUDA without requiring source-specific designs. TemSR features a recovery process that leverages masking, recovery, and optimization to generate a source-like distribution with recovered source temporal dependencies. To ensure effective recovery, we further design segment-based regularization to restore local dependencies and anchor-based recovery diversity maximization to enhance the diversity of the source-like distribution. The source-like distribution is then adapted to the target domain using traditional UDA techniques. Extensive experiments across multiple TS tasks demonstrate the effectiveness of TemSR, which even surpasses existing TS-SFUDA methods that require source domain designs.

851Decomposed Learning and Grokking

[openreview] [pdf]

Abstract Grokking is a delayed transition from memorisation to generalisation in neural networks. It poses challenges for efficient learning, particularly in structured tasks and small-data regimes. This paper explores grokking in modular arithmetic, focusing explicitly on modular division with a modulus of 97. We introduce a novel learning method called Decomposed Learning, which leverages Singular Value Decomposition (SVD) to modify the weight matrices of neural networks. Decomposed Learning reduces or avoids grokking by changing the representation of a weight matrix A into the product of three matrices U, Σ and V^T, promoting the discovery of compact, generalisable representations early in the learning process. Through empirical evaluations on the modular division task, we show that Decomposed Learning significantly reduces the effect of grokking and, in some cases, eliminates it. Moreover, Decomposed Learning can reduce the parameters required for practical training, enhancing model efficiency and generalisation. These results suggest that our SVD-based method provides a practical and scalable solution for mitigating grokking, with implications for broader transformer-based learning tasks.
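
A minimal sketch of the decomposed parametrization described above, in PyTorch; the module name, rank, and initialization are illustrative assumptions, not the paper's exact recipe:

```python
import torch
import torch.nn as nn

class SVDLinear(nn.Module):
    """Linear layer whose weight is parametrized as U @ diag(s) @ V^T, a sketch
    of the decomposed representation A = U Σ V^T described in the abstract.
    The rank and initialization here are illustrative choices."""
    def __init__(self, in_features, out_features, rank):
        super().__init__()
        self.U = nn.Parameter(torch.randn(out_features, rank) / rank**0.5)
        self.s = nn.Parameter(torch.ones(rank))          # singular values, learned
        self.Vt = nn.Parameter(torch.randn(rank, in_features) / in_features**0.5)

    def forward(self, x):
        weight = self.U @ torch.diag(self.s) @ self.Vt   # (out, in), rank-limited
        return x @ weight.T

# Drop-in replacement for nn.Linear in, e.g., a small MLP for modular division:
layer = SVDLinear(in_features=256, out_features=256, rank=32)
y = layer(torch.randn(8, 256))
```

A low rank caps the layer's capacity to memorise, which is one plausible mechanism by which such a factorization could shift learning toward compact representations earlier.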

852Continual Learning: Less Forgetting, More OOD Generalization via Adaptive Contrastive Replay

[openreview] [pdf]

Abstract Machine learning models often suffer from catastrophic forgetting of previously learned knowledge when learning new classes. Various methods have been proposed to mitigate this issue. However, rehearsal-based learning, which retains samples from previous classes, typically achieves good performance but tends to memorize specific instances, struggling with Out-of-Distribution (OOD) generalization. This often leads to high forgetting rates and poor generalization. Surprisingly, the OOD generalization capabilities of these methods have been largely unexplored. In this paper, we highlight this issue and propose a simple yet effective strategy inspired by contrastive learning and data-centric principles to address it. We introduce Adaptive Contrastive Replay (ACR), a method that employs dual optimization to simultaneously train both the encoder and the classifier. ACR adaptively populates the replay buffer with misclassified samples while ensuring a balanced representation of classes and tasks. By refining the decision boundary in this way, ACR achieves a balance between stability and plasticity. Our method significantly outperforms previous approaches in terms of OOD generalization, achieving an improvement of 13.41% on Split CIFAR-100, 9.91% on Split Mini-ImageNet, and 5.98% on Split Tiny-ImageNet.

853Offline Reinforcement Learning with Closed-loop Policy Evaluation and Diffusion World-Model Adaptation

[openreview] [pdf]

Abstract Generative models, particularly diffusion models, have been utilized as world models in offline reinforcement learning (RL) to generate synthetic data, enhancing policy learning efficiency. Current approaches either train diffusion models once before policy learning begins or rely on online interactions for alignment. In this paper, we propose a novel offline RL algorithm, Adaptive Diffusion World Model for Policy Evaluation (ADEPT), which integrates closed-loop policy evaluation with world model adaptation. It employs an uncertainty-penalized diffusion model to iteratively interact with the target policy for evaluation. The uncertainty of the world model is estimated by comparing outputs generated under different noises, which is then used to constrain out-of-distribution actions. During policy training, the diffusion model performs importance-sampled updates to progressively align with the evolving policy. We analyze the performance of the proposed method and provide an upper bound on the return gap between our method and the real environment under an optimal policy. The results shed light on various key factors affecting learning performance. Evaluations on the D4RL benchmark demonstrate significant improvement over state-of-the-art baselines, especially when only sub-optimal demonstrations are available -- thus requiring improved alignment between the world model and offline policy evaluation.

854Mitigating Unobserved Confounding via Diffusion Probabilistic Models

[openreview] [pdf]

Abstract Conditional average treatment effect estimation from observational data is a challenging task due to the existence of unobserved confounders. Previous methods mostly rely on the ignorability assumption, ignoring the unobserved confounders, or overlook the impact of a priori knowledge on the generation process of the latent variable, which can be quite impractical in real-world scenarios. Motivated by recent advances in latent variable modeling, we propose to capture the unobserved latent space using a diffusion model and to estimate the causal effect accordingly. More concretely, we build on the reverse diffusion process for the unobserved confounders as a Markov chain conditioned on a priori knowledge. In order to implement our model in a feasible way, we derive the variational bound in closed form. In the experiments, we compare our model with state-of-the-art methods based on both synthetic and real-world datasets, demonstrating consistent improvements of our model.

855Learning to Achieve Goals with Belief State Transformers

[openreview] [pdf]

Abstract We introduce the “Belief State Transformer”, a next-token predictor that takes both a prefix and suffix as inputs, with a novel objective of predicting both the next token for the prefix and the previous token for the suffix. The Belief State Transformer effectively learns to solve challenging problems that conventional forward-only transformers struggle with, in a domain-independent fashion. Key to this success is learning a compact belief state that captures all relevant information necessary for accurate predictions. Empirical ablations show that each component of the model is essential in difficult scenarios where standard Transformers fall short. For the task of story writing with known prefixes and suffixes, our approach outperforms the Fill-in-the-Middle method for reaching known goals and demonstrates improved performance even when the goals are unknown. Altogether, the Belief State Transformer enables more efficient goal-conditioned decoding, better test-time inference, and high-quality text representations on small-scale problems.
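
To make the two-sided objective concrete, here is a toy sketch in PyTorch: encode the prefix forward and the suffix backward, form a combined belief state, and train two heads to predict the prefix's next token and the suffix's previous token. GRU encoders stand in for the paper's transformers, and all names and sizes are illustrative:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

V, D = 100, 64   # toy vocabulary size and hidden width

class BeliefStateToy(nn.Module):
    """Toy version of the two-sided objective; GRUs replace the transformer
    encoders used in the paper."""
    def __init__(self):
        super().__init__()
        self.emb = nn.Embedding(V, D)
        self.fwd = nn.GRU(D, D, batch_first=True)
        self.bwd = nn.GRU(D, D, batch_first=True)
        self.next_head = nn.Linear(2 * D, V)   # next token after the prefix
        self.prev_head = nn.Linear(2 * D, V)   # previous token before the suffix

    def forward(self, prefix, suffix):
        _, h_f = self.fwd(self.emb(prefix))              # forward state of the prefix
        _, h_b = self.bwd(self.emb(suffix.flip(1)))      # backward state of the suffix
        belief = torch.cat([h_f[-1], h_b[-1]], dim=-1)   # compact belief state
        return self.next_head(belief), self.prev_head(belief)

model = BeliefStateToy()
seq = torch.randint(0, V, (4, 12))                       # toy token sequences
prefix, suffix = seq[:, :5], seq[:, 7:]                  # leave a 2-token gap
next_logits, prev_logits = model(prefix, suffix)
loss = F.cross_entropy(next_logits, seq[:, 5]) + F.cross_entropy(prev_logits, seq[:, 6])
```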

856ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning

[openreview] [pdf]

Abstract Recently, advancements in video synthesis have attracted significant attention. Video synthesis models such as AnimateDiff and Stable Video Diffusion have demonstrated the practical applicability of diffusion models in creating dynamic visual content. The emergence of SORA has further spotlighted the potential of video generation technologies. Despite these advancements, the extension of video lengths remains constrained by computational resources. Most existing video synthesis models are limited to generating short video clips. In this paper, we propose a novel post-tuning methodology for video synthesis models, called ExVideo. This approach is designed to enhance the capability of current video synthesis models, allowing them to produce content over extended temporal durations while incurring lower training expenditures. In particular, we design extension strategies for common temporal model architectures, including 3D convolution, temporal attention, and positional embedding. To evaluate the efficacy of our proposed post-tuning approach, we trained ExSVD, an extended model based on the Stable Video Diffusion model. Our approach enhances the model’s capacity to generate up to 5× its original number of frames, requiring only 1.5k GPU hours of training on a dataset comprising 40k videos. Importantly, the substantial increase in video length doesn’t compromise the model’s innate generalization capabilities, and the model showcases its advantages in generating videos of diverse styles and resolutions. We will release the source code and the enhanced model publicly.

857On Minimizing Adversarial Counterfactual Error in Adversarial Reinforcement Learning

[openreview] [pdf]

Abstract Deep Reinforcement Learning (DRL) policies are critically vulnerable to adversarial noise in observations, posing severe risks in safety-critical scenarios. For example, a self-driving car receiving manipulated sensory inputs about traffic signs could lead to catastrophic outcomes. Existing strategies to fortify RL algorithms against such adversarial perturbations generally fall into two categories: (a) using regularization methods that enhance robustness by incorporating adversarial loss terms into the value objectives, and (b) adopting “maximin” principles, which focus on maximizing the minimum value to ensure robustness. While regularization methods reduce the likelihood of successful attacks, their effectiveness drops significantly if an attack does succeed. On the other hand, maximin objectives, although robust, tend to be overly conservative. To address this challenge, we introduce a novel objective called Adversarial Counterfactual Error (ACoE), which naturally balances optimizing value and robustness against adversarial attacks. To optimize ACoE in a scalable manner in model-free settings, we propose a theoretically justified surrogate objective known as Cumulative-ACoE (C-ACoE). The core idea of optimizing C-ACoE is utilizing the belief about the underlying true state given the adversarially perturbed observation. Our empirical evaluations demonstrate that our method outperforms current state-of-the-art approaches for addressing adversarial RL problems across all established benchmarks (MuJoCo, Atari, and Highway) used in the literature.

858General Framework for Off-Policy Learning with Partially-Observed Reward

[openreview] [pdf]

Abstract Off-policy learning (OPL) in contextual bandits aims to learn a decision-making policy that maximizes the target rewards by using only historical interaction data collected under previously developed policies. Unfortunately, when rewards are only partially observed, the effectiveness of OPL degrades severely. Well-known examples of such partial rewards include explicit ratings in content recommendations, conversion signals on e-commerce platforms that are partial due to delay, and the issue of censoring in medical problems. One possible solution to deal with such partial rewards is to use secondary rewards, such as dwelling time, clicks, and medical indicators, which are more densely observed. However, relying solely on such secondary rewards can also lead to poor policy learning since they may not align with the target reward. Thus, this work studies a new and general problem of OPL where the goal is to learn a policy that maximizes the expected target reward by leveraging densely observed secondary rewards as supplemental data. We then propose a new method called Hybrid Policy Optimization for Partially-Observed Reward (HyPeR), which effectively uses the secondary rewards in addition to the partially observed target reward to achieve effective OPL despite the challenging scenario. We also discuss a case where we aim to optimize not only the expected target reward but also the expected secondary rewards to some extent; counter-intuitively, we show that leveraging the two objectives is in fact advantageous even for optimizing the target reward alone. Along with a statistical analysis of our proposed methods, empirical evaluations on both synthetic and real-world data show that HyPeR outperforms existing methods in various scenarios.

859Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models Trained on Corrupted Data

[openreview] [pdf]

Abstract We provide a framework for solving inverse problems with diffusion models learned from linearly corrupted data. First, we extend the Ambient Diffusion framework to enable training directly from measurements corrupted in the Fourier domain, and we train diffusion models for MRI with access only to Fourier-subsampled multi-coil measurements at acceleration factors R = 2, 4, 6, 8. Second, we propose Ambient Diffusion Posterior Sampling (A-DPS), a reconstruction algorithm that leverages generative models pre-trained on one type of corruption (e.g. image inpainting) to perform posterior sampling on measurements from a different forward process (e.g. image blurring). For MRI reconstruction in high acceleration regimes, we observe that A-DPS models trained on subsampled data are better suited to solving inverse problems than models trained on fully sampled data. We also test the efficacy of A-DPS on natural image datasets (CelebA, FFHQ, and AFHQ) and show that A-DPS can sometimes outperform models trained on clean data for several image restoration tasks in both speed and performance.

860Variational Mode Decomposition and Linear Embeddings are What You Need For Time-Series Forecasting

[openreview] [pdf]

Abstract Time-series forecasting often faces challenges due to data volatility, which can lead to inaccurate predictions. Variational Mode Decomposition (VMD) has emerged as a promising technique to mitigate volatility by decomposing data into distinct modes, enhancing forecast accuracy. This study integrates VMD with linear models to develop a robust forecasting framework. Our approach is evaluated on 13 diverse datasets, including ETTm2, WindTurbine, M4, and 10 air quality datasets from Southeast Asian cities. The effectiveness of the VMD strategy is assessed by comparing Root Mean Squared Error (RMSE) values from models utilizing VMD against those without it. Additionally, we benchmark linear-based models against well-known neural network architectures such as LSTM, BLSTM, and RNN. The results demonstrate a significant reduction in RMSE across nearly all models following VMD application. Notably, the Linear + VMD model achieved the lowest average RMSE in univariate forecasting at 0.619. In multivariate forecasting, the DLinear + VMD model consistently outperformed others, attaining the lowest RMSE across all datasets with an average of 0.019. These findings underscore the effectiveness of combining VMD with linear models for superior time-series forecasting.
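
A minimal sketch of the VMD-plus-linear pipeline: decompose the series into K modes, fit one linear model per mode on lagged windows, and sum the one-step forecasts. This assumes the vmdpy package and its VMD(f, alpha, tau, K, DC, init, tol) signature; the hyperparameters are illustrative, not the paper's:

```python
import numpy as np
from vmdpy import VMD                      # pip install vmdpy (assumed available)
from sklearn.linear_model import LinearRegression

def vmd_linear_forecast(series, K=5, lookback=24):
    """Decompose, forecast each mode linearly, and sum the per-mode forecasts."""
    # alpha: bandwidth constraint, tau: noise tolerance, DC: keep DC mode, tol: convergence
    modes, _, _ = VMD(series, alpha=2000, tau=0.0, K=K, DC=0, init=1, tol=1e-7)
    forecast = 0.0
    for mode in modes:                     # modes: array of shape (K, len(series))
        X = np.lib.stride_tricks.sliding_window_view(mode[:-1], lookback)
        y = mode[lookback:]                # window i predicts mode[i + lookback]
        model = LinearRegression().fit(X, y)
        forecast += model.predict(mode[-lookback:][None, :])[0]
    return forecast

t = np.arange(512)
series = np.sin(0.1 * t) + 0.5 * np.sin(0.9 * t) + 0.1 * np.random.randn(512)
print(vmd_linear_forecast(series))
```

The design intuition matches the abstract: each mode is far smoother than the raw series, so a simple linear model per mode can suffice where a single model on the volatile original would struggle.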

861How to Get Your LLM to Generate Challenging Problems for Evaluation

[openreview] [pdf]

Abstract The pace of evolution of Large Language Models (LLMs) necessitates new approaches for rigorous and comprehensive evaluation. Traditional human annotation is increasingly impracticable due to the complexities and costs involved in generating high-quality, challenging problems, particularly for tasks such as long-context reasoning. Moreover, the rapid saturation of existing human-curated benchmarks by LLMs further necessitates the development of scalable and automatically renewable evaluation methodologies. In this work, we introduce CHASE, a unified framework to synthetically generate challenging problems using LLMs without human involvement. For a given task, our approach builds a hard problem in a bottom-up manner from simpler components. Moreover, since we want to generate synthetic data for evaluation, our framework decomposes the generation process into independently verifiable sub-tasks, thereby ensuring a high level of quality and correctness. We implement CHASE to create evaluation benchmarks across three diverse domains: document-based question answering, repository-level code completion, and math reasoning. The performance of state-of-the-art LLMs on these synthetic benchmarks lies in the range of 40-60% accuracy, thereby demonstrating the effectiveness of our framework at generating hard problems. Our experiments further reveal that the Gemini models significantly outperform other LLMs at long-context reasoning, and that the performance of all LLMs drastically drops by as much as 70% when we scale up the context size to 50k tokens.

862A Computation and Communication Efficient Projection-free Algorithm for Decentralized Constrained Optimization

[openreview] [pdf]

Abstract Decentralized constrained optimization problems arise in numerous real-world applications, where a major challenge lies in the computational complexity of projecting onto complex sets, especially in large-scale systems. The projection-free Frank-Wolfe (FW) method is popular for constrained optimization over complex sets due to its efficiency in avoiding the projection step. However, when applying FW methods to decentralized constrained finite-sum optimization problems, previous studies provide suboptimal incremental first-order oracle (IFO) bounds in both convex and non-convex settings. In this paper, we propose a stochastic algorithm named Decentralized Variance Reduction Gradient Tracking Frank-Wolfe (DVRGTFW), which incorporates variance reduction, gradient tracking, and multi-consensus in the FW update to obtain tight bounds. We present a novel convergence analysis, diverging from previous decentralized FW methods, and demonstrate $\tilde{\mathcal{O}}(n+\sqrt{n/m}\,L\varepsilon^{-1})$ and $\mathcal{O}(\sqrt{n/m}\,L^2\varepsilon^{-2})$ IFO complexity bounds in convex and non-convex settings, respectively. To the best of our knowledge, these bounds are the best achieved in the literature to date. Besides, in the non-convex case, DVRGTFW achieves $\mathcal{O}(L^2\varepsilon^{-2}/\sqrt{1-\lambda_2(W)})$ communication complexity, which is close to the lower bound $\Omega(L\varepsilon^{-2}/\sqrt{1-\lambda_2(W)})$. Empirical results validate the convergence properties of DVRGTFW and highlight its superior performance over other related methods.
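
For context, a minimal sketch of the basic projection-free Frank-Wolfe update that such methods build on, shown with an L1-ball linear minimization oracle; this is vanilla FW, not the paper's variance-reduced decentralized algorithm:

```python
import numpy as np

def lmo_l1(grad, radius=1.0):
    """Linear minimization oracle over the L1 ball: argmin_{||s||_1 <= r} <grad, s>.
    Closed form: put all mass on the largest-magnitude coordinate."""
    s = np.zeros_like(grad)
    i = np.argmax(np.abs(grad))
    s[i] = -radius * np.sign(grad[i])
    return s

def frank_wolfe(grad_fn, x0, steps=200, radius=1.0):
    """Vanilla Frank-Wolfe: no projection, just an LMO call and a convex combination."""
    x = x0.copy()
    for t in range(steps):
        s = lmo_l1(grad_fn(x), radius)
        gamma = 2.0 / (t + 2)          # standard FW step-size schedule
        x = (1 - gamma) * x + gamma * s
    return x

# Toy usage: sparse least squares, min ||Ax - b||^2 over the L1 ball.
rng = np.random.default_rng(0)
A, b = rng.standard_normal((50, 20)), rng.standard_normal(50)
x = frank_wolfe(lambda x: 2 * A.T @ (A @ x - b), np.zeros(20))
```

The appeal in decentralized settings is visible here: the only set-dependent operation is the cheap LMO, which the paper then combines with variance-reduced local gradients and consensus steps.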

863On last-iterate convergence of distributed Stochastic Gradient Descent algorithm with momentum

[openreview] [pdf]

Abstract Distributed stochastic gradient optimization algorithms are studied extensively to address challenges in centralized approaches, such as data privacy, communication load, and computational efficiency, especially when dealing with large datasets. However, convergence theory for these algorithms has been limited, particularly for distributed momentum-based SGD (mSGD) algorithms. Current theoretical work on distributed mSGD algorithms primarily focuses on establishing time-average convergence theory, whereas last-iterate convergence—considered a stronger and more practical notion than time-average convergence—has yet to be thoroughly explored. In this paper, we aim to establish the last-iterate convergence theory for a class of distributed mSGD algorithms with a decaying learning rate. First, we propose a general framework for distributed mSGD algorithms. Within this framework and under general conditions, we prove the last-iterate convergence of the gradient of the loss function for a class of distributed mSGD algorithms. Furthermore, we estimate the corresponding last-iterate convergence rate under supplementary conditions. Moreover, we theoretically prove that, in the early stage, adding a momentum term makes the iterations converge more rapidly to a neighborhood of the stationary point. Experiments are provided to illustrate the theoretical findings.
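
A minimal sketch of the kind of scheme studied here: local momentum steps with a decaying learning rate followed by an averaging step. The exact mixing rule and step-size schedule are illustrative assumptions, with plain averaging standing in for a general consensus step:

```python
import numpy as np

def distributed_msgd(grad_fns, x0, T=500, beta=0.9, lr0=0.1):
    """Each agent takes a local momentum-SGD step on its own stochastic gradient,
    then iterates are averaged (an all-reduce stand-in for the mixing step)."""
    n = len(grad_fns)
    x = [x0.copy() for _ in range(n)]
    m = [np.zeros_like(x0) for _ in range(n)]
    for t in range(1, T + 1):
        lr = lr0 / np.sqrt(t)                      # decaying learning rate
        for i in range(n):
            m[i] = beta * m[i] + grad_fns[i](x[i]) # momentum accumulation
            x[i] = x[i] - lr * m[i]
        avg = sum(x) / n                           # consensus / averaging step
        x = [avg.copy() for _ in range(n)]
    return avg

# Toy usage: three agents with noisy gradients of the same quadratic.
rng = np.random.default_rng(0)
grads = [lambda x, r=rng: 2 * x + 0.1 * r.standard_normal(x.shape) for _ in range(3)]
x_final = distributed_msgd(grads, np.ones(4))
```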

864From Conflicts to Convergence: A Zeroth-order Method for Multi-Objective Learning

[openreview] [pdf]

Abstract Multi-objective learning (MOL) is a popular paradigm for learning problems under multiple criteria, where various dynamic weighting algorithms (e.g., MGDA and MODO) have been formulated to find an update direction that avoids conflicts among objectives. Recently, increasing effort has been devoted to tackling black-box MOL, where the gradient information of the objectives is unavailable or difficult to obtain. Despite the impressive success of zeroth-order methods for single-objective black-box learning, the corresponding MOL algorithms and theoretical understanding are largely absent. Unlike in single-objective problems, the errors introduced in MOL by zeroth-order gradients can simultaneously affect both the gradient estimation and the gradient coefficients λ, leading to further error amplification. To address this issue, we propose a Stochastic Zeroth-order Multiple Objective Descent algorithm (SZMOD), which leverages function evaluations to approximate gradients and develops a new decomposition strategy to handle complicated black-box multi-objective optimization. Theoretically, we provide convergence and generalization guarantees for SZMOD in both general non-convex and strongly convex settings. Our results demonstrate that the proposed SZMOD enjoys a promising generalization bound of $\mathcal{O}(n^{-1/2})$, which is comparable to existing results for first-order methods requiring additional gradient information. Experimental results validate our theoretical analysis.
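
A minimal sketch of the two ingredients the abstract describes: a two-point zeroth-order gradient estimator built from function evaluations, and a dynamic-weighting step (here the two-objective MGDA closed form stands in for SZMOD's decomposition strategy):

```python
import numpy as np

def zo_grad(f, x, mu=1e-3, n_dirs=32, rng=None):
    """Two-point zeroth-order estimator:
    g ≈ mean_u [(f(x + mu*u) - f(x - mu*u)) / (2*mu) * u], with u ~ N(0, I)."""
    rng = rng or np.random.default_rng()
    g = np.zeros_like(x)
    for _ in range(n_dirs):
        u = rng.standard_normal(x.shape)
        g += (f(x + mu * u) - f(x - mu * u)) / (2 * mu) * u
    return g / n_dirs

def min_norm_weight(g1, g2):
    """Closed-form min-norm weighting for two objectives (MGDA):
    lambda = clip(<g2 - g1, g2> / ||g1 - g2||^2, 0, 1)."""
    diff = g1 - g2
    return np.clip(np.dot(g2 - g1, g2) / (np.dot(diff, diff) + 1e-12), 0.0, 1.0)

# Toy bi-objective descent using only function evaluations.
f1 = lambda x: np.sum((x - 1.0) ** 2)
f2 = lambda x: np.sum((x + 1.0) ** 2)
x = np.zeros(5)
for _ in range(100):
    g1, g2 = zo_grad(f1, x), zo_grad(f2, x)
    lam = min_norm_weight(g1, g2)
    x -= 0.05 * (lam * g1 + (1 - lam) * g2)
```

Note how the noise in g1 and g2 enters the weight lam as well as the step itself, which is exactly the error-amplification channel the abstract highlights.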

865ParetoFlow: Guided Flows in Multi-Objective Optimization

[openreview] [pdf]

Abstract In offline multi-objective optimization (MOO), we leverage an offline dataset of designs and their associated labels to simultaneously minimize multiple objectives. This setting more closely mirrors complex real-world problems compared to single-objective optimization. Recent works mainly employ evolutionary algorithms and Bayesian optimization, with limited attention given to the generative modeling capabilities inherent in such data. In this study, we explore generative modeling in offline MOO through flow matching, noted for its effectiveness and efficiency. We introduce a ParetoFlow method, specifically designed to guide flow sampling to approximate the Pareto front. Traditional predictor (classifier) guidance is inadequate for this purpose because it models only a single objective. In response, we propose a multi-objective predictor guidance module that assigns each sample a weight vector, representing a weighted distribution across multiple objective predictions. A local filtering scheme is introduced to address non-convex Pareto fronts. These weights uniformly cover the entire objective space, effectively directing sample generation towards the Pareto front. Since distributions with similar weights tend to generate similar samples, we introduce a neighboring evolution module to foster knowledge sharing among neighboring distributions. This module generates offspring from these distributions, and selects the most promising one for the next iteration. Our method achieves state-of-the-art performance across various tasks. Our code is available.

866Do LLMs estimate uncertainty well in instruction-following?

[openreview] [pdf]

Abstract Large language models (LLMs) could be valuable personal AI agents across various domains, provided they can precisely follow user instructions. However, recent studies have shown significant limitations in LLMs’ instruction-following capabilities, raising concerns about their reliability in high-stakes applications. Accurately estimating LLMs’ uncertainty in adhering to instructions is critical to mitigating deployment risks. We present, to our knowledge, the first systematic evaluation of the uncertainty estimation abilities of LLMs in the context of instruction-following. Our study identifies key challenges with existing instruction-following benchmarks, where multiple factors are entangled with the uncertainty that stems from instruction-following, complicating the isolation and comparison across methods and models. To address these issues, we introduce a controlled evaluation setup with two benchmark versions of the data, enabling a comprehensive comparison of uncertainty estimation methods under various conditions. Our findings show that existing uncertainty methods struggle, particularly when models make subtle errors in instruction following. While internal model states provide some improvement, they remain inadequate in more complex scenarios. The insights from our controlled evaluation setups provide a crucial understanding of LLMs’ limitations and potential for uncertainty estimation in instruction-following tasks, paving the way for more trustworthy AI agents.

867ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning

[openreview] [pdf]

Abstract While large-scale text-to-image diffusion models have demonstrated impressive image-generation capabilities, there are significant concerns about their potential misuse for generating unsafe content, violating copyright, and perpetuating societal biases. Recently, the text-to-image generation community has begun addressing these concerns by editing or unlearning undesired concepts from pre-trained models. However, these methods often involve data-intensive and inefficient fine-tuning or utilize various forms of token remapping, rendering them susceptible to adversarial jailbreaks. In this paper, we present a simple and effective training-free approach, ConceptPrune, wherein we first identify critical regions within pre-trained models responsible for generating undesirable concepts, thereby facilitating straightforward concept unlearning via weight pruning. Experiments across a range of concepts including artistic styles, nudity, and object erasure demonstrate that target concepts can be efficiently erased by pruning a tiny fraction, approximately 0.12% of total weights, enabling multi-concept erasure and robustness against various white-box and black-box adversarial attacks.

868Beyond Markov Assumption: Improving Sample Efficiency in MDPs by Historical Augmentation

[openreview] [pdf]

Abstract Under the Markov assumption of Markov Decision Processes (MDPs), an optimal stationary policy does not need to consider history and is no worse than any non-stationary or history-dependent policy. Therefore, existing Deep Reinforcement Learning (DRL) algorithms usually model sequential decision-making as an MDP and then try to optimize a stationary policy by single-step state transitions. However, such optimization often suffers from sample inefficiency when the causal relationships of state transitions are complex. To address this problem, this paper investigates whether augmenting the states with their historical information can simplify the complex causal relationships in MDPs and thus improve the sample efficiency of DRL. First, we demonstrate that a complex causal relationship of single-step state transitions may be inferred by a simple causal function of the historically augmented states. Then, we propose a convolutional neural network architecture to learn the representation of the current state and its historical trajectory. The main idea of this representation learning is to compress the high-dimensional historical trajectories into a low-dimensional space. In this way, we can extract the simple causal relationships from historical information and avoid the overfitting caused by high-dimensional data. Finally, we formulate the Historical Augmentation Aided Actor-Critic (HA3C) algorithm by adding the learned representations to the actor-critic method. Experiments on standard MDP tasks demonstrate that HA3C outperforms current state-of-the-art methods in terms of both sample efficiency and performance.

869Analytic DAG Constraints for Differentiable DAG Learning

[openreview] [pdf]

Abstract Recovering underlying Directed Acyclic Graph (DAG) structures from observational data presents a formidable challenge due to the combinatorial nature of the DAG-constrained optimization problem. Recently, researchers have identified gradient vanishing as one of the primary obstacles in differentiable DAG learning and have proposed several DAG constraints to mitigate this issue. By developing the necessary theory to establish a connection between analytic functions and DAG constraints, we demonstrate that analytic functions from the set $\{f(x) = c_0 + \sum_{i=1}^{\infty} c_i x^i \mid c_0 \geqslant 0;\ \forall i > 0, c_i > 0;\ r = \lim_{i\rightarrow\infty} c_i/c_{i+1} > 0\}$ can be employed to formulate effective DAG constraints. Furthermore, we establish that this set of functions is closed under several functional operators, including differentiation, summation, and multiplication. Consequently, these operators can be leveraged to create novel DAG constraints based on existing ones. Using these properties, we design a series of DAG constraints and an efficient algorithm to evaluate them. Experiments in various settings show that our DAG constraints outperform previous state-of-the-art approaches.
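
A minimal sketch of such analytic DAG constraints: the exponential member (the well-known NOTEARS constraint, with c_i = 1/i!) and a generic truncated-series member. The coefficients and truncation below are illustrative, not the paper's specific designs:

```python
import numpy as np
from scipy.linalg import expm

def dag_constraint_exp(W):
    """h(W) = tr(exp(W ∘ W)) - d: zero iff W contains no directed cycle."""
    d = W.shape[0]
    return np.trace(expm(W * W)) - d

def dag_constraint_poly(W, coeffs):
    """Generic truncated member of the family: h(W) = tr(sum_i c_i (W∘W)^i) - d*c_0,
    which reduces to sum_{i>=1} c_i tr((W∘W)^i) and vanishes iff W is acyclic."""
    d = W.shape[0]
    M, P, h = W * W, np.eye(d), 0.0
    for c in coeffs:                      # coeffs = [c_0, c_1, c_2, ...]
        h += c * np.trace(P)
        P = P @ M
    return h - d * coeffs[0]

W_dag = np.triu(np.random.rand(4, 4), k=1)   # strictly upper-triangular => acyclic
W_cyc = W_dag + W_dag.T                      # symmetric => contains cycles
print(dag_constraint_exp(W_dag), dag_constraint_exp(W_cyc))   # ~0 vs. > 0
```

The intuition: tr((W∘W)^i) sums the weights of length-i directed cycles, so any positive-coefficient analytic series of these traces penalizes exactly the cyclic structure, and the choice of coefficients governs how quickly gradients vanish.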

870DiffImp: Efficient Diffusion Model for Probabilistic Time Series Imputation with Bidirectional Mamba Backbone

[openreview] [pdf]

Abstract Probabilistic time series imputation has been widely applied in real-world scenarios due to its ability to estimate the uncertainty of imputation results. Meanwhile, denoising diffusion probabilistic models (DDPMs) have achieved great success in probabilistic time series imputation tasks with their power to model complex distributions. However, current DDPM-based probabilistic time series imputation methodologies are confronted with two types of challenges: 1) the backbone modules of the denoising parts are not capable of achieving sequence modeling with low time complexity; 2) the architecture of the denoising modules cannot effectively handle the inter-variable and bidirectional dependencies in the time series imputation problem. To address the first challenge, we integrate the computationally efficient state space model, namely Mamba, as the backbone denoising module for DDPMs. To tackle the second challenge, we carefully devise several SSM-based blocks for bidirectional modeling and inter-variable relation understanding. Experimental results demonstrate that our approach can achieve state-of-the-art time series imputation results on multiple datasets, different missing scenarios, and missing ratios.

871Autoencoders for Anomaly Detection are Unreliable

[openreview] [pdf]

Abstract Autoencoders are frequently used for anomaly detection, in both unsupervised and semi-supervised settings. They rely on the assumption that, when trained using the reconstruction loss, they will reconstruct normal data more accurately than anomalous data. Some recent works have posited that this assumption may not always hold, but little has been done to study its validity in theory. In this work we prove that the assumption indeed does not hold, and show that anomalies lying far away from normal data can be perfectly reconstructed in practice. We extend the understanding of autoencoders for anomaly detection by showing how they can reconstruct perfectly outside the bounds of the training data, or interpolate undesirably, and note how this can be dangerous in safety-critical applications. We connect theory to practice by showing that the proven behavior of linear autoencoders also occurs when applying non-linear autoencoders to both tabular data and real-world image data, the two primary application areas of autoencoders for anomaly detection.
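
A minimal sketch of the failure mode in the linear case, using PCA as the optimal linear autoencoder: an anomaly far outside the training range but inside the learned subspace is reconstructed almost perfectly, so its reconstruction error cannot flag it. The toy data and bottleneck size are illustrative:

```python
import numpy as np

# "Normal" data: variance concentrated along the first axis.
rng = np.random.default_rng(0)
normal = rng.standard_normal((1000, 2)) * np.array([3.0, 0.1])

# Optimal linear autoencoder with a 1-D bottleneck = top principal direction.
mean = normal.mean(axis=0)
_, _, Vt = np.linalg.svd(normal - mean, full_matrices=False)
V = Vt[:1].T                                 # (2, 1) encoder/decoder basis

def reconstruct(x):
    return mean + (x - mean) @ V @ V.T       # encode then decode

# An anomaly 300+ standard deviations out, yet inside the learned subspace:
anomaly = np.array([1000.0, 0.0])
print(np.linalg.norm(anomaly - reconstruct(anomaly)))       # ≈ 0: undetectable
print(np.linalg.norm(normal[0] - reconstruct(normal[0])))   # small, as expected
```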

872Finally Rank-Breaking Conquers MNL Bandits: Optimal and Efficient Algorithms for MNL Assortment

[openreview] [pdf]

Abstract We address the problem of active online assortment optimization with preference feedback, which is a framework for modeling user choices and subsetwise utility maximization. The framework is useful in various real-world applications including ad placement, online retail, recommender systems, and fine-tuning language models, amongst many others. The problem, although studied in the past, lacks an intuitive and practical solution approach with a simultaneously efficient algorithm and an optimal regret guarantee. For example, popular assortment selection algorithms often require the presence of a “strong reference” that is always included in the choice sets; further, they are designed to offer the same assortments repeatedly until the reference item gets selected---all such requirements are quite unrealistic for practical applications. In this paper, we design efficient algorithms for the problem of regret minimization in assortment selection with Plackett-Luce (PL) based user choices. We derive a novel concentration guarantee for estimating the score parameters of the PL model using Pairwise Rank-Breaking, which builds the foundation of our proposed algorithms. Moreover, our methods are practical, provably optimal, and devoid of the aforementioned limitations of existing methods.
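
A minimal sketch of the pairwise rank-breaking principle: break each observed choice into pairwise wins, then run the classic minorize-maximize (Zermelo) iteration on the win counts. This illustrates the estimation idea only, not the paper's bandit algorithm or its concentration analysis:

```python
import numpy as np

def rank_break(choices):
    """Break each observed choice (winner, offered_set) into pairwise wins:
    the chosen item 'beats' every other item in the offered assortment."""
    n = max(max(s) for _, s in choices) + 1
    wins = np.zeros((n, n))
    for winner, offered in choices:
        for loser in offered:
            if loser != winner:
                wins[winner, loser] += 1
    return wins

def pl_scores(wins, iters=200):
    """Zermelo / MM estimate of PL scores from pairwise win counts:
    w_i <- W_i / sum_{j != i} n_ij / (w_i + w_j)."""
    n = wins.shape[0]
    w = np.ones(n)
    total = wins + wins.T                      # n_ij: number of i-vs-j comparisons
    for _ in range(iters):
        for i in range(n):
            denom = sum(total[i, j] / (w[i] + w[j]) for j in range(n) if j != i)
            w[i] = wins[i].sum() / max(denom, 1e-12)
        w /= w.sum()                           # fix the scale ambiguity
    return w

choices = [(0, {0, 1, 2}), (0, {0, 1}), (2, {1, 2}), (0, {0, 2}), (1, {1, 2})]
print(pl_scores(rank_break(choices)))
```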

873CRAFT: Time Series Forecasting with Cross-Future Behavior Awareness

[openreview] [pdf]

Abstract Time series forecasting is crucial infrastructure in the field of e-commerce, providing technical support for consumer behavior analysis, sales trend forecasting, etc. E-commerce allows consumers to reserve in advance, and these pre-booking features reflect future sales trends and can increase the certainty of time series forecasting. In this paper, we define these features as Cross-Future Behavior, which occurs before the current time but takes effect in the future. To improve the performance of time series forecasting, we leverage these features and propose the CRoss-Future Behavior Awareness based Time Series Forecasting method (CRAFT). The core idea of CRAFT is to utilize the trend of cross-future behavior to mine the trend of the time series data to be predicted. Specifically, to address the sparsity and partiality of cross-future behavior, CRAFT employs a Koopman Predictor Module to extract the key trend and an Internal Trend Mining Module to supplement the unknown area of the cross-future behavior matrix. Then, we introduce an External Trend Guide Module with a hierarchical structure to acquire more representative trends from higher levels. Finally, we apply a demand-constrained loss to calibrate the distribution deviation of the prediction results. We conduct experiments on real-world data: experiments on both an offline large-scale dataset and an online A/B test demonstrate the effectiveness of CRAFT. Our dataset and code will be released after formal publication.

874Leveraging Semantic and Positional Uncertainty for Trajectory Prediction

[openreview] [pdf]

Abstract Given a time horizon with historical movement data and environmental context, trajectory prediction aims to forecast the future motion of dynamic entities, such as vehicles and pedestrians. A key challenge in this task arises from the dynamic and noisy nature of real-time maps. This noise primarily stems from two sources: (1) positional errors due to sensor inaccuracies or environmental occlusions, and (2) cognitive errors resulting from incorrect scene understanding. In an attempt to solve this problem, we propose a new framework that estimates two kinds of uncertainty simultaneously, i.e., positional uncertainty and semantic uncertainty, and explicitly incorporates both uncertainties into the trajectory prediction process. In particular, we introduce a dual-head structure to independently perform semantic prediction twice and positional prediction twice, and further extract the prediction variance as the uncertainty indicator in an end-to-end manner. The uncertainty is then directly concatenated with the semantic and positional predictions to enhance the trajectory estimation. To validate the effectiveness of our uncertainty-aware approach, we evaluate it on the real-world driving dataset nuScenes. Extensive experiments on 3 map estimation and 2 trajectory prediction approaches show that the proposed method (1) effectively captures map noise through both positional and semantic uncertainties, and (2) seamlessly integrates with and enhances existing trajectory prediction methods on multiple evaluation metrics, i.e., minADE, minFDE, and MR.

875Actionable Inverse Classification with Action Fairness Guarantees

[openreview] [pdf]

Abstract Machine learning (ML) classifiers are increasingly used in critical decision-making domains such as finance, healthcare, and the judiciary. However, their interpretability and fairness remain significant challenges, often leaving users without clear guidance on how to improve unfavourable outcomes. This paper introduces an actionable ML framework that provides minimal, explainable modifications to input data to change classification results. We also propose a novel concept of “action fairness,” which ensures that users from different subgroups incur similar costs when altering their classification outcomes. Our approach identifies the nearest decision boundary point to a given query, allowing for the determination of minimal cost actions. We demonstrate the effectiveness of this method using real-world credit assessment data, showing that our solution not only improves the fairness of classifier outcomes but also enhances their usability and interpretability.

876Better Instruction-Following Through Minimum Bayes Risk

[openreview] [pdf]

Abstract General-purpose LLM judges capable of human-level evaluation provide not only a scalable and accurate way of evaluating instruction-following LLMs but also new avenues for supervising and improving their performance. One promising way of leveraging LLM judges for supervision is through Minimum Bayes Risk (MBR) decoding, which uses a reference-based evaluator to select a high-quality output from amongst a set of candidate outputs. In the first part of this work, we explore using MBR decoding as a method for improving the test-time performance of instruction-following LLMs. We find that MBR decoding with reference-based LLM judges substantially improves over greedy decoding, best-of-N decoding with reference-free judges and MBR decoding with lexical and embedding-based metrics on AlpacaEval and MT-Bench. These gains are consistent across LLMs with up to 70B parameters, demonstrating that smaller LLM judges can be used to supervise much larger LLMs. Then, seeking to retain the improvements from MBR decoding while mitigating additional test-time costs, we explore iterative self-training on MBR-decoded outputs. We find that self-training using Direct Preference Optimisation leads to significant performance gains, such that the self-trained models with greedy decoding generally match and sometimes exceed the performance of their base models with MBR decoding.
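
A minimal sketch of MBR decoding over a candidate set: each candidate is scored by its average utility against the others, which double as pseudo-references. The `utility` argument is any reference-based scorer (the paper uses LLM judges); the token-overlap F1 below is only a stand-in:

```python
import numpy as np

def mbr_decode(candidates, utility):
    """Minimum Bayes Risk decoding: return the candidate with the highest
    expected utility against the other sampled candidates."""
    n = len(candidates)
    scores = np.zeros(n)
    for i, hyp in enumerate(candidates):
        scores[i] = np.mean([utility(hyp, ref)
                             for j, ref in enumerate(candidates) if j != i])
    return candidates[int(np.argmax(scores))]

# Toy usage with token-overlap F1 as a cheap stand-in for an LLM judge.
def f1(hyp, ref):
    h, r = set(hyp.split()), set(ref.split())
    if not h or not r:
        return 0.0
    p, rec = len(h & r) / len(h), len(h & r) / len(r)
    return 2 * p * rec / (p + rec) if p + rec else 0.0

cands = ["the cat sat on the mat", "a cat sat on a mat", "dogs run fast"]
print(mbr_decode(cands, f1))
```

Note the quadratic number of utility calls in the candidate count, which is the test-time cost the paper's self-training stage then tries to amortize away.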

877On-Policy Fine-grained Knowledge Feedback for Hallucination Mitigation

[openreview] [pdf]

Abstract Hallucination occurs when large language models (LLMs) exhibit behavior that deviates from the boundaries of their knowledge during the response generation process. Previous learning-based methods focus on detecting knowledge boundaries and finetuning models with instance-level feedback, but they suffer from inaccurate signals due to off-policy data sampling and coarse-grained feedback. In this paper, we introduce Reinforcement Learning for Hallucination (RLFH), a fine-grained, feedback-based online reinforcement learning method for hallucination mitigation. Unlike previous learning-based methods, RLFH enables LLMs to explore the boundaries of their internal knowledge and provides on-policy, fine-grained feedback on these explorations. To construct fine-grained feedback for learning reliable generation behavior, RLFH decomposes the outcomes of large models into atomic facts, provides statement-level evaluation signals, and traces the signals back to the tokens of the original responses. Finally, RLFH adopts an online reinforcement learning algorithm with these token-level rewards to adjust model behavior for hallucination mitigation. For effective on-policy optimization, RLFH also introduces an LLM-based fact assessment framework to verify the truthfulness and helpfulness of atomic facts without human intervention. Experiments on the HotpotQA, SQuADv2, and Biography benchmarks demonstrate that RLFH can balance models’ usage of internal knowledge during the generation process to eliminate the hallucination behavior of LLMs.

878Effective and Efficient Time-Varying Counterfactual Prediction with State-Space Models

[openreview] [pdf]

Abstract Time-varying counterfactual prediction (TCP) from observational data supports the answer of when and how to assign multiple sequential treatments, yielding importance in various applications. Despite the progress achieved by recent advances, e.g., LSTM- or Transformer-based causal approaches, their capability of capturing interactions in long sequences remains to be improved in both prediction performance and running efficiency. In parallel with the development of TCP, state-space models (SSMs) have achieved remarkable progress in long-sequence modeling with reduced running time. Consequently, studying how Mamba, a representative SSM, can simultaneously benefit the effectiveness and efficiency of TCP becomes a compelling research direction. In this paper, we propose to exploit the advantages of SSMs to tackle the TCP task, by introducing a counterfactual Mamba model with Covariate-based Decorrelation towards Selective Parameters (Mamba-CDSP). Motivated by the over-balancing problem that direct covariate balancing methods exhibit in TCP, we propose to de-correlate the current treatment from the representation of historical covariates, treatments, and outcomes, which can mitigate the confounding bias while preserving more covariate information. In addition, we show that the overall de-correlation in TCP is equivalent to regularizing the selective parameters of Mamba over each time step, which makes our approach effective and lightweight. We conducted extensive experiments on both synthetic and real-world datasets, demonstrating that Mamba-CDSP not only outperforms baselines by a large margin, but also exhibits prominent running efficiency.

879Incorporating Human Preferences into Interpretable Reinforcement Learning with Tree Policies

[openreview] [pdf]

Abstract Interpretable reinforcement learning (RL) seeks to create agents that are efficient, transparent, and understandable to the populations that they impact. A significant gap in current approaches is the underutilization of human feedback, which is typically employed only for post-hoc evaluation. We propose to center the needs of end users by incorporating the feedback that would be obtained in a user study directly into the training of interpretable RL algorithms. Our approach involves preference learning, where we learn preferences over high-level features that are not directly optimizable during the RL training process. We introduce an evolutionary algorithm that leverages user feedback to guide training toward interpretable decision-tree policies that are better aligned with human preferences. We demonstrate the effectiveness of our method through experiments using synthetic preference data. Our results show an improvement in preference alignment compared to baselines, yielding policies that are more aligned with underlying user preferences while remaining sample-efficient in the number of user queries, thereby decreasing the burden on users to provide such data.

880Reward Learning from Multiple Feedback Types

[openreview] [pdf]

Abstract Learning rewards from preference feedback has become an important tool in the alignment of agentic models. Preference-based feedback, often implemented as a binary comparison between multiple completions, is an established method to acquire large-scale human feedback. However, human feedback in other contexts is often much more diverse. Such diverse feedback can better support the goals of a human annotator, and the simultaneous use of multiple sources might be mutually informative for the learning process or carry type-dependent biases for the reward learning process. Despite these potential benefits, learning from different feedback types has yet to be explored extensively. In this paper, we bridge this gap by enabling experimentation and evaluating multi-type feedback in a wide set of environments. We present a process to generate high-quality simulated feedback of six different types. Then, we implement reward models and downstream RL training for all six feedback types. Based on the simulated feedback, we investigate the use of types of feedback across five RL environments and compare them to pure preference-based baselines. We show empirically that diverse types of feedback can be utilized simultaneously and lead to improved reward modeling performance. This work is the first strong indicator of the potential of true multi-type feedback for RLHF.

881Manifold Learning via Foliations, and Knowledge Transfer

[openreview] [pdf]

Abstract Understanding how real data is distributed in high-dimensional spaces is key to many tasks in machine learning. We want to provide a natural geometric structure on the space of data, employing a deep ReLU neural network trained as a classifier. Through the data information matrix (DIM), a variation of the Fisher information matrix, the model will discern a singular foliation structure on the space of data. We show that the singular points of such a foliation are contained in a measure-zero set, and that a local regular foliation exists almost everywhere. Experiments show that the data is correlated with the leaves of such a foliation. Moreover, we show the potential of our approach for knowledge transfer by analyzing the spectrum of the DIM to measure distances between datasets.

882Towards Generalization under Topological Shifts: A Diffusion PDE Perspective

[openreview] [pdf]

Abstract The capability of generalization is a cornerstone for the success of modern learning systems. For non-Euclidean data that particularly involves topological features, one important aspect neglected by prior studies is how learning-based models generalize under topological shifts. This paper makes steps towards understanding the generalization of graph neural networks operated on varying topologies through the lens of diffusion PDEs. Our analysis first reveals that the upper bound of the generalization error yielded by local diffusion equation models, which are intimately related to message passing over observed structures, would exponentially grow w.r.t. topological shifts. In contrast, extending the diffusion operator to a non-local counterpart that learns latent structures from data can in principle control the generalization error under topological shifts even when the model accommodates observed structures. On top of these results, we propose Advective Diffusion Transformer inspired by advective diffusion equations serving as a physics-inspired continuous model that synthesizes observed and latent structures for graph learning. The model demonstrates superiority in various downstream tasks across information networks, molecular screening and protein interactions.

883Diversity-Enhanced and Classification-Aware Prompt Learning for Few-Shot Learning via Stable Diffusion

[openreview] [pdf]

Abstract Recent text-to-image generative models have exhibited an impressive ability to generate fairly realistic images from text prompts. In this work, we explore leveraging off-the-shelf text-to-image generative models to train non-specific downstream few-shot classification architectures on synthetic datasets to classify real images. Current approaches use hand-crafted or model-generated text prompts of text-to-image generative models to generate the desired synthetic images; however, they have limited capability of generating diverse images. In particular, their synthetic datasets have relatively limited relevance to the downstream classification tasks, making it hard to guarantee that models trained on synthetic images are effective in practice. To address this issue, we propose a method capable of adaptively learning proper text prompts for the off-the-shelf diffusion model to generate diverse and classification-aware synthetic images. Our approach shows notable improvements on various classification datasets, with results comparable to existing prompt designing methods. We find that replacing the data generation strategy of existing zero/few-shot methods with the proposed method consistently improves downstream classification performance across different network architectures, demonstrating its model-agnostic characteristic for few-shot learning. This makes it possible to train efficient downstream few-shot learning models on synthetic images generated by the proposed method for real problems.

884Learn out of the box: optimizing both diversity and performance in Offline Reinforcement Learning

[openreview] [pdf]

Abstract In offline reinforcement learning, most existing methods have focused primarily on optimizing performance, often neglecting the promotion of diverse behaviors. While some approaches generate diverse behaviors from well-constructed, heterogeneous datasets, their effectiveness is significantly reduced when applied to less diverse data. To address this, we introduce a novel intrinsic reward mechanism that encourages behavioral diversity, irrespective of the dataset’s heterogeneity. By maximizing the mutual information between actions and policies under each state, our approach enables agents to learn a variety of behaviors, including those not explicitly represented in the data. Although performing out-of-distribution actions can lead to risky outcomes, we mitigate this risk by incorporating the ensemble-diversified actor-critic (EDAC) method to estimate Q-value uncertainty, preventing agents from adopting suboptimal behaviors. Through experiments using the D4RL benchmarks on MuJoCo tasks, we demonstrate that our method achieves behavioral diversity while maintaining performance across environments constructed from both heterogeneous and homogeneous datasets.

885OptionZero: Planning with Learned Options

[openreview] [pdf]

Abstract Planning with options -- sequences of primitive actions -- has been shown to be effective in reinforcement learning within complex environments. Previous studies have focused on planning with predefined options or options learned from expert demonstration data. Inspired by MuZero, which learns superhuman heuristics without any human knowledge, we propose a novel approach, named OptionZero. OptionZero incorporates an option network into MuZero, enabling the autonomous discovery of options through self-play games. Furthermore, we modify the dynamics network in MuZero to provide environment transitions when using options, allowing searching deeper under the same simulation constraints. Empirical experiments conducted in 26 Atari games demonstrate that OptionZero outperforms MuZero, achieving a 131.58% improvement in mean human-normalized score. Our behavior analysis shows that OptionZero not only learns options but also acquires strategic skills tailored to different game characteristics. Our findings show promising directions for discovering and using options in planning.

886Delay-Aware Reinforcement Learning: Insights From Delay Distributional Perspective

[openreview] [pdf]

Abstract Although deep reinforcement learning (DRL) has achieved great success across various domains, the presence of random delays in real-world scenarios (e.g., remote control) poses a significant challenge to its practicality. Existing delay-aware DRL methods mainly focus on state augmentation with historical memory, ensuring that the actions taken are aligned with the true state. However, these approaches still rely on the conventional expected Q value. In contrast, to model delay uncertainty, we aim to go beyond the expected value and propose a distributional DRL method to represent the distribution of this Q value. Based on the delay distribution, we further propose a correction mechanism for the distributional Q value, enabling the agent to learn accurate returns in delayed environments. Finally, we apply these techniques to design the delay-aware distributional actor-critic (DADAC) DRL framework, in which the critic is the corrected distributional value function. Experimental results demonstrate that, compared to the state-of-the-art delay-aware DRL methods, the proposed DADAC exhibits substantial performance advantages in handling random delays in the MuJoCo continuous control tasks. The corresponding source code is available at https://anonymous.4open.science/r/DADAC.

887BOND: Aligning LLMs with Best-of-N Distillation

[openreview] [pdf]

Abstract Reinforcement learning from human feedback (RLHF) is a key driver of quality and safety in state-of-the-art large language models. Yet, a surprisingly simple and strong inference-time strategy is Best-of-N sampling that selects the best generation among N candidates. In this paper, we propose Best-of-N Distillation (BOND), a novel RLHF algorithm that seeks to emulate Best-of-N but without its significant computational overhead at inference time. Specifically, BOND is a distribution matching algorithm that forces the distribution of generations from the policy to get closer to the Best-of-N distribution. We use the Jeffreys divergence (a linear combination of forward and backward KL) to balance between mode-covering and mode-seeking behavior, and derive an iterative formulation that utilizes a moving anchor for efficiency. We demonstrate the effectiveness of our approach and several design choices through experiments on abstractive summarization and Gemma models.
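
A minimal sketch of the Jeffreys divergence used to balance mode-covering (forward KL) and mode-seeking (backward KL) behavior, computed on discrete distributions; the toy numbers are illustrative, not taken from the paper:

```python
import numpy as np

def kl(p, q, eps=1e-12):
    """KL(p || q) on discrete distributions, with smoothing for stability."""
    p, q = np.asarray(p) + eps, np.asarray(q) + eps
    return float(np.sum(p * np.log(p / q)))

def jeffreys(p, q, beta=0.5):
    """Linear combination of forward and backward KL; beta trades off
    mode-covering against mode-seeking behavior."""
    return (1 - beta) * kl(p, q) + beta * kl(q, p)

# Toy: a sharper Best-of-N distribution p vs. the current policy q.
p = np.array([0.70, 0.20, 0.10])   # Best-of-N concentrates on high-reward outputs
q = np.array([0.40, 0.35, 0.25])   # current policy distribution
print(jeffreys(p, q, beta=0.5))
```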

888Task Facet Learning: A Structured Approach to Prompt Optimization

[openreview] [pdf]

Abstract Given a task in the form of a basic description and its training examples, prompt optimization is the problem of synthesizing the given information into a text prompt for a large language model. Humans solve this problem by also considering the different facets that define a task (e.g., counter-examples, explanations, analogies) and including them in the prompt. However, it is unclear whether existing algorithmic approaches, based on iteratively editing a given prompt or automatically selecting a few in-context examples, can cover the multiple facets required to solve a complex task. In this work, we view prompt optimization as the problem of learning multiple facets of a task from a set of training examples. We exploit structure in the prompt optimization problem and break down a prompt into loosely coupled semantic sections. The proposed algorithm, UniPrompt, (1) clusters the input space and uses clustered batches so that each batch likely corresponds to a different facet of the task, and (2) utilizes a feedback mechanism to propose adding, editing or deleting a section, which in turn is aggregated over a batch to capture generalizable facets. Empirical evaluation on multiple datasets and a real-world task shows that prompts generated using UniPrompt obtain higher accuracy than human-tuned prompts and those from state-of-the-art methods. In particular, our algorithm can generate long, complex prompts that existing methods are unable to generate.

889Elucidating the Design Choice of Probability Paths in Flow Matching for Forecasting

[openreview] [pdf]

Abstract Flow matching has recently emerged as a powerful paradigm for generative modeling, and has been extended to probabilistic time series forecasting in latent spaces. However, the impact of the specific choice of probability path model on forecasting performance remains under-explored. In this work, we demonstrate that forecasting spatio-temporal data with flow matching is highly sensitive to the selection of the probability path model. Motivated by this insight, we propose a novel probability path model designed to improve forecasting performance. Our empirical results across various dynamical system benchmarks show that our model achieves faster convergence during training and improved predictive performance compared to existing probability path models. Importantly, our approach is efficient during inference, requiring only a few sampling steps. This makes our proposed model practical for real-world applications and opens new avenues for probabilistic forecasting.
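
For orientation, a minimal conditional flow matching loss with a simple linear probability path is sketched below; the paper's point is precisely that this path choice matters, and its proposed path model differs from the linear one assumed here.

```python
import torch

def flow_matching_loss(model, x0, x1):
    """x0: noise samples, x1: data samples, model(x_t, t) -> predicted velocity."""
    # one random time per sample, broadcastable over remaining dims
    t = torch.rand(x1.size(0), *([1] * (x1.dim() - 1)), device=x1.device)
    x_t = (1 - t) * x0 + t * x1   # linear (rectified-flow style) probability path
    v_target = x1 - x0            # target velocity induced by this path
    return ((model(x_t, t) - v_target) ** 2).mean()
```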

890Toward Principled Transformers for Knowledge Tracing

[openreview] [pdf]

Abstract Knowledge tracing aims to reason about changes in students’ knowledge and to predict students’ performance in educational learning settings. We propose knowledge tracing set transformers (KTSTs), a straightforward model class for knowledge tracing prediction tasks. This model class is conceptually simpler than previous state-of-the-art approaches, which are overly complex due to domain-inspired components, and which are in part based on suboptimal design choices and flawed evaluation. In contrast, for KTSTs we propose principled set representations of student interactions and a simplified variant of learnable modification of attention matrices for positional information in a student’s learning history. While being largely domain-agnostic, the proposed model class thus accounts for characteristic traits of knowledge tracing tasks. In extensive empirical experiments on standardized benchmark datasets, KTSTs establish new state-of-the-art performance.

891Bayesian Learning of Adaptive Koopman Operator with Application to Robust Motion Planning for Autonomous Trucks

[openreview] [pdf]

Abstract Koopman theory has recently been shown to enable an efficient data-driven approach for modeling physical systems, offering a linear framework despite underlying nonlinear dynamics. It is, however, not clear how to account for uncertainty or temporal distributional shifts within this framework, both commonly encountered in real-world autonomous driving with changing weather conditions and time-varying vehicle dynamics. In this work, we introduce Bayesian learning of an adaptive Koopman operator to address these limitations. Specifically, we propose a Bayesian Koopman operator that incorporates uncertainty quantification, enabling more robust predictions. To tackle distributional shifts, we propose an online adaptation mechanism, ensuring the operator remains responsive to changes in system dynamics. Additionally, we apply the architecture to motion planning and show that it gives fast and precise predictions. By leveraging uncertainty awareness and real-time updates, our planner generates dynamically accurate trajectories and makes more informed decisions. We evaluate our method on real-world truck dynamics data under varying weather conditions—such as wet roads, snow, and ice—where uncertainty and dynamic shifts are prominent, as well as in other simulated environments. The results demonstrate our method’s ability to deliver accurate, uncertainty-aware open-loop predictions for dynamic systems.

892OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving

[openreview] [pdf]

Abstract Understanding the evolution of 3D scenes is important for effective autonomous driving. While conventional methods model the scene development with the motion of individual instances, world models emerge as a generative framework to describe the general scene dynamics. However, most existing methods adopt an autoregressive framework to perform next-token prediction, which suffers from inefficiency in modeling long-term temporal evolutions. To address this, we propose a diffusion-based 4D occupancy generation model, OccSora, to simulate the development of the 3D world for autonomous driving. We employ a 4D scene tokenizer to obtain compact discrete spatial-temporal representations for 4D occupancy input and achieve high-quality reconstruction for long-sequence occupancy videos. We then learn a diffusion transformer on the spatial-temporal representations and generate 4D occupancy conditioned on a trajectory prompt. We conduct extensive experiments on the widely used nuScenes dataset with Occ3D occupancy annotations. OccSora can generate 16s videos with authentic 3D layout and temporal consistency, demonstrating its ability to understand the spatial and temporal distributions of driving scenes. With trajectory-aware 4D generation, OccSora has the potential to serve as a world simulator for the decision-making of autonomous driving.

893Data-adaptive Differentially Private Prompt Synthesis for In-Context Learning

[openreview] [pdf]

Abstract Large Language Models (LLMs) rely on the contextual information embedded in examples/demonstrations to perform in-context learning (ICL). To mitigate the risk of LLMs potentially leaking private information contained in examples in the prompt, we introduce a novel data-adaptive differentially private algorithm called AdaDPSyn to generate synthetic examples from the private dataset and then use these synthetic examples to perform ICL. The objective of AdaDPSyn is to adaptively adjust the noise level in the data synthesis mechanism according to the inherent statistical properties of the data, thereby preserving high ICL accuracy while maintaining formal differential privacy guarantees. A key innovation in AdaDPSyn is the Precision-Focused Iterative Radius Reduction technique, which dynamically refines the aggregation radius - the scope of data grouping for noise addition - based on patterns observed in data clustering, thereby minimizing the amount of additive noise. We conduct extensive experiments on standard benchmarks and compare AdaDPSyn with the DP few-shot generation algorithm (Tang et al., 2023). The experiments demonstrate that AdaDPSyn not only outperforms DP few-shot generation, but also maintains high accuracy levels close to those of non-private baselines, providing an effective solution for ICL with privacy protection.
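
A hedged sketch of the radius-reduction idea: clip points to a shrinking L2 radius around a running center and add Gaussian noise scaled to that radius. This is not the paper's exact algorithm; the initial radius bound, shrink factor, and per-round noise calibration (which would require proper privacy composition accounting) are all assumptions.

```python
import numpy as np

def noisy_center_with_radius_reduction(embs, radius, sigma,
                                       n_rounds=3, shrink=0.5):
    """embs: (n, d) private embeddings; radius: a-priori L2 norm bound."""
    center = np.zeros(embs.shape[1])
    for _ in range(n_rounds):
        diffs = embs - center
        norms = np.linalg.norm(diffs, axis=1, keepdims=True)
        # L2-clip each point to the current radius around the center
        clipped = center + diffs * np.minimum(1.0, radius / (norms + 1e-12))
        # mean of L2-clipped points has sensitivity O(radius / n),
        # so the Gaussian noise shrinks together with the radius
        center = clipped.mean(axis=0) + np.random.normal(
            0.0, sigma * radius / len(embs), size=center.shape)
        radius *= shrink  # refine the aggregation radius
    return center
```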

894Advantage Alignment Algorithms

[openreview] [pdf]

Abstract Artificially intelligent agents are increasingly being integrated into human decision-making: from large language model (LLM) assistants to autonomous vehicles. These systems often optimize their individual objective, leading to conflicts, particularly in general-sum games where naive reinforcement learning agents empirically converge to Pareto-suboptimal Nash equilibria. To address this issue, opponent shaping has emerged as a paradigm for finding socially beneficial equilibria in general-sum games. In this work, we introduce Advantage Alignment, a family of algorithms derived from first principles that perform opponent shaping efficiently and intuitively. We achieve this by aligning the advantages of interacting agents, increasing the probability of mutually beneficial actions when their interaction has been positive. We prove that existing opponent shaping methods implicitly perform Advantage Alignment. Compared to these methods, Advantage Alignment simplifies the mathematical formulation of opponent shaping, reduces the computational burden and extends to continuous action domains. We demonstrate the effectiveness of our algorithms across a range of social dilemmas, achieving state-of-the-art cooperation and robustness against exploitation.

895Adversarial Guided Diffusion Models for Adversarial Purification

[openreview] [pdf]

Abstract Diffusion model (DM) based adversarial purification (AP) has proven to be a powerful defense method that can remove adversarial perturbations and generate a purified example without threats. In principle, the pre-trained DMs can only ensure that purified examples conform to the same distribution of the training data, but it may inadvertently compromise the semantic information of input examples, leading to misclassification of purified examples. Recent advancements introduce guided diffusion techniques to preserve semantic information while removing the perturbations. However, these guidances often rely on distance measures between purified examples and diffused examples, which can also preserve perturbations in purified examples. To further unleash the robustness power of DM-based AP, we propose an adversarial guided diffusion model (AGDM) by introducing a novel adversarial guidance that contains sufficient semantic information but does not explicitly involve adversarial perturbations. The guidance is modeled by an auxiliary neural network obtained with adversarial training, considering the distance in the latent representations rather than at the pixel-level values. Extensive experiments are conducted on CIFAR-10, CIFAR-100 and ImageNet to demonstrate that our method is effective for simultaneously maintaining semantic information and removing the adversarial perturbations. In addition, comprehensive comparisons show that our method significantly enhances the robustness of existing DM-based AP, with an average robust accuracy improved by up to 7.30% on CIFAR-10. The code will be available upon acceptance.

896A General Aggregation Federated Learning Intervention Algorithm based on do-Calculus

[openreview] [pdf]

Abstract This article explores federated long-tail learning (Fed-LT) tasks, which involve clients with private and heterogeneous data that exhibit a long-tail distribution. We propose two methods: (a) Client Re-weighted Prior Analyzer (CRePA), which balances the global model’s performance on tail and non-tail categories, enhancing performance on tail categories while maintaining it on non-tail categories; (b) Federated Long-Tail Causal Intervention Model (FedLT-CI), which computes clients’ causal effects on the global model’s tail performance and enhances the interpretability of Fed-LT. Extensive experiments indicate that CRePA achieves state-of-the-art performance compared to other baselines on CIFAR-10-LT and CIFAR-100-LT, and that applying FedLT-CI to all baselines significantly improves tail performance without affecting non-tail performance.

897Multi Task Inverse Reinforcement Learning for Common Sense Reward

[openreview] [pdf]

Abstract One of the challenges in applying reinforcement learning in a complex real-world environment lies in providing the agent with a sufficiently detailed reward function. Any misalignment between the reward and the desired behavior can result in unwanted outcomes. This may lead to issues like “reward hacking” where the agent maximizes rewards through unintended behavior. In this work, we propose to disentangle the reward into two distinct parts: a simple task-specific reward, outlining the particulars of the task at hand, and an unknown common-sense reward, indicating the expected behavior of the agent within the environment. We then explore how this common-sense reward can be learned from expert demonstrations. We first show that inverse reinforcement learning, even when it succeeds in training an agent, does not learn a useful reward function. That is, training a new agent with the learned reward fails to reproduce the desired behaviors. We then demonstrate that this problem can be solved by training simultaneously on multiple tasks. That is, multi-task inverse reinforcement learning can learn a useful reward function.

898WardropNet: Traffic Flow Predictions via Equilibrium-Augmented Learning

[openreview] [pdf]

Abstract When optimizing transportation systems, anticipating traffic flows is a central element. Yet, computing such traffic equilibria remains computationally expensive. Against this background, we introduce a novel combinatorial optimization augmented neural network architecture that allows for fast and accurate traffic flow predictions. We propose WardropNet, a neural network that combines classical layers with a subsequent equilibrium layer: the classical layers inform the equilibrium layer by predicting the parameterization of the equilibrium problem’s latency functions. Using supervised learning, we minimize the difference between the actual traffic flow and the predicted output. We show how to leverage a Bregman divergence fitting the geometry of the equilibria, which allows for end-to-end learning. WardropNet outperforms pure learning-based approaches in predicting traffic equilibria for realistic and stylized traffic scenarios. On realistic scenarios, WardropNet improves on average for time-invariant predictions by up to 72% and for time-variant predictions by up to 23% over pure learning-based approaches.

899Point Cloud Dataset Distillation

[openreview] [pdf]

Abstract This study introduces dataset distillation (DD) tailored for 3D data, particularly point clouds. DD aims to substitute large-scale real datasets with a small set of synthetic samples while preserving model performance. Existing methods mainly focus on structured data such as images. However, adapting DD for unstructured point clouds poses challenges due to their diverse orientations and resolutions in 3D space. To address these challenges, we theoretically demonstrate the importance of matching rotation-invariant features between real and synthetic data for 3D distillation. We further propose a plug-and-play point cloud rotator to align the point cloud to a canonical orientation, facilitating the learning of rotation-invariant features by all point cloud models. Furthermore, instead of optimizing fixed-size synthetic data directly, we devise a point-wise generator to produce point clouds at various resolutions based on the sampled noise amount. Compared to conventional DD methods, the proposed approach, termed DD3D, enables efficient training on low-resolution point clouds while generating high-resolution data for evaluation, thereby significantly reducing memory requirements and enhancing model scalability. Extensive experiments validate the effectiveness of DD3D in shape classification and part segmentation tasks across diverse scenarios, such as cross-architecture and cross-resolution settings.

900Dominant Shuffle: An Incredibly Simple but Exceptionally Effective Data Augmentation Method for Time-Series Prediction

[openreview] [pdf]

Abstract Frequency-domain data augmentation (DA) has shown strong performance in time-series prediction due to its ability to preserve data-label consistency. However, we observed that existing frequency-domain augmentations introduce excessive variability, leading to out-of-distribution samples that may be harmful to model performance. To address this, we introduce two simple modifications to frequency-domain DA. First, we limit perturbations to dominant frequencies with larger magnitudes, which capture the main periodicities and trends of the signal. Second, instead of using complicated random perturbations, we simply shuffle the dominant frequency components, which preserves the original structure while avoiding external noise. With the two simple modifications, we propose dominant shuffle, a simple yet highly effective data augmentation technique for time-series prediction. Our method is remarkably simple, requiring only a few lines of code, yet exceptionally effective, consistently and significantly improving model performance. Extensive experiments on short-term, long-term, few-shot and cold-start prediction tasks with eight state-of-the-art models, nine existing augmentation methods and twelve datasets demonstrate that dominant shuffle consistently boosts model performance with substantial gains, outperforming existing augmentation techniques.
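
The abstract describes the augmentation concretely enough to sketch: pick the K largest-magnitude frequency components and permute them, leaving the rest of the spectrum untouched. K and the choice to pin the DC component are assumptions.

```python
import torch

def dominant_shuffle(x, k=4):
    """x: (batch, seq_len) real-valued time series."""
    spec = torch.fft.rfft(x, dim=-1)
    mag = spec.abs()
    mag[..., 0] = 0.0                       # keep the DC component fixed
    topk = mag.topk(k, dim=-1).indices      # dominant frequencies per series
    # random permutation of the k dominant indices, independently per row
    perm = topk.gather(-1, torch.argsort(
        torch.rand_like(topk, dtype=torch.float), dim=-1))
    shuffled = spec.clone()
    shuffled.scatter_(-1, topk, spec.gather(-1, perm))
    return torch.fft.irfft(shuffled, n=x.size(-1), dim=-1)
```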

901Enhanced Diffusion Sampling via Extrapolation with Multiple ODE Solutions

[openreview] [pdf]

Abstract Diffusion probabilistic models (DPMs), while effective in generating high-quality samples, often suffer from high computational costs due to the iterative sampling process. To address this, we propose an enhanced ODE-based sampling method for DPMs inspired by Richardson extrapolation, which has been shown to reduce numerical error and improve convergence rates. Our method, termed RX-DPM, utilizes numerical solutions obtained over multiple denoising steps, leveraging the multiple ODE solutions to extrapolate the denoised prediction in DPMs. This significantly enhances the accuracy of estimations for the final sample while preserving the number of function evaluations (NFEs). Unlike standard Richardson extrapolation, which assumes uniform discretization of the time grid, we have developed a more general formulation tailored to arbitrary time step scheduling, guided by the local truncation error derived from a baseline sampling method. The simplicity of our approach facilitates accurate estimation of numerical solutions without additional computational overhead, and allows for seamless and convenient integration into various DPMs and solvers. Additionally, RX-DPM provides explicit error estimates, effectively illustrating the faster convergence achieved as the order of the leading error term increases. Through a series of experiments, we demonstrate that the proposed method effectively enhances the quality of generated samples without requiring additional sampling iterations.
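
As a reference point, classical Richardson extrapolation over one interval looks as follows, assuming a solver step of known local order p; RX-DPM generalizes this idea to the non-uniform time grids used by diffusion samplers.

```python
def richardson_step(step_fn, x, t, h, p=1):
    """step_fn(x, t, h): one solver step of size h starting at (x, t)."""
    coarse = step_fn(x, t, h)                     # one step of size h
    half = step_fn(x, t, h / 2)                   # two steps of size h/2
    fine = step_fn(half, t + h / 2, h / 2)
    # combine the two estimates to cancel the leading O(h^p) error term
    return fine + (fine - coarse) / (2 ** p - 1)
```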

902Double Check My Desired Return: Transformer with Value Validation for Offline RL

[openreview] [pdf]

Abstract Recently, there has been increasing interest in applying Transformers to offline reinforcement learning (RL). Existing methods typically frame offline RL as a sequence modeling problem and learn actions via Supervised learning (RvS). However, RvS-trained Transformers struggle to align actual returns with desired target returns, especially when dealing with underrepresented returns in the dataset (interpolation) or missed higher returns that could be achieved by stitching sub-optimal trajectories (extrapolation). In this work, we propose a novel method that Double Checks the Transformer with value validation for Offline RL (Doctor). Doctor integrates the strengths of supervised learning (SL) and temporal difference (TD) learning by jointly optimizing the action prediction and value function. SL stabilizes the prediction of actions conditioned on target returns, while TD learning adds stitching capability to the Transformer. During inference, we introduce a double-check mechanism. We sample actions around desired target returns and validate them with value functions. This mechanism ensures better alignment between the predicted action and the desired target return and is beneficial for further online exploration and fine-tuning. We evaluate Doctor on the D4RL benchmark in both offline and offline-to-online settings, demonstrating that Doctor performs much better in return alignment, both within and beyond the dataset. Furthermore, Doctor performs on par with or outperforms existing RvS-based and TD-based offline RL methods on the final performance.
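
A hedged sketch of the double-check mechanism: sample candidate actions conditioned on the desired target return and keep the one whose learned value best matches that target. The `policy.sample` interface and the Q-ensemble averaging are assumptions, not the paper's exact design.

```python
import torch

@torch.no_grad()
def double_check_action(policy, q_ensemble, state, target_return, n=16):
    """state: (1, obs_dim); returns the action whose Q best matches the target."""
    actions = policy.sample(state, target_return, n)       # (n, act_dim), assumed API
    q_vals = torch.stack(
        [q(state.expand(n, -1), actions) for q in q_ensemble]).mean(0)
    best = (q_vals - target_return).abs().argmin()          # validate vs. target
    return actions[best]
```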

903Decentralized Sporadic Federated Learning: A Unified Algorithmic Framework with Convergence Guarantees

[openreview] [pdf]

Abstract Decentralized federated learning (DFL) captures FL settings where both (i) model updates and (ii) model aggregations are exclusively carried out by the clients without a central server. Existing DFL works have mostly focused on settings where clients conduct a fixed number of local updates between local model exchanges, overlooking heterogeneity and dynamics in communication and computation capabilities. In this work, we propose Decentralized Sporadic Federated Learning (DSpodFL), a DFL methodology built on a generalized notion of sporadicity in both local gradient and aggregation processes. DSpodFL subsumes many existing decentralized optimization methods under a unified algorithmic framework by modeling the per-iteration (i) occurrence of gradient descent at each client and (ii) exchange of models between client pairs as arbitrary indicator random variables, thus capturing heterogeneous and time-varying computation/communication scenarios. We analytically characterize the convergence behavior of DSpodFL for both convex and non-convex models and for both constant and diminishing learning rates, under mild assumptions on the communication graph connectivity, data heterogeneity across clients, and gradient noises. We show how our bounds recover existing results from decentralized gradient descent as special cases. Experiments demonstrate that DSpodFL consistently achieves improved training speeds compared with baselines under various system settings.

904Discrete Bregman Divergence

[openreview] [pdf]

Abstract The Bregman divergence, which is generated from a convex function, is commonly used as a pseudo-distance for comparing vectors or functions in continuous spaces. In contrast, defining an analog of the Bregman divergence for discrete spaces is nontrivial. Iyer and Bilmes (2012a) considered Bregman divergences on discrete domains using submodular functions as generating functions, the discrete analogs of convex functions. In this paper, we further generalize this framework to cases where the generating function is neither submodular nor supermodular, thus increasing the flexibility and representational capacity of the resulting divergence, which we term the discrete Bregman divergence. Additionally, we introduce a learnable form of this divergence using permutation-invariant neural networks (NNs) and demonstrate through experiments that it effectively captures key structural properties in discrete data, outperforming existing methods on tasks such as clustering. This work addresses the challenge of defining meaningful divergences in discrete settings and provides a new tool for tasks requiring structure-preserving distance measures.

905Fundamental Limits of Prompt Tuning Transformers: Universality, Capacity and Efficiency

[openreview] [pdf]

Abstract We investigate the statistical and computational limits of prompt tuning for transformer-based foundation models. Our key contributions are to show that prompt tuning on single-head transformers with only a single self-attention layer (i) is universal, and (ii) supports efficient (even nearly-linear time) algorithms under the Strong Exponential Time Hypothesis (SETH). Statistically, we prove that prompt tuning on such simplest possible transformers is a universal approximator for sequence-to-sequence Lipschitz functions. In addition, we provide an exponential-in-$dL$ and -in-$(1/\epsilon)$ lower bound on the number of soft-prompt tokens required for prompt tuning to memorize any dataset with 1-layer, 1-head transformers. Computationally, we identify a phase transition in the efficiency of prompt tuning, determined by the norm of the soft-prompt-induced keys and queries, and provide an upper bound criterion. Beyond this criterion, no sub-quadratic (efficient) algorithm for prompt tuning exists under SETH. Within this criterion, we showcase our theory by proving the existence of almost-linear time prompt tuning inference algorithms. These fundamental limits provide important necessary conditions for designing expressive and efficient prompt tuning methods for practitioners.

906Advantage-Guided Distillation for Preference Alignment in Small Language Models

[openreview] [pdf]

Abstract Alignment techniques such as RLHF enable LLMs to generate outputs that align with human preferences and play an essential role in their effectiveness. However, their impact often diminishes when applied to smaller language models, likely due to the limited capacity of these models. Instead of directly applying existing alignment techniques to smaller models, we propose to utilize a well-aligned teacher LLM to guide the alignment process for these models, thereby facilitating the transfer of the teacher’s knowledge of human preferences to the student model. To achieve this, we first explore a straightforward approach, Dual-Constrained Knowledge Distillation (DCKD), that employs knowledge distillation with two KL-divergence constraints from the aligned teacher to the unaligned student. To further enhance the contrastive effect, we then propose Advantage-Guided Distillation for Preference Alignment (ADPA), which leverages an advantage function from the aligned teacher to deliver more nuanced, distribution-level reward signals for the student’s alignment. Our experimental results demonstrate that these two approaches appreciably improve the alignment of smaller language models and narrow the performance gap with their larger counterparts.
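
A minimal sketch of a dual-constrained distillation objective in the spirit of DCKD, assuming per-response logits of shape (batch, vocab) for chosen (w) and rejected (l) completions; the loss weighting and the form of the preference loss are illustrative assumptions.

```python
import torch.nn.functional as F

def dckd_loss(student_logits_w, student_logits_l,
              teacher_logits_w, teacher_logits_l,
              preference_loss, alpha=1.0):
    """Preference loss plus two KL constraints pulling the student
    toward the aligned teacher on chosen and rejected responses."""
    kl_w = F.kl_div(F.log_softmax(student_logits_w, -1),
                    F.log_softmax(teacher_logits_w, -1),
                    log_target=True, reduction="batchmean")
    kl_l = F.kl_div(F.log_softmax(student_logits_l, -1),
                    F.log_softmax(teacher_logits_l, -1),
                    log_target=True, reduction="batchmean")
    return preference_loss + alpha * (kl_w + kl_l)
```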

907Exploring the Causal Mechanisms: Towards Robust and Explainable Algorithm Selection

[openreview] [pdf]

Abstract Algorithm selection aims to identify the optimal performing algorithm before execution. Existing techniques typically focus on the observed correlations between algorithm performance and meta-features. However, little research has explored the underlying mechanisms of algorithm selection, specifically what characteristics an algorithm must possess to effectively tackle problems with certain feature values. This gap not only limits the explainability but also makes existing models vulnerable to data bias and distribution shift. This paper introduces causality to describe this mechanism, proposing a novel modeling paradigm that aligns more closely with the fundamental logic of algorithm selection. By leveraging causal relationships to characterize the algorithm feature distribution conditioned on problem features, our approach enhances robustness against marginal distribution changes and allows for finer-grained predictions through the reconstruction of optimal algorithm features, with the final decision relying on differences between reconstructed and rejected algorithm features. Furthermore, we demonstrate that the learned causal graph and the proposed counterfactual calculations endow our approach with both model-level and instance-level explainability. Extensive experiments on the ASlib benchmark validate the advantages of the proposed model in terms of robustness and explainability. The code will be made publicly available after the review process.

908Unlocking Video-LLM via Agent-of-Thoughts Distillation

[openreview] [pdf]

Abstract This paper tackles the problem of video question answering (VideoQA), a task that often requires multi-step reasoning and a profound understanding of spatial-temporal dynamics. While large generative video-language models perform well on benchmarks, they often lack explainability and spatial-temporal grounding. In this paper, we propose Agent-of-Thoughts Distillation (AoTD), a method that enhances generative models by incorporating automatically generated Chain-of-Thoughts (CoTs) into the instruction-tuning process. Specifically, we leverage an agent-based system to decompose complex questions into sub-tasks and address them with specialized vision models; the intermediate results are then treated as reasoning chains. We also introduce a verification mechanism using a large language model (LLM) to ensure the reliability of generated CoTs. Extensive experiments demonstrate that AoTD improves the performance on multiple-choice and open-ended benchmarks.

909R3HF: Reward Redistribution for Enhancing Reinforcement Learning from Human Feedback

[openreview] [pdf]

Abstract Reinforcement learning from human feedback (RLHF) provides a paradigm for aligning large language models (LLMs) with human preferences. This involves the initial training of a reward model based on pairwise human feedback. The reward model is subsequently utilized in reinforcement learning to assess the scores of each generated sentence as a whole, further guiding the optimization of LLMs. However, current approaches have a significant shortcoming: They allocate a single, sparse, and delayed reward to an entire sequence of output. This may overlook some significant individual contributions of each token towards the desired outcome. To overcome this limitation, our paper proposes a novel reward redistribution method called R3HF, which facilitates a more fine-grained, token-level reward allocation. Specifically, our method treats the reward prediction task of the reward model as a regression problem. As a result, the redistributed rewards are computed by evaluating the specific contribution of each token to the reward model’s output. This detailed approach improves the model’s understanding of language nuances, leading to more precise enhancements in its performance. Our method is crafted to integrate seamlessly with most current techniques while incurring minimal computational costs. Through comprehensive experiments across diverse datasets and tasks, we have verified the effectiveness and superiority of our approach.
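
One simple instantiation of token-level reward redistribution consistent with the regression view described here is to score every prefix with the reward model and take successive differences as per-token contributions. The reward-model interface below is an assumption, and the per-prefix forward passes are kept naive for clarity.

```python
import torch

@torch.no_grad()
def redistribute_rewards(reward_model, token_ids):
    """token_ids: (seq_len,) one generated response.
    reward_model maps a (1, t) prefix to a scalar score (assumed API)."""
    prefix_scores = torch.stack([
        reward_model(token_ids[: t + 1].unsqueeze(0)).squeeze()
        for t in range(len(token_ids))
    ])
    # reward of token t = marginal change it causes in the predicted score
    return torch.cat([prefix_scores[:1],
                      prefix_scores[1:] - prefix_scores[:-1]])
```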

910Spreading Out-of-Distribution Detection on Graphs

[openreview] [pdf]

Abstract Node-level out-of-distribution (OOD) detection on graphs has received significant attention from the machine learning community. However, previous approaches are evaluated using unrealistic benchmarks that consider only randomly selected OOD nodes, failing to reflect the interactions among nodes. In this paper, we introduce a new challenging task to model the interactions of OOD nodes in a graph, termed spreading OOD detection, where a newly emerged OOD node spreads its property to neighboring nodes. We curate realistic benchmarks by employing the epidemic spreading models that simulate the spreading of OOD nodes on the graph. We also showcase a "Spreading COVID-19" dataset to demonstrate the applicability of spreading OOD detection in real-world scenarios. Furthermore, to effectively detect spreading OOD samples under the proposed benchmark setup, we present a new approach called energy distribution-based detector (EDBD), which includes a novel energy-aggregation scheme. EDBD is designed to mitigate undesired mixing of OOD scores between in-distribution (ID) and OOD nodes. Our extensive experimental results demonstrate the superiority of our approach over state-of-the-art methods in both spreading OOD detection and conventional node-level OOD detection tasks across seven benchmark datasets.
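
For intuition, the basic energy-based OOD score and one step of neighborhood aggregation can be sketched as follows; the weighting scheme EDBD uses to prevent mixing of ID and OOD scores is paper-specific and omitted, and the mixing coefficient `alpha` is an assumption.

```python
import torch

def energy_scores(logits):
    """logits: (N, C) classifier outputs; lower energy suggests in-distribution."""
    return -torch.logsumexp(logits, dim=-1)

def aggregate_energy(energy, adj, alpha=0.5):
    """One propagation step. adj: (N, N) row-normalized adjacency; energy: (N,)."""
    return alpha * energy + (1 - alpha) * adj @ energy
```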

911Double Descent Meets Out-of-Distribution Detection: Theoretical Insights and Empirical Analysis of the role of model complexity

[openreview] [pdf]

Abstract While overparameterization is known to benefit generalization, its impact on Out-Of-Distribution (OOD) detection is less understood. This paper investigates the influence of model complexity in OOD detection. We propose an expected OOD risk metric to evaluate classifiers’ confidence on both training and OOD samples. Leveraging Random Matrix Theory, we derive bounds for the expected OOD risk of binary least-squares classifiers applied to Gaussian data. We show that the OOD risk exhibits an infinite peak when the number of parameters equals the number of samples, which we associate with the double descent phenomenon. Our experimental study on different OOD detection methods across multiple neural architectures extends our theoretical insights and highlights a double descent curve. Our observations suggest that overparameterization does not necessarily lead to better OOD detection. Using the Neural Collapse framework, we provide insights to better understand this behavior. To facilitate reproducibility, our code will be made publicly available upon publication.

912Open-World Test-Time Training: Self-Training with Contrastive Learning

[openreview] [pdf]

Abstract Traditional test-time training (TTT) methods, while addressing domain shifts, often assume a consistent class set, which limits their applicability in real-world scenarios with infinite variety. Open-World Test-Time Training (OWTTT) addresses the challenge of generalizing deep learning models to unknown target domain distributions, especially in the presence of strong Out-of-Distribution (OOD) data. Existing TTT methods often struggle to maintain performance when confronted with strong OOD data. In OWTTT, the primary focus has been on distinguishing between strong and weak OOD data. However, during the early stages of TTT, initial feature extraction is hampered by interference from strong OOD and corruptions, leading to reduced contrast and premature classification of certain classes as strong OOD. To handle this problem, we introduce Open World Dynamic Contrastive Learning (OWDCL), an innovative approach that leverages contrastive learning to augment positive sample pairs. This strategy not only enhances contrast in the early stages but also significantly improves model robustness in later stages. Across the comparison datasets, our OWDCL model achieves state-of-the-art performance.

913Breaking through Data Scarcity: Knowledge Transfer in Offline Reinforcement Learning

[openreview] [pdf]

Abstract We focus on knowledge transfer in offline reinforcement learning (RL), which aims to significantly improve the learning of an optimal policy in a target task based on a pre-collected dataset without further interactions with the environment. Data scarcity and high-dimensional feature spaces pose serious challenges to offline RL in many real-world applications, and knowledge transfer offers a promising solution. We propose a novel and comprehensive knowledge transfer framework for offline RL, which carefully considers the relationship between the target and source tasks within the linear Markov decision process (MDP) framework. This enables efficient knowledge transfer from related source tasks to enhance learning in the target task and effectively address data scarcity concerns in offline RL. Our main contributions include establishing a relationship between the learning processes of the target and source tasks, introducing an effective and robust knowledge transfer technique to reduce the suboptimality of the learned policy, and demonstrating the significant effectiveness of the knowledge transfer framework through detailed theoretical analysis. Our work significantly contributes to the advancement of offline RL by providing a practical and robust framework for knowledge transfer, facilitating more efficient and effective data utilization in various applications.

914BAYESIAN EXPERIMENTAL DESIGN VIA CONTRASTIVE DIFFUSIONS

[openreview] [pdf]

Abstract Bayesian Optimal Experimental Design (BOED) is a powerful tool to reduce the cost of running a sequence of experiments. When based on the Expected Information Gain (EIG), design optimization corresponds to the maximization of some intractable expected contrast between prior and posterior distributions. Scaling this maximization to high dimensional and complex settings has been an issue due to BOED inherent computational complexity. In this work, we introduce an expected posterior distribution with cost-effective sampling properties and provide tractable access to the EIG contrast maximization via a new EIG gradient expression. Diffusion-based samplers are used to compute the dynamics of the expected posterior and ideas from bi-level optimization are leveraged to derive an efficient joint sampling-optimization loop, without resorting to lower bound approximations of the EIG. The resulting efficiency gain allows to extend BOED to the well-tested generative capabilities of diffusion models. By incorporating generative models into the BOED framework, we expand its scope and its use in scenarios that were previously impractical. Numerical experiments and comparison with state-of-the-art methods show the potential of the approach.

915Zero-shot Novel View Synthesis via Adaptive Modulating Video Diffusion Process

[openreview] [pdf]

Abstract By harnessing the potent generative capabilities of pre-trained large video diffusion models, we propose a new novel view synthesis paradigm that operates without the need for training. The proposed method adaptively modulates the diffusion sampling process with the given views to enable the creation of visually pleasing results from single or multiple views of static scenes or monocular videos of dynamic scenes. Specifically, built upon our theoretical modeling, we iteratively modulate the score function with the given scene priors represented with warped input views to control the video diffusion process. Moreover, by theoretically exploring the boundary of the estimation error, we achieve the modulation in an adaptive fashion according to the view pose and the number of diffusion steps. Extensive evaluations on both static and dynamic scenes substantiate the significant superiority of our method over state-of-the-art methods both quantitatively and qualitatively. The source code can be found on the anonymous webpage: https://github.com/PAPERID5494/VD_NVS. We also refer reviewers to the Supplementary Material for the video demo.

916FedGO: Federated Ensemble Distillation with GAN-based Optimality

[openreview] [pdf]

Abstract For federated learning in practical settings, a significant challenge is the considerable diversity of data across clients. To tackle this data heterogeneity issue, it has been recognized that federated ensemble distillation is effective. Federated ensemble distillation requires an unlabeled dataset on the server, which could either be an extra dataset the server already possesses or a dataset generated by training a generator through a data-free approach. Then, it proceeds by generating pseudo-labels for the unlabeled data based on the predictions of client models and training the server model using this pseudo-labeled dataset. Consequently, the efficacy of ensemble distillation hinges on the quality of these pseudo-labels, which, in turn, poses a challenge of appropriately assigning weights to client predictions for each data point, particularly in scenarios with data heterogeneity. In this work, we suggest a provably near-optimal weighting method for federated ensemble distillation, inspired by theoretical results in generative adversarial networks (GANs). Our weighting method utilizes client discriminators, trained at the clients based on a generator distributed from the server and their own datasets. Our comprehensive experiments on various image classification tasks illustrate that our method significantly improves the performance over baselines, under various scenarios with and without extra server dataset. Furthermore, we provide an extensive analysis of additional communication cost, privacy leakage, and computational burden caused by our weighting method.

917Greedy Learning to Optimize with Convergence Guarantees

[openreview] [pdf]

Abstract Learning to optimize (L2O) is an approach that leverages training data to accelerate the solution of optimization problems. Many approaches use unrolling to parametrize the update step and learn optimal parameters. Although L2O has shown empirical advantages over classical optimization algorithms, memory restrictions often greatly limit the unroll length, and learned algorithms usually do not provide convergence guarantees. In contrast, we introduce a novel method employing a greedy strategy that learns iteration-specific parameters by minimizing the function value at the next iteration. This enables training over significantly more iterations while maintaining constant memory usage. We parameterize the update such that parameter learning corresponds to solving a convex optimization problem at each iteration. In particular, we explore preconditioned gradient descent with multiple parametrizations, including a novel convolutional preconditioner. With our learned algorithm, convergence on the training set is proven even when the preconditioner is neither symmetric nor positive definite. Convergence on a class of unseen functions is also obtained, ensuring robust performance and generalization beyond the training data. We test our learned algorithms on two inverse problems, image deblurring and Computed Tomography, on which the learned convolutional preconditioner demonstrates improved empirical performance over classical optimization algorithms such as Nesterov’s Accelerated Gradient Method and the quasi-Newton method L-BFGS.
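
A hedged sketch of the greedy strategy with a diagonal preconditioner: at each outer iteration, fit only that iteration's preconditioner by minimizing the function value at the next iterate, then freeze it and move on, keeping memory constant in the number of iterations. The positivity parameterization and the inner Adam optimizer are my simplifications; the paper additionally covers convolutional preconditioners and non-positive-definite cases.

```python
import torch

def learn_greedy_preconditioners(f, x0, n_iters=20, inner_steps=50):
    """f: scalar-valued objective; x0: initial iterate."""
    x, learned = x0.clone().requires_grad_(True), []
    for _ in range(n_iters):
        g = torch.autograd.grad(f(x), x)[0].detach()
        log_p = torch.zeros_like(x, requires_grad=True)  # P = exp(log_p) > 0
        opt = torch.optim.Adam([log_p], lr=0.1)
        for _ in range(inner_steps):                     # fit this iteration only
            opt.zero_grad()
            f(x.detach() - log_p.exp() * g).backward()   # value at next iterate
            opt.step()
        P = log_p.detach().exp()
        learned.append(P)
        x = (x.detach() - P * g).requires_grad_(True)    # take the greedy step
    return learned
```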

918Procedural Fairness Through Addressing Social Determinants of Opportunity

[openreview] [pdf]

Abstract Social determinants of opportunity are variables that, while not directly pertaining to any specific individual, capture key aspects of contexts and environments that have direct causal influences on certain attributes of an individual, e.g., environmental pollution in an area affects an individual’s health condition, and educational resources in a neighborhood influence an individual’s academic preparedness. Previous algorithmic fairness literature often overlooks social determinants of opportunity, leading to implications for procedural fairness and structural justice that are incomplete and potentially even inaccurate. We propose a modeling framework that explicitly incorporates social determinants of opportunity and their causal influences on individual-level attributes of interest. To demonstrate theoretical perspectives and practical applicability of our framework, we consider college admissions as a running example. Specifically, for three mainstream admission procedures that have historically been implemented or are still in use today, we distinguish and draw connections between the outcome of admission decision-making and the underlying distribution of academic preparedness in the applicant population. Our findings suggest that mitigation strategies centering solely around protected features may introduce new procedural unfairness when addressing existing discrimination. Considering both individual-level attributes and social determinants of opportunity facilitates a more comprehensive explication of benefits and burdens experienced by individuals from diverse demographic backgrounds as well as contextual environments, which is essential for understanding and achieving procedural fairness effectively and transparently.

919Topology-aware Graph Diffusion Model with Persistent Homology

[openreview] [pdf]

Abstract Generating realistic graphs presents challenges in estimating an accurate distribution of graphs in an embedding space while preserving structural characteristics such as topology. However, existing graph generation methods primarily focus on approximating the joint distribution of graph nodes and edges, overlooking topology-wise similarity and hindering accurate representation of global graph structures such as connected components and loops. To address this issue, we propose a topology-aware diffusion-based graph generation method that aims to closely resemble the structural characteristics of the original graph by leveraging persistent homology from topological data analysis (TDA). Specifically, we suggest a novel loss function, Persistence Diagram Matching (PDM) loss, which ensures that the generated graphs closely match the topology of the original graphs, enhancing their fidelity and preserving essential homological properties. Also, we introduce a novel topology-aware attention mechanism to enhance the self-attention module in the denoising network. Through comprehensive experiments, we demonstrate the effectiveness of our approach not only by exhibiting high generation performance across various metrics, but also by demonstrating a closer alignment with the distribution of topological features observed in the original graphs. In addition, application to real brain network data showcases its versatility and potential for complex, real-world graph applications.

920METHODS OF IMPROVING LLM TRAINING STABILITY

[openreview] [pdf]

Abstract Training stability of large language models (LLMs) is an important research topic. Reproducing training instabilities can be costly, so we use a small language model with 830M parameters and experiment with higher learning rates to force models to diverge, as in Wortsman et al. (2024). One of the sources of training instability is the growth of logits in attention layers (Dehghani et al., 2023). We extend the focus of the previous work (Dehghani et al., 2023; Wortsman et al., 2024) and look not only at the magnitude of the logits but at all outputs of linear layers in the Transformer block. We observe that with a high learning rate the L2 norm of all linear layer outputs grows with each training step and the model diverges. Specifically, we observe that the QKV, Proj and FC2 layers have the largest growth of the output magnitude. This prompts us to explore several options: 1) apply layer normalization not only after the QK layers (as is done in Dehghani et al. (2023) and Wortsman et al. (2024)) but after the Proj and FC2 layers too; 2) apply layer normalization after the QKV layer (and remove pre-normalization); 3) apply QK layer normalization together with softmax capping. We show that with the last two methods we can increase the learning rate by 1.5x (without model divergence) in comparison to an approach based on QK layer normalization only (Dehghani et al., 2023). We also observe significant perplexity improvements for all three methods in comparison to the baseline model.
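
Two of the listed options are easy to illustrate. The sketch below shows softmax capping of attention logits via a scaled tanh and an extra layer normalization placed after the FC2 output; the cap value and layer sizes are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn

def cap_logits(logits, cap=30.0):
    # bounds attention logits to (-cap, cap), preventing unbounded growth
    return cap * torch.tanh(logits / cap)

class StabilizedMLP(nn.Module):
    """Feed-forward block with layer normalization after the FC2 output."""
    def __init__(self, d_model, d_ff):
        super().__init__()
        self.fc1 = nn.Linear(d_model, d_ff)
        self.fc2 = nn.Linear(d_ff, d_model)
        self.post_ln = nn.LayerNorm(d_model)  # extra LN to control output norm

    def forward(self, x):
        return self.post_ln(self.fc2(torch.relu(self.fc1(x))))
```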

921LSTR: Long-Short Range Aggregation for Trajectory Prediction at Intersection Scenarios

[openreview] [pdf]

Abstract Trajectory prediction is crucial for practical applications, encompassing navigation for autonomous vehicles and the implementation of safety systems based on the Internet of Vehicles (IoV). Most existing methods significantly rely on comprehensive map information, employing robust rule constraints to incrementally predict trajectories within the driver’s local decision-making context. However, in environments characterized by weak rule enforcement, such as urban intersections, these approaches neglect the disparity between the driver’s overarching intentions and current behaviors. Recognizing the characteristics of intersection traffic flow, which is macroscopically organized yet microscopically disordered and highly heterogeneous, this paper presents a novel model termed Long-short Range Aggregation for Trajectory Prediction in Intersections (LSTR). This model anchors the vehicle’s local decision-making process to long-range intentions. Specifically, LSTR predicts the vehicle’s destination via a global intention inference module and models its long-range driving intentions through clustering to extract macroscopic traffic flow patterns. This long-range intention subsequently informs the short-range local interaction behaviors captured by the local behavior decision module. Ultimately, the fused features from these two modules are analyzed using a multi-modal decoder to interpret the various motion patterns, resulting in the trajectory prediction outcomes. We rigorously validate the proposed framework across multiple intersection scenarios utilizing real-world datasets, including inD, rounD, and a subset of WOMD. Experimental results demonstrate that our model outperforms numerous benchmarks without relying on additional information such as HD maps of intersections.

922Learning Symmetries through Loss Landscape

[openreview] [pdf]

Abstract Incorporating equivariance as an inductive bias into deep learning architectures to take advantage of data symmetry has been successful in multiple applications, such as chemistry and dynamical systems. Building equivariant architectures, particularly w.r.t. roto-translations, is crucial for effectively modeling geometric graphs and molecules, where the understanding of 3D structures enhances generalization. However, despite their potential, equivariant models often pose challenges due to their high computational complexity. In this paper, we study the capabilities of unconstrained models (which do not build equivariance into the architecture) and how they generalize compared to equivariant models. We show that unconstrained models can learn approximate symmetries by minimizing an additional simple equivariance loss. By formulating equivariance as a new learning objective, we can control the level of approximate equivariance in the model. Our method achieves competitive performance compared to equivariant baselines while being 10x faster at inference and 2.5x faster at training.
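
A minimal version of such an equivariance loss for rotations, assuming a model f that maps (N, 3) coordinates to (N, 3) vector features; the penalty measures how far f is from commuting with a random rotation.

```python
import torch

def random_rotation():
    # QR decomposition of a Gaussian matrix yields a random orthogonal matrix
    q, _ = torch.linalg.qr(torch.randn(3, 3))
    return q * torch.sign(torch.det(q))  # ensure det(R) = +1 (proper rotation)

def equivariance_loss(f, x):
    """Penalizes the mismatch between f(R x) and R f(x)."""
    R = random_rotation()
    return ((f(x @ R.T) - f(x) @ R.T) ** 2).mean()
```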

923Do Think Tags Really Help LLMs Plan? A Critical Evaluation of ReAct-Style Prompting

[openreview] [pdf]

Abstract The reasoning abilities of Large Language Models (LLMs) remain a topic of debate and are critically tested in sequential decision-making problems. ReAct, a recently popular method, claims to enhance LLM reasoning abilities by directly prompting them with "interleaving reasoning trace with action execution" in text-based planning domains such as AlfWorld and WebShop. However, given the different components of ReAct-style prompting, it remains unclear what the source of improvement in LLM performance is. In this paper, we critically examine the claims of ReAct-style prompting for sequential decision-making problems. By introducing systematic variations to the input prompt, we perform a sensitivity analysis along the original claims of ReAct. Contrary to these claims and common use-cases that utilize ReAct-style prompting, we find that the performance is minimally influenced by the interleaved reasoning trace or by the content of these generated reasoning traces. Instead, the performance of LLMs is primarily driven by the unreasonably high degree of similarity between input example tasks and queries, implicitly forcing the prompt designer to provide instance-specific examples, which significantly increases the cognitive burden on the human. Our empirical results, on the same suite of domains as ReAct, show that the perceived reasoning abilities of LLMs stem from the exemplar-query similarity and approximate retrieval rather than any inherent reasoning abilities.

924In-Context Transfer Learning: Demonstration Synthesis by Transferring Similar Tasks

[openreview] [pdf]

Abstract In-context learning (ICL) is an effective approach to help large language models (LLMs) adapt to various tasks by providing demonstrations of the target task. Considering the high cost of labeling demonstrations, many methods propose synthesizing demonstrations from scratch using LLMs. However, the quality of the demonstrations synthesized from scratch is limited by the capabilities and knowledge of LLMs. To address this, inspired by transfer learning, we propose In-Context Transfer Learning (ICTL), which synthesizes target task demonstrations by transferring labeled demonstrations from similar source tasks. ICTL consists of two steps: source sampling and target transfer. First, we define an optimization objective, which minimizes transfer error to sample source demonstrations similar to the target task. Then, we employ LLMs to transfer the sampled source demonstrations to match the definition and format of the target task. Experiments on Super-NI show that ICTL outperforms synthesis from scratch by 2.0% on average, demonstrating the effectiveness of our method.

925MixMax: Distributional Robustness in Function Space via Optimal Data Mixtures

[openreview] [pdf]

Abstract Machine learning models are often required to perform well across several pre-defined settings, such as a set of user groups. Worst-case performance is a common metric to capture this requirement, and is the objective of group distributionally robust optimization (group DRO). Unfortunately, these methods struggle when the loss is non-convex in the parameters, or the model class is non-parametric. Here, we make a classical move to address this: we reparameterize group DRO from parameter space to function space, which results in a number of advantages. First, we show that group DRO over the space of bounded functions admits a minimax theorem. Second, for cross-entropy and mean squared error, we show that the minimax optimal mixture distribution is the solution of a simple convex optimization problem. Thus, provided one is working with a model class of universal function approximators, group DRO can be solved by a convex optimization problem followed by a classical risk minimization problem. We call our method MixMax. In our experiments, we found that MixMax matched or outperformed the standard group DRO baselines, and in particular, MixMax improved the performance of XGBoost over the only baseline, data balancing, for variations of the ACSIncome and CelebA annotations datasets.

926Cross-Entropy Is All You Need To Invert the Data Generating Process

[openreview] [pdf]

Abstract Supervised learning has become a cornerstone of modern machine learning, yet a comprehensive theory explaining its effectiveness remains elusive. Empirical phenomena, such as neural analogy-making and the linear representation hypothesis, suggest that supervised models can learn interpretable factors of variation in a linear fashion. Recent advances in self-supervised learning, particularly nonlinear Independent Component Analysis, have shown that these methods can recover latent structures by inverting the data generating process. We extend these identifiability results to parametric instance discrimination, then show how insights transfer to the ubiquitous setting of supervised learning with cross-entropy minimization. We prove that even in standard classification tasks, models learn representations of ground-truth factors of variation up to a linear transformation. We corroborate our theoretical contribution with a series of empirical studies. First, using simulated data matching our theoretical assumptions, we demonstrate successful disentanglement of latent factors. Second, we show that on DisLib, a widely-used disentanglement benchmark, simple classification tasks recover latent structures up to linear transformations. Finally, we reveal that models trained on ImageNet encode representations that permit linear decoding of proxy factors of variation. Together, our theoretical findings and experiments offer a compelling explanation for recent observations of linear representations, such as superposition in neural networks. This work takes a significant step toward a cohesive theory that accounts for the unreasonable effectiveness of supervised deep learning.

927Torque-Aware Momentum

[openreview] [pdf]

Abstract Efficiently exploring complex loss landscapes is key to the performance of deep neural networks. While momentum-based optimizers are widely used in state-of-the-art setups, classical momentum can still struggle with large, misaligned gradients, leading to oscillations. To address this, we propose Torque-Aware Momentum (TAM), which introduces a damping factor based on the angle between the new gradients and previous momentum, stabilizing the update direction during training. Empirical results show that TAM, which can be combined with both SGD and Adam, enhances exploration, handles distribution shifts more effectively, and improves generalization performance across various tasks, including image classification and large language model fine-tuning, when compared to classical momentum-based optimizers.
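
A hedged sketch of a torque-aware update: damp the momentum by a factor computed from the angle between the incoming gradient and the current momentum. The specific damping function below, (1 + cos)/2, is an assumption standing in for the paper's; the same idea composes with Adam as noted in the abstract.

```python
import torch

def tam_step(param, grad, momentum, mu=0.9, lr=1e-2, eps=1e-12):
    """In-place SGD-with-momentum step where misaligned gradients damp momentum."""
    cos = torch.dot(grad.flatten(), momentum.flatten()) / (
        grad.norm() * momentum.norm() + eps)
    damping = (1.0 + cos) / 2.0             # 1 when aligned, 0 when opposed
    momentum.mul_(mu * damping).add_(grad)  # damped momentum accumulation
    param.add_(momentum, alpha=-lr)
    return momentum
```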

928Bandit Learning in Matching Markets with Indifference

[openreview] [pdf]

Abstract A rich line of recent works studies how participants in matching markets learn their unknown preferences through iterative interactions with each other. Two sides of participants in the market can be respectively formulated as players and arms in the bandit problem. To ensure market stability, the objective is to minimize the stable regret of each player. Though existing works provide significant theoretical upper bounds for players’ stable regret, the results heavily rely on the assumption that each participant has a strict preference ranking. However, in real applications, multiple candidates (e.g., workers in the labor market and students in school admission) usually demonstrate comparable performance levels, making it challenging for participants (e.g. employers and schools) to differentiate and rank their preferences. To deal with the potential indifferent preferences, we propose an adaptive exploration algorithm based on arm-guided Gale-Shapley (AE-AGS). We show that its stable regret is of order $O(NK \log T / \Delta^2)$, where $N$ is the number of players, $K$ the number of arms, $T$ the total time horizon, and $\Delta$ the minimum non-zero preference gap. To the best of our knowledge, this is the first polynomial regret bound applicable to the more general indifference setting, and it is only $O(N)$ worse than the state-of-the-art result in the strict preference setting. Extensive experiments demonstrate the algorithm’s effectiveness in handling such complex situations and its consistent superiority over baselines.

929OccVAR: Scalable 4D Occupancy Prediction via Next-Scale Prediction

[openreview] [pdf]

Abstract In this paper, we propose OCCVAR, a generative occupancy world model that simulates the movement of the ego vehicle and the evolution of the surrounding environment. Different from visual generation, an occupancy world model must capture the fine-grained 3D geometry and dynamic evolution of 3D scenes, posing great challenges for generative models. Recent approaches based on autoregression (AR) have demonstrated the potential to predict vehicle movement and future occupancy scenes simultaneously from historical observations, but they typically suffer from inefficiency and temporal degradation in long-horizon generation. To holistically address the efficiency and quality issues, we propose a spatial-temporal transformer via temporal next-scale prediction, aiming at predicting the 4D occupancy scenes from coarse to fine scales. To model the dynamic evolution of the scene, we prepend the ego movement to the tokenized occupancy sequence, enabling the prediction of ego movement and controllable scene generation. To model the fine-grained 3D geometry, OCCVAR utilizes a multi-scale scene tokenizer to capture the hierarchical information of the 3D scene. Experiments show that OCCVAR is capable of high-quality occupancy reconstruction, long-horizon generation, and fast inference compared to prior works.

930DiffusionTrend: A Minimalist Approach to Virtual Fashion Try-On

[openreview] [pdf]

Abstract In this paper, we introduce DiffusionTrend, a pioneering approach for virtual fashion try-on that forgoes the need for training diffusion models, thereby offering simple, conventional pose virtual try-on services with significantly reduced computational overhead. By leveraging advanced diffusion models, DiffusionTrend harnesses latents rich with prior information to capture the nuances of garment details. Throughout the diffusion denoising process, these details are seamlessly integrated into the model image generation, expertly directed by a precise garment mask crafted by a lightweight and compact CNN. Although our DiffusionTrend model initially demonstrates suboptimal metric performance, our exploratory approach offers several significant advantages: (1) It circumvents the need for resource-intensive training of diffusion models on large datasets. (2) It eliminates the necessity for various complex and user-unfriendly model inputs. (3) It delivers a visually compelling virtual try-on experience, underscoring the potential of training-free diffusion models for future research within the community. Overall, this initial foray into the application of untrained diffusion models in virtual try-on technology paves the way for further exploration and refinement in this innovative field.

931Achieving Optimal Breakdown for Byzantine-Robust Gossip

[openreview] [pdf]

Abstract Distributed approaches have many computational benefits, but they are vulnerable to attacks from a subset of devices transmitting incorrect information. This paper investigates Byzantine-resilient algorithms in a decentralized setting, where devices communicate directly with one another. We investigate the notion of breakdown point, and show an upper bound on the number of adversaries that decentralized algorithms can tolerate. We introduce CG+, an algorithm at the intersection of ClippedGossip and NNA, two popular approaches for robust decentralized learning. CG+ meets our upper bound, and thus obtains optimal robustness guarantees, whereas neither of the two existing approaches does. We provide experimental evidence for this gap by presenting an attack tailored to sparse graphs which breaks NNA but against which CG+ is robust.

932Context-Aware Online Recommendation with Bayesian Incentive Compatibility

[openreview] [pdf]

Abstract Recommender systems play a crucial role in internet economies by connecting users with relevant products or services. However, designing effective recommender systems faces two key challenges: (1) the exploration-exploitation tradeoff in balancing new product exploration against exploiting known preferences, and (2) context-aware Bayesian incentive compatibility in accounting for users’ heterogeneous preferences and self-interested behaviors. This paper formalizes these challenges into a Context-aware Bayesian Incentive-Compatible Recommendation Problem (CBICRP). To address the CBICRP, we propose a two-stage algorithm (RCB) that integrates incentivized exploration with an efficient offline learning component for exploitation. In the first stage, our algorithm explores available products while maintaining context-aware Bayesian incentive compatibility to determine sufficient sample sizes. The second stage employs inverse proportional gap sampling integrated with an arbitrary efficient machine learning method to ensure sublinear regret. Theoretically, we prove that RCB achieves O(√(KdT)) regret and satisfies Bayesian incentive compatibility (BIC). Empirically, we validate RCB’s strong incentive gain, sublinear regret, and robustness through simulations and a real-world application on personalized warfarin dosing. Our work provides a principled approach for incentive-aware recommendation in online preference learning settings.

933Online learning meets Adam: The Road of Interpretable Adaptive Optimizer Design

[openreview] [pdf]

Abstract This paper explores the theoretical foundations of Adam, a widely used adaptive optimizer. Building on recent developments in non-convex optimization and online learning, particularly the discounted-to-nonconvex conversion framework, we present two sets of results. First, we introduce clip-free FTRL, a novel variant of the classical Follow-the-Regularized-Leader (FTRL) algorithm. Unlike scale-free FTRL and the recently proposed β-FTRL, our clip-free variant eliminates the need for clipping operations, aligning more closely with Adam’s practical implementation. This modification provides deeper theoretical insights into Adam’s empirical success and aligns the theoretical framework with practical implementations. Second, by incorporating a refined analysis, we establish a Last Iterate Convergence (LIC) guarantee for the proposed discounted-to-nonconvex conversion algorithm, which differs from previous guarantees in which convergence is distributed evenly across all iterations. Additionally, we extend this result to provide a last iterate convergence guarantee for the popular β-FTRL algorithm under the same framework. However, the derived last iterate convergence of β-FTRL reveals a persistent fixed error, potentially suggesting either limitations in popular online learning methods or the need for additional assumptions about the objective function.

934In-Context Reinforcement Learning From Suboptimal Historical Data

[openreview] [pdf]

Abstract Large-scale transformer models have achieved remarkable empirical successes, largely due to their in-context learning capabilities. Inspired by this, we explore training an autoregressive transformer for in-context Reinforcement Learning (RL). In this setting, we initially train a transformer on an offline dataset consisting of trajectories collected from various RL instances, and then fix and use this transformer to create an action policy for new RL instances. Notably, we consider the setting where the offline dataset contains trajectories sampled from suboptimal behavioral policies. In this case, standard autoregressive training corresponds to imitation learning and results in suboptimal performance. To address this, we propose the Decision Importance Transformer (DIT), which emulates the actor-critic algorithm in an in-context manner. In particular, we first train a transformer-based value function that estimates the advantage functions of the behavior policies that collected the suboptimal trajectories. Then we train a transformer-based policy via a weighted maximum likelihood estimation loss, where the weights are constructed based on the trained value function to steer the suboptimal policies to the optimal ones. We conduct extensive experiments to test the performance of DIT on both bandit and Markov Decision Process problems. Our results show that DIT achieves superior performance, particularly when the offline dataset contains suboptimal historical data.
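
The weighted maximum-likelihood step can be sketched in a few lines, assuming exponentiated advantages as weights (an AWR-style choice; the paper's exact weighting scheme may differ):

```python
import torch
import torch.nn.functional as F

def weighted_mle_loss(logits, actions, advantages, temperature=1.0):
    """Advantage-weighted imitation loss: log-likelihood of each action,
    weighted by exp(advantage) so better-than-behavior actions dominate."""
    logp = F.log_softmax(logits, dim=-1)
    logp_a = logp.gather(-1, actions.unsqueeze(-1)).squeeze(-1)
    w = torch.exp(advantages / temperature).detach()   # value net is frozen here
    return -(w * logp_a).mean()
```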

935Propagation Alone is Enough for Graph Contrastive Learning

[openreview] [pdf]

Abstract Graph contrastive learning has recently gained substantial attention, leading to the development of various methodologies. In this work, we reveal that a simple training-free propagation method, PROP, achieves competitive results compared to carefully designed GCL methods across a diverse set of benchmarks. We elucidate the underlying rationale for PROP’s effectiveness by drawing connections between the propagation operator and established unsupervised learning algorithms. To investigate the reasons for the suboptimal performance of existing GCL methods, we decouple the propagation and transformation phases of graph neural networks. Our findings indicate that GCL inadequately learns effective transformation weights while exhibiting potential for solid propagation learning. In light of these insights, we enhance PROP with learnable propagation, introducing a novel GCL method termed PROPGCL. The effectiveness of PROPGCL is demonstrated through comprehensive evaluations.
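
A training-free propagation baseline of this kind fits in a few lines; the sketch below assumes the common symmetrically normalized adjacency with self-loops (the paper's PROP operator may differ in details):

```python
import numpy as np
import scipy.sparse as sp

def propagate(adj: sp.spmatrix, X: np.ndarray, k: int = 2) -> np.ndarray:
    """Apply the normalized adjacency k times to node features, with no
    learned parameters; the output can be fed directly to clustering or a
    linear evaluation head."""
    adj = adj + sp.eye(adj.shape[0])                    # add self-loops
    d_inv_sqrt = sp.diags(1.0 / np.sqrt(np.asarray(adj.sum(1)).ravel()))
    a_hat = d_inv_sqrt @ adj @ d_inv_sqrt               # D^-1/2 (A+I) D^-1/2
    for _ in range(k):
        X = a_hat @ X
    return X
```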

936Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples

[openreview] [pdf]

Abstract The ability to generate diverse solutions to a given problem is a hallmark of human creativity. This divergent reasoning is also crucial for machines, enhancing their robustness and enabling them to assist humans in many applications such as scientific discovery. However, existing approaches to multi-step reasoning with large language models (LLMs) have mostly focused only on the reasoning accuracy, without further discovering more diverse valid solutions. For example, supervised fine-tuning can improve LLM reasoning quality, but requires extensive supervised data to capture the full range of possible solutions. Reinforcement learning aims to find limited highest-reward solutions while neglecting the solution diversity. To fill this gap, we propose Flow of Reasoning (FoR), an efficient diversity-seeking LLM finetuning method aimed at improving reasoning quality and diversity with minimal data. FoR formulates multi-step LLM reasoning as a Markovian flow on a DAG-structured reasoning graph. This formulation allows us to incorporate and adapt principled GFlowNet approaches, for finetuning LLMs to sample diverse reasoning paths with probabilities proportional to the (unnormalized) reward of target problems. Extensive experiments show that, with limited training examples (e.g., 15 examples), FoR enables the discovery of diverse, creative, high-quality solutions, greatly outperforming a wide range of existing inference and training methods across five challenging puzzle-solving tasks, including BlocksWorld (embodied reasoning), Game24 (math puzzle solving), PrOntoQA (logical reasoning), Rubik’s Cube (spatial reasoning), and 1D-ARC (abstraction reasoning).

937From Global Assessment to Local Selection: Efficiently Solving Traveling Salesman Problems of All Sizes

[openreview] [pdf]

Abstract The Traveling Salesman Problem (TSP) is a well-known combinatorial optimization problem with broad real-world applications. Recent advancements in neural network-based TSP solvers have shown promising results. Nonetheless, these models often struggle to efficiently solve both small- and large-scale TSPs using the same set of pre-trained model parameters, limiting their practical utility. To address this issue, we introduce a novel neural TSP solver named GELD, built upon our proposed broad global assessment and refined local selection framework. Specifically, GELD integrates a lightweight Global-view Encoder (GE) with a heavyweight Local-view Decoder (LD) to enrich embedding representation while accelerating the decision-making process. Moreover, GE incorporates a novel low-complexity attention mechanism, allowing GELD to achieve low inference latency and scalability to larger-scale TSPs. Additionally, we propose a two-stage training strategy that utilizes training instances of different sizes to bolster GELD’s generalization ability. Extensive experiments conducted on both synthetic and real-world datasets demonstrate that GELD outperforms seven state-of-the-art models considering both solution quality and inference speed. Furthermore, GELD can be employed as a post-processing method to exchange affordable computing time for significantly improved solution quality, capable of solving TSPs with up to 744,710 nodes without relying on divide-and-conquer strategies.

938Experimental Design for Nonstationary Optimization

[openreview] [pdf]

Abstract Traditional methods for optimizing neural networks often struggle when used to train networks in settings where the data distributions change, and plasticity preservation methods have been shown to improve performance in such settings (e.g., continual learning and reinforcement learning). With the growing interest in nonstationary optimization and plasticity research, there is also a growing need to properly define experimental design and hyperparameter search protocols to enable principled research. Each newly proposed work typically adds several new hyperparameters and makes many more design decisions, such as hyperparameter selection protocols, evaluation protocols, and types of tasks examined. While innovation in experiment design is important, it is also necessary to (1) question whether those innovations are leading to the best progress and (2) have standardized practices that make it easier to directly compare to prior works. In this paper, we first perform an extensive empirical study of over 27,000 trials looking at the performance of different methods and hyperparameters across different settings and architectures used in the literature, to provide an evaluation of these methods and the hyperparameters they use under similar experimental conditions. We then examine several core experiment design choices made by the community, affirming some while providing evidence against others, and provide concrete recommendations and analysis that can be used to guide future research.

939Turn-by-Turn Driving Navigation: Leveraging Sequence Model for Real-time Audio Instructions

[openreview] [pdf]

Abstract Turn-by-turn (TBT) navigation systems are integral to modern driving experiences, providing real-time audio instructions to guide drivers safely to destinations. However, existing audio instruction policies often rely on rule-based approaches that struggle to balance informational content with cognitive load, potentially leading to driver confusion or missed turns in complex environments. To overcome these difficulties, we first model the generation of audio instructions as a multi-task learning problem by decomposing the audio content into combinations of modular elements. Then, we propose a novel deep learning framework that leverages the powerful spatiotemporal information processing capabilities of Transformers and the strong multi-task learning abilities of Mixture of Experts (MoE) to generate real-time, context-aware audio instructions for TBT driving navigation. A cloud-edge collaborative architecture is implemented to handle the computational demands of the model, ensuring scalability and real-time performance for practical applications. Real-world experimental results demonstrate that the proposed method significantly reduces the yaw rate compared to traditional methods, delivering clearer and more effective audio instructions. This is the first large-scale application of deep learning in driving audio navigation, marking a substantial advancement in intelligent transportation and driving assistance technologies.

940Learning Neural Networks with Distribution Shift: Efficiently Certifiable Guarantees

[openreview] [pdf]

Abstract We give the first provably efficient algorithms for learning neural networks with respect to distribution shift. We work in the Testable Learning with Distribution Shift framework (TDS learning) of Klivans et al. (2024), where the learner receives labeled examples from a training distribution and unlabeled examples from a test distribution and must either output a hypothesis with low test error or reject if distribution shift is detected. No assumptions are made on the test distribution. All prior work in TDS learning focuses on classification, while here we must handle the setting of nonconvex regression. Our results apply to real-valued networks with arbitrary Lipschitz activations and work whenever the training distribution has strictly sub-exponential tails. For training distributions that are bounded and hypercontractive, we give a fully polynomial-time algorithm for TDS learning one-hidden-layer networks with sigmoid activations. We achieve this by importing classical kernel methods into the TDS framework using data-dependent feature maps and a type of kernel matrix that couples samples from both train and test distributions.

941Convergence Analysis of the Wasserstein Proximal Algorithm beyond Convexity

[openreview] [pdf]

Abstract The proximal algorithm is a powerful tool to minimize nonlinear and nonsmooth functionals in a general metric space. Motivated by the recent progress in studying the training dynamics of the noisy gradient descent algorithm on two-layer neural networks in the mean-field regime, we provide in this paper a simple and self-contained analysis for the convergence of the general-purpose Wasserstein proximal algorithm without assuming geodesic convexity on the objective functional. Under a natural Wasserstein analog of the Euclidean Polyak-Łojasiewicz inequality, we show that the proximal algorithm achieves an unbiased and linear convergence rate. Our convergence rate improves upon existing rates of the proximal algorithm for solving Wasserstein gradient flows under strong geodesic convexity. We also extend our analysis to the inexact proximal algorithm for geodesically semiconvex objectives. In our numerical experiments, proximal training demonstrates a faster convergence rate than the noisy gradient descent algorithm on mean-field neural networks.
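
For orientation, the Wasserstein proximal (JKO) step and a Polyak-Łojasiewicz-type condition of the kind referenced above take the following standard forms (notation assumed here, not taken from the paper):

```latex
% Proximal (JKO) step with step size \tau on a functional F over P_2:
\rho_{k+1} \in \operatorname*{arg\,min}_{\rho \in \mathcal{P}_2}
  \; F(\rho) + \frac{1}{2\tau} W_2^2(\rho, \rho_k)

% Wasserstein analog of the Polyak-Lojasiewicz inequality:
\frac{1}{2} \left\| \nabla_{W_2} F(\rho) \right\|_{L^2(\rho)}^2
  \;\ge\; \mu \left( F(\rho) - \inf_{\rho'} F(\rho') \right)
```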

942Towards Black-Box Membership Inference Attack for Diffusion Models

[openreview] [pdf]

Abstract Given the rising popularity of AI-generated art and the associated copyright concerns, identifying whether an artwork was used to train a diffusion model is an important research topic. This work approaches the problem from the membership inference attack (MIA) perspective. We first identify the limitation of applying existing MIA methods to proprietary diffusion models: the required access to internal U-nets. To address this problem, we introduce a novel membership inference attack method that uses only the image-to-image variation API and operates without access to the model’s internal U-net. We validate our method using DDIM and Stable Diffusion setups and further extend both our approach and existing algorithms to the Diffusion Transformer architecture. Our experimental results consistently outperform previous methods.

943REVEAL-IT: REinforcement learning with Visibility of Evolving Agent poLicy for InTerpretability

[openreview] [pdf]

Abstract Understanding the agent’s learning process, particularly the factors that contribute to its success or failure post-training, is crucial for comprehending the rationale behind the agent’s decision-making process. Prior methods clarify the learning process by creating a structural causal model (SCM) or visually representing the distribution of value functions. Nevertheless, these approaches have constraints as they exclusively function in 2D environments or with uncomplicated transition dynamics. Understanding the agent’s learning process in complicated environments or tasks is more challenging. In this paper, we propose REVEAL-IT, a novel framework for explaining the learning process of an agent in complex environments. Initially, we visualize the policy structure and the agent’s learning process for various training tasks. By visualizing these findings, we can understand how much a particular training task or stage affects the agent’s performance in the test. Then, a GNN-based explainer learns to highlight the most important sections of the policy, providing a clearer and more robust explanation of the agent’s learning process. The experiments demonstrate that explanations derived from this framework can effectively help optimize the training tasks, resulting in improved learning efficiency and final performance.

944Bridging Lottery Ticket and Grokking: Understanding Grokking from Inner Structure of Networks

[openreview] [pdf]

Abstract Grokking is the intriguing phenomenon of delayed generalization: networks initially memorize training data with perfect accuracy but poor generalization, then transition to a generalizing solution with continued training. While reasons for this delayed generalization, such as weight norms and sparsity, have been discussed, the influence of network structure, particularly the role of subnetworks, still needs to be explored. In this work, we link the grokking phenomenon to the lottery ticket hypothesis to investigate the impact of inner network structures. We demonstrate that using lottery tickets obtained at the generalizing phase (‘grokking tickets’) significantly reduces delayed generalization on various tasks, including multiple modular arithmetic, polynomial regression, sparse parity, and MNIST. For example, grokking tickets accelerate grokking (the transition from memorization to generalization), requiring as little as 1/65 of the training needed by dense networks in modular addition. Through a series of controlled experiments, our findings reveal that neither small weight norms nor sparsity alone account for the reduction of delayed generalization; instead, the presence of a good subnetwork structure is crucial. Analyzing the transition from memorization to generalization, we observe that rapid changes in subnetwork structures, measured by the Jaccard distance, strongly correlate with improvements in test accuracy. We further show that pruning techniques can accelerate the grokking process, transforming a memorizing network into a generalizing one without updating the weights. By demonstrating that good subnetworks are key to achieving generalization and that pruning can expedite this process, we provide new insights into the mechanisms underlying neural network generalization.

945LLM Cascade with Multi-Objective Optimal Consideration

[openreview] [pdf]

Abstract Large Language Models (LLMs) have demonstrated exceptional capabilities in understanding and generating natural language. However, their high deployment costs often pose a barrier to practical applications. Cascading local and server models offers a promising solution to this challenge. While existing studies on LLM cascades have primarily focused on the performance-cost trade-off, real-world scenarios often involve more complex requirements. This paper introduces a novel LLM cascade strategy with multi-objective optimization, enabling LLM cascades to consider additional objectives (e.g., privacy) and better align with the specific demands of real-world applications while maintaining their original cascading abilities. Extensive experiments on three benchmarks validate the effectiveness and superiority of our approach.

946Universal Concavity-Aware Descent Rate for Optimizers

[openreview] [pdf]

Abstract Many machine learning problems involve the challenging task of calibrating parameters in a computational model to fit the training data; this task is especially challenging for non-convex problems. Many optimization algorithms have been proposed to assist in calibrating these parameters, each with its respective advantages in different scenarios, but it is often difficult to determine the scenarios for which an algorithm is best suited. To contend with this challenge, much work has been done on proving the rate at which these optimizers converge to their final solution; however, the wide variety of such convergence rate bounds, each with its own assumptions, convergence metrics, tightness, and parameters (which may or may not be known to the practitioner), makes comparing these convergence rates difficult. To help with this problem, we present a minmax-optimal algorithm and, by comparison to it, give a single descent bound which is applicable to a very wide family of optimizers, tasks, and data (including all of the most prevalent ones), and which puts special emphasis on being tight even in parameter subspaces in which the cost function is concave.

947Almost Sure Convergence of Average Reward Temporal Difference Learning

[openreview] [pdf]

Abstract Tabular average reward Temporal Difference (TD) learning is perhaps the simplest and the most fundamental policy evaluation algorithm in average reward reinforcement learning. After at least 25 years since its discovery, we are finally able to provide a long-awaited almost sure convergence analysis. Namely, we are the first to prove that, under very mild conditions, tabular average reward TD converges almost surely to a sample path dependent fixed point. Key to this success is a new general stochastic approximation result concerning nonexpansive mappings with Markovian and additive noise, built on recent advances in stochastic Krasnoselskii-Mann iterations.

948Differentiable Integer Linear Programming

[openreview] [pdf]

Abstract Machine learning (ML) techniques have shown great potential in generating high-quality solutions for integer linear programs (ILPs). However, existing methods typically rely on a supervised learning paradigm, leading to (1) expensive training cost due to repeated invocations of traditional solvers to generate training labels, and (2) plausible yet infeasible solutions due to the misalignment between the training objective (minimizing prediction loss) and the inference objective (generating high-quality solutions). To tackle this challenge, we propose DiffILO (Differentiable Integer Linear Programming Optimization), an unsupervised learning paradigm for learning to solve ILPs. Specifically, through a novel probabilistic modeling, DiffILO reformulates ILPs, which are discrete and constrained optimization problems, into continuous, differentiable (almost everywhere), and unconstrained ones. This reformulation enables DiffILO to simultaneously solve ILPs and train the model via straightforward gradient descent, providing two major advantages. First, it significantly reduces the training cost, as the training process does not need the aid of traditional solvers at all. Second, it facilitates the generation of feasible and high-quality solutions, as the model learns to solve ILPs in an end-to-end manner, thus aligning the training and inference objectives. Experiments on commonly used ILP datasets demonstrate that DiffILO not only achieves an average training speedup of 13.2 times compared to supervised methods, but also outperforms them by generating heuristic solutions with significantly higher feasibility ratios and much better solution quality.
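
A toy version of this kind of reformulation: relax each binary variable to a probability and penalize constraint violation, yielding an unconstrained objective trainable by gradient descent. The penalty form below is an illustrative assumption, not the paper's exact probabilistic model.

```python
import torch

def diffilo_style_loss(theta, c, A, b, lam=10.0):
    """Differentiable surrogate for: min c.x  s.t.  Ax <= b, x in {0,1}^n.
    theta are free logits; p = sigmoid(theta) relaxes x to probabilities."""
    p = torch.sigmoid(theta)
    objective = c @ p                              # relaxed cost
    violation = torch.relu(A @ p - b).sum()        # soft constraint penalty
    return objective + lam * violation

# Gradient-descent "solving" loop on a tiny made-up instance;
# rounding p at the end gives a candidate integer solution.
theta = torch.zeros(4, requires_grad=True)
c = torch.tensor([1.0, 2.0, -3.0, 0.5])
A = torch.tensor([[1.0, 1.0, 1.0, 0.0]])
b = torch.tensor([2.0])
opt = torch.optim.Adam([theta], lr=0.1)
for _ in range(200):
    opt.zero_grad()
    loss = diffilo_style_loss(theta, c, A, b)
    loss.backward()
    opt.step()
x = (torch.sigmoid(theta) > 0.5).float()
```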

949RouteFinder: Towards Foundation Models for Vehicle Routing Problems

[openreview] [pdf]

Abstract This paper introduces RouteFinder, a comprehensive foundation model framework to tackle different Vehicle Routing Problem (VRP) variants. Our core idea is that a foundation model for VRPs should be able to represent variants by treating each as a subset of a generalized problem equipped with different attributes. We propose a unified VRP environment capable of efficiently handling any attribute combination. The RouteFinder model leverages a modern transformer-based encoder and global attribute embeddings to improve task representation. Additionally, we introduce two reinforcement learning techniques to enhance multi-task performance: mixed batch training, which enables training on different variants at once, and multi-variant reward normalization to balance different reward scales. Finally, we propose efficient adapter layers that enable fine-tuning for new variants with unseen attributes. Extensive experiments on 24 VRP variants show RouteFinder achieves competitive results. Our code is openly available.

950A Mathematics-Inspired Learning-to-Optimize Framework for Decentralized Optimization

[openreview] [pdf]

Abstract Most decentralized optimization algorithms are handcrafted. While endowed with strong theoretical guarantees, these algorithms generally target a broad class of problems, thereby not being adaptive or customized to specific problem features. This paper studies data-driven decentralized algorithms trained to exploit problem features to boost convergence. Existing learning-to-optimize methods typically suffer from poor generalization or prohibitively vast search spaces. In addition, they face more challenges in decentralized settings where nodes must reach consensus through neighborhood communications without global information. To resolve these challenges, this paper first derives the necessary conditions that successful decentralized algorithmic rules need to satisfy to achieve both optimality and consensus. Based on these conditions, we propose a novel Mathematics-inspired Learning-to-optimize framework for Decentralized optimization (MiLoDo). Empirical results demonstrate that MiLoDo-trained algorithms outperform handcrafted algorithms and exhibit strong generalization. Algorithms learned via MiLoDo in 100 iterations perform robustly when run for 100,000 iterations during inference. Moreover, MiLoDo-trained algorithms on synthetic datasets perform well on problems involving real data, higher dimensions, and different loss functions.

951Graph Supervised Contrastive Learning for Geodemographics

[openreview] [pdf]

Abstract Geodemographic analysis is essential for understanding population characteristics and addressing socio-economic disparities across regions. However, limited research has been conducted on modelling changes in demographic data over time using Graph Neural Networks (GNNs). In this study, we address this gap by leveraging GNNs to model correlations between the 2011 census data (England & Wales), observing changes over time, and the Output Area Classification 2021, which reflects socio-economic differences between Output Areas (OAs). We propose a novel framework that utilises Supervised Contrastive Learning on graphs to obtain robust OA embeddings, with a particular focus on improving the model’s performance for minority classes. To evaluate the effectiveness of our framework, we conducted two downstream tasks based on the 2021 OA embeddings. Our results demonstrate that the proposed approach provides valuable insights for geodemographic analysis and offers policymakers a useful tool for assessing socio-economic transitions over time and planning ahead on that basis.

952Effective LLM Knowledge Learning Requires Rethinking Generalization

[openreview] [pdf]

Abstract Large language models (LLMs) are trained on a substantial amount of documents that contain extensive world knowledge. However, it is still not well understood how knowledge is acquired via autoregressive pre-training and extracted via question-answering. This lack of understanding greatly hinders effective knowledge learning, especially for continued pre-training on up-to-date information, as this evolving information often lacks the diverse repetitions of foundational knowledge. In this paper, we focus on understanding and improving LLM knowledge learning. We find and verify that knowledge learning for LLMs can be viewed as an implicit supervised task hidden in the autoregressive pre-training objective. Our findings suggest that knowledge learning for LLMs would benefit from methods designed to improve generalization ability for supervised tasks. Based on our analysis, we propose diversifying the formats of training documents as data augmentation to grow in-distribution samples. Unlike text paraphrasing, this data augmentation method does not risk altering the facts embedded in the documents. We also introduce sharpness-aware minimization as an effective optimization algorithm to better improve generalization. Moreover, we adapt our method to instruction tuning for generalization to various phrasings of questions. Extensive experimental results validate our findings and demonstrate our methods’ effectiveness in improving knowledge learning in both the continued pre-training and instruction tuning stages. This paper offers new perspectives and insights to interpret and design effective strategies for LLM knowledge learning.
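
Sharpness-aware minimization, the optimizer adopted above, perturbs the weights toward the local worst case before computing the descent gradient. A minimal two-pass sketch of standard SAM, simplified to a plain SGD base step (the paper's training setup will differ):

```python
import torch

def sam_step(params, closure, rho=0.05, lr=0.01):
    """One SAM step: ascend by rho along the normalized gradient, take the
    gradient there, undo the perturbation, then descend. `closure` recomputes
    the training loss on the current parameters."""
    loss = closure()
    grads = torch.autograd.grad(loss, params)
    scale = rho / (torch.sqrt(sum((g ** 2).sum() for g in grads)) + 1e-12)
    eps = [g * scale for g in grads]
    with torch.no_grad():
        for p, e in zip(params, eps):
            p.add_(e)                     # move to the nearby sharp point
    loss_adv = closure()
    grads_adv = torch.autograd.grad(loss_adv, params)
    with torch.no_grad():
        for p, e, g in zip(params, eps, grads_adv):
            p.sub_(e)                     # restore original weights
            p.sub_(g, alpha=lr)           # SGD step with the SAM gradient
```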

[openreview] [pdf]

Abstract Research on Out-Of-Distribution (OOD) detection focuses mainly on building scores that efficiently distinguish OOD data from In-Distribution (ID) data. On the other hand, Conformal Prediction (CP) uses non-conformity scores to construct prediction sets with probabilistic coverage guarantees. In other words, the former designs scores, while the latter designs probabilistic guarantees based on scores. Therefore, we claim that these two fields might be naturally intertwined. This work advocates for cross-fertilization between OOD and CP by formalizing their link and emphasizing two benefits of using them jointly. First, we show that in standard OOD benchmark settings, evaluation metrics can be overly optimistic due to the test dataset’s finite sample size. Based on the work of Bates et al. (2022), we define new conformal AUROC and conformal FPR@TPRβ metrics, which are corrections that provide probabilistic conservativeness guarantees on the variability of these metrics. We show the effect of these corrections on two reference OOD and anomaly detection benchmarks, OpenOOD (Yang et al., 2022) and ADBench (Han et al., 2022). Second, we explore using OOD scores as non-conformity scores and show that they can improve the efficiency of the prediction sets obtained with CP.
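
The coupling is easy to state in code: any OOD score can serve as the non-conformity score in split conformal prediction. A minimal sketch of the standard split-CP recipe (variable names here are hypothetical):

```python
import numpy as np

def conformal_threshold(cal_scores: np.ndarray, alpha: float = 0.1) -> float:
    """Split-conformal threshold: the ceil((n+1)(1-alpha))/n empirical
    quantile of calibration non-conformity scores."""
    n = len(cal_scores)
    level = min(np.ceil((n + 1) * (1 - alpha)) / n, 1.0)
    return float(np.quantile(cal_scores, level, method="higher"))

def prediction_set(nonconformity, x, labels, tau):
    """Keep every label whose score does not exceed the threshold. Plugging
    an OOD score in as `nonconformity` is the pairing the abstract proposes."""
    return [y for y in labels if nonconformity(x, y) <= tau]
```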

954Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games

[openreview] [pdf]

Abstract We study last-iterate convergence properties of algorithms for solving two-player zero-sum games based on Regret Matching+ (RM+). Despite their widespread use for solving real games, virtually nothing is known about their last-iterate convergence. A major obstacle to analyzing RM-type dynamics is that their regret operators lack Lipschitzness and (pseudo)monotonicity. We start by showing numerically that several variants used in practice, such as RM+, predictive RM+ and alternating RM+, all lack last-iterate convergence guarantees even on a simple 3×3 matrix game. We then prove that recent variants of these algorithms based on a smoothing technique, extragradient RM+ and smooth predictive RM+, enjoy asymptotic last-iterate convergence (without a rate), 1/√t best-iterate convergence, and, when combined with restarting, linear-rate last-iterate convergence. Our analysis builds on a new characterization of the geometric structure of the limit points of our algorithms, marking a significant departure from most of the literature on last-iterate convergence. We believe that our analysis may be of independent interest and offers a fresh perspective for studying last-iterate convergence in algorithms based on non-monotone operators.
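
For concreteness, the basic RM+ operator that all the studied variants build on, in its standard form (the smoothed and predictive variants modify how the utility vector is chosen):

```python
import numpy as np

def rm_plus_step(Q, x, u):
    """One Regret Matching+ update for a single player.
    Q: thresholded cumulative regrets; x: current strategy; u: per-action
    utilities observed this round. Returns the updated (Q, x)."""
    inst_regret = u - np.dot(u, x)           # regret of each pure action
    Q = np.maximum(Q + inst_regret, 0.0)     # RM+ thresholds regrets at zero
    x = Q / Q.sum() if Q.sum() > 0 else np.full_like(x, 1.0 / x.size)
    return Q, x
```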

955Questioning Simplicity Bias Assumptions

[openreview] [pdf]

Abstract The Simplicity Bias (SB) is the observation that training most commonly used neural network architectures with standard techniques is biased toward learning simple functions. This phenomenon can be a benefit or a drawback depending on the relative complexity of the desired function to be learnt. If the desired function is relatively simple, the bias is helpful. However, if there are simpler features that are highly predictive, commonly called shortcuts or spurious features, that are not present in the test environment, the SB can result in poor generalisation performance. Most existing works on mitigating the SB make various assumptions, either about the features present in the train and test domains or by assuming access to information about the test domain at train time. In this paper we review recent work on the SB and take a critical look at these assumptions.

956Monophilic Neighbourhood Transformers

[openreview] [pdf]

Abstract Graph neural networks (GNNs) have seen widespread application across diverse fields, including social network analysis, chemical research, and computer vision. Nevertheless, their efficacy is compromised by an inherent reliance on the homophily assumption, which posits that adjacent nodes should exhibit relevance or similarity. This assumption becomes a limitation when dealing with heterophilic graphs, where it is more common for dissimilar nodes to be connected. Addressing this challenge, recent research indicates that real-world graphs generally exhibit monophily, a characteristic where a node tends to be related to the neighbours of its neighbours. Inspired by this insight, we introduce Neighbourhood Transformers (NT), a novel approach that employs self-attention within every neighbourhood of the graph to generate informative messages for the nodes within, as opposed to the central node in conventional GNN frameworks. We develop a neighbourhood partitioning strategy equipped with switchable attentions, significantly reducing space consumption by over 95% and time consumption by up to 92.67% in NT. Experimental results on node classification tasks across 5 heterophilic and 5 homophilic graphs demonstrate that NT outperforms current state-of-the-art methods, showcasing its expressiveness and adaptability to different graph types. The code for this study will be made available following the publication of this manuscript.

957Dynamic Elimination For PAC Optimal Item Selection From Relative Feedback

[openreview] [pdf]

Abstract We study the problem of best-item identification from relative feedback where a learner adaptively plays subsets of items and receives stochastic feedback in the form of the best item in the set. We propose an algorithm - Dynamic Elimination (DE) - that dynamically prunes sub-optimal items from contention to efficiently identify the best item and show a strong sample complexity upper bound for it. We further formalize the notion of inferred updates to obtain estimates on item win rates without directly playing them by leveraging item correlation information. We propose the Dynamic Elimination by Correlation (DEBC) algorithm as an extension to DE with inferred updates. We show through extensive experiments that DE and DEBC significantly outperform all existing baselines across multiple datasets in various settings.

958TOP-ERL: Transformer-based Off-Policy Episodic Reinforcement Learning

[openreview] [pdf]

Abstract This work introduces Transformer-based Off-Policy Episodic Reinforcement Learning (TOP-ERL), a novel algorithm that enables off-policy updates in the ERL framework. In ERL, policies predict entire action trajectories over multiple time steps instead of single actions at every time step. These trajectories are typically parameterized by trajectory generators such as Movement Primitives (MP), allowing for smooth and efficient exploration over long horizons while capturing high-level temporal correlations. However, ERL methods are often constrained to on-policy frameworks due to the difficulty of evaluating state-action values for entire action sequences, limiting their sample efficiency and preventing the use of more efficient off-policy architectures. TOP-ERL addresses this shortcoming by segmenting long action sequences and estimating the state-action values for each segment using a transformer-based critic architecture alongside an n-step return estimation. These contributions result in efficient and stable training that is reflected in the empirical results conducted on sophisticated robot learning environments. TOP-ERL significantly outperforms state-of-the-art RL methods. Thorough ablation studies additionally show the impact of key design choices on the model performance.

959How Far Are We from True Unlearnability?

[openreview] [pdf]

Abstract High-quality data plays an indispensable role in the era of large models, but the use of unauthorized data for model training greatly damages the interests of data owners. To overcome this threat, several unlearnable methods have been proposed, which generate unlearnable examples (UEs) by compromising the training availability of data. Clearly, due to unknown training purposes and the powerful representation learning capabilities of existing models, these data are expected to be unlearnable for various task models, i.e., they will not help improve the model’s performance. However, unexpectedly, we find that on the multi-task dataset Taskonomy, UEs still perform well in tasks such as semantic segmentation, failing to exhibit cross-task unlearnability. This phenomenon leads us to question: how far are we from attaining truly unlearnable examples? We attempt to answer this question from the perspective of model optimization. We observe the difference in the convergence process between clean and poisoned models on a simple model using the loss landscape, and find that only a part of the critical parameter optimization paths show significant differences, implying a close relationship between the loss landscape and unlearnability. Consequently, we employ the loss landscape to explain the underlying reasons for UEs and propose Sharpness-Aware Learnability (SAL) for quantifying the unlearnability of parameters based on this explanation. Furthermore, we propose an Unlearnable Distance (UD) metric to measure the unlearnability of data based on the SAL distribution of parameters in clean and poisoned models. Finally, we conduct benchmark tests on mainstream unlearnable methods using the proposed UD, aiming to promote community awareness of the capability boundaries of existing unlearnable methods. The code is available at https://github.com/MLsecurityLab/HowFarAreFromTrueUnlearnability.git.

960Characterizing the Training Dynamics of Private Fine-tuning with Langevin Diffusion

[openreview] [pdf]

Abstract We show that differentially private full fine-tuning (DP-FFT) can distort pre-trained backbone features based on both theoretical and empirical results. We identify the cause of the distortion as the misalignment between the pre-trained backbone and the randomly initialized linear head. We prove that a sequential fine-tuning strategy can mitigate the feature distortion: first-linear-probing-then-fine-tuning (DP-LP-FFT). A new approximation scheme allows us to derive approximate upper and lower bounds on the training loss of DP-LP and DP-FFT, in a simple but canonical setting of 2-layer neural networks with ReLU activation. Experiments on real-world datasets and architectures are consistent with our theoretical insights. We also derive new upper bounds for 2-layer linear networks without the approximation. Moreover, our theory suggests a trade-off of privacy budget allocation in multi-phase fine-tuning methods like DP-LP-FFT.

961On the Benefits of Attribute-Driven Graph Domain Adaptation

[openreview] [pdf]

Abstract Graph Domain Adaptation (GDA) addresses a pressing challenge in cross-network learning, particularly pertinent due to the absence of labeled data in real-world graph datasets. Recent studies attempted to learn domain-invariant representations by eliminating structural shifts between graphs. In this work, we show that existing methodologies have overlooked the significance of graph node attributes, a pivotal factor for graph domain alignment. Specifically, we first reveal the impact of node attributes for GDA by theoretically proving that, in addition to the graph structural divergence between the domains, the node attribute discrepancy also plays a critical role in GDA. Moreover, we also empirically show that the attribute shift is more substantial than the topology shift, which further underscores the importance of node attribute alignment in GDA. Inspired by this finding, a novel cross-channel module is developed to fuse and align both views between the source and target graphs for GDA. Experimental results on a variety of benchmarks verify the effectiveness of our method.

962Enhancing Multi-Objective Offline RL with Adaptive Preference Integration

[openreview] [pdf]

Abstract Multi-objective reinforcement learning (MORL) is crucial for real-world applications where multiple conflicting goals must be optimized, such as in healthcare or autonomous systems. Offline MORL extends these benefits by using pre-collected datasets, allowing for effective learning without continuous interaction with the environment. However, existing offline MORL algorithms often struggle with scaling across large preference spaces and handling unknown preferences during evaluation. To address these challenges, we propose the Preference-Attended Multi-Objective Decision Transformer (PA-MODT), a novel architecture that integrates a preference-attention block with a modular transformer structure. This design enables effective generalization over different preferences and trajectories, providing a more robust approach to generating optimal Pareto fronts. We tested PA-MODT on five D4MORL datasets with millions of trajectories representing various objectives and found that it consistently outperforms existing models, achieving Pareto fronts that align closely with the behavioral policy. This demonstrates PA-MODT’s potential to effectively manage complex multi-objective reinforcement learning tasks.

963Optimal Algorithm for Max-Min Fair Bandit

[openreview] [pdf]

Abstract We consider a multi-player multi-armed bandit problem (MP-MAB) where N players compete for K arms in T rounds. The reward distribution is heterogeneous: each player has a different expected reward for the same arm. When multiple players select the same arm, they collide and obtain zero reward. In this paper, we aim to find the max-min fairness matching that maximizes the reward of the player who receives the lowest reward. This paper improves on the existing regret upper bound of O(log T · log log T) for achieving max-min fairness. More specifically, our decentralized fair elimination algorithm (DFE) deals with heterogeneity and collisions carefully and attains a regret upper bound of O((N² + K) log T/Δ), where Δ is the minimum reward gap between the max-min value and sub-optimal arms. We assume N ≤ K to guarantee all players can select their arms without collisions. In addition, we also provide an Ω(max{N², K} log T/Δ) regret lower bound for this problem. This lower bound indicates that our algorithm is optimal with respect to key parameters, which significantly improves on the performance of algorithms in previous work. Numerical experiments again verify the efficiency and improvement of our algorithms.
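
The offline objective the algorithm targets can be checked by brute force on small instances. A sketch, assuming the true reward matrix mu is known (which the bandit setting of course withholds):

```python
import itertools
import numpy as np

def max_min_matching(mu: np.ndarray):
    """Enumerate collision-free assignments of N players to K arms (N <= K)
    and return the one maximizing the minimum player reward."""
    N, K = mu.shape
    best_val, best = -np.inf, None
    for arms in itertools.permutations(range(K), N):
        val = min(mu[p, a] for p, a in enumerate(arms))
        if val > best_val:
            best_val, best = val, arms
    return best_val, best
```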

964Can Transformers In-Context Learn Behavior of a Linear Dynamical System?

[openreview] [pdf]

Abstract We investigate whether transformers can learn to track a random process when given observations of a related process and parameters of the dynamical system that relates them as context. More specifically, we consider a finite-dimensional state-space model described by the state transition matrix F, measurement matrices h_1, …, h_N, and the process and measurement noise covariance matrices Q and R, respectively; these parameters, randomly sampled, are provided to the transformer along with the observations y_1, …, y_N generated by the corresponding linear dynamical system. We argue that in such settings transformers learn to approximate the celebrated Kalman filter, and empirically verify this both for the task of estimating the hidden states x̂_{N|1,2,3,…,N} as well as for one-step prediction of the (N+1)-st observation, ŷ_{N+1|1,2,3,…,N}. A further study of the transformer’s robustness reveals that its performance is retained even if the model’s parameters are partially withheld. In particular, we demonstrate that the transformer remains accurate at the considered task even in the absence of state transition and noise covariance matrices, effectively emulating the operations of the Dual-Kalman filter.
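
The reference solution the transformer is argued to approximate is the textbook Kalman filter. One predict/update step in the abstract's notation, with H stacking the measurement vectors:

```python
import numpy as np

def kalman_step(x_hat, P, y, F, H, Q, R):
    """Standard Kalman predict/update. x_hat, P: previous state estimate and
    covariance; y: new observation; F, H, Q, R as in the state-space model."""
    # Predict
    x_pred = F @ x_hat
    P_pred = F @ P @ F.T + Q
    # Update
    S = H @ P_pred @ H.T + R                  # innovation covariance
    K = P_pred @ H.T @ np.linalg.inv(S)       # Kalman gain
    x_new = x_pred + K @ (y - H @ x_pred)
    P_new = (np.eye(len(x_hat)) - K @ H) @ P_pred
    return x_new, P_new
```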

965Dense Backpropagation Improves Routing for Sparsely-Gated Mixture-of-Experts

[openreview] [pdf]

Abstract Mixture of Experts (MoE) pretraining is more scalable than dense Transformer pretraining, because MoEs learn to route inputs to a sparse set of their feedforward parameters. However, this means that MoEs only receive a sparse backward update, leading to problems such as router load imbalance where some experts receive more tokens than others. We present a lightweight approximation method that gives the MoE a dense gradient while only sparsely activating its parameters. A key insight into the design of our method is that at scale, many tokens not routed to a given expert may nonetheless lie in the span of tokens that were routed to that expert, allowing us to create an approximation for the expert output of that token from existing expert outputs. Our dense backpropagation outperforms standard TopK routing across multiple MoE configurations without increasing runtime.
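
The span argument suggests a simple estimator: reconstruct an unrouted token from the tokens an expert did process, then reuse the same coefficients on that expert's outputs. A least-squares sketch of this idea, assuming the expert acts approximately linearly on that span (the paper's estimator may differ):

```python
import torch

def approx_expert_output(x, X_routed, Y_routed):
    """x: (d,) unrouted token; X_routed: (n, d) tokens routed to the expert;
    Y_routed: (n, d_out) the expert's outputs on them. Solve X_routed^T c = x
    in least squares and return the matching combination of expert outputs."""
    c = torch.linalg.lstsq(X_routed.T, x.unsqueeze(-1)).solution  # (n, 1)
    return (Y_routed.T @ c).squeeze(-1)                           # (d_out,)
```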

966Learning to Plan with Personalized Preferences

[openreview] [pdf]

Abstract Understanding and adapting to human preferences is essential for the effective integration of artificial agents into daily human life, particularly as AI becomes increasingly involved in collaboration and assistance roles. Previous studies on preference recognition in embodied intelligence have largely adopted a generalized yet non-personalized approach. To fill in this gap, our research focuses on empowering embodied agents to learn and adapt to individual preferences, a task complicated by the challenges of inferring these preferences from minimal observations and requiring robust few-shot generalization. To facilitate future study, we introduce PbP, an embodied environment that supports hundreds of diverse preferences ranging from complex action sequences to specific sub-actions. Our experiments on PbP reveal that while symbol-based approaches show promise in terms of effectiveness and scalability, accurately inferring implicit preferences and planning adaptive actions from limited data remain challenging. Nevertheless, preference serves as a valuable abstraction of human behaviors, and incorporating preference as a key intermediary step in planning can significantly enhance the personalization and adaptability of AI agents. We hope our findings can pave the way for future research on more efficient preference learning and personalized planning in dynamic environments.

967Learning Pattern-Specific Experts for Time Series Forecasting Under Patch-level Distribution Shift

[openreview] [pdf]

Abstract Time series forecasting, which aims to predict future values based on historical data, has garnered significant attention due to its broad range of applications. However, real-world time series often exhibit complex non-uniform distribution with varying patterns across segments, such as season, operating condition, or semantic meaning, making accurate forecasting challenging. Existing approaches, which typically train a single model to capture all these diverse patterns, often struggle with the pattern drifts between patches and may lead to poor generalization. To address these challenges, we propose TFPS, a novel architecture that leverages pattern-specific experts for more accurate and adaptable time series forecasting. TFPS employs a dual-domain encoder to capture both time-domain and frequency-domain features, enabling a more comprehensive understanding of temporal dynamics. It then uses subspace clustering to dynamically identify distinct patterns across data patches. Finally, pattern-specific experts model these unique patterns, delivering tailored predictions for each patch. By explicitly learning and adapting to evolving patterns, TFPS achieves significantly improved forecasting accuracy. Extensive experiments on real-world datasets demonstrate that TFPS outperforms state-of-the-art methods, particularly in long-term forecasting, through its dynamic and pattern-aware learning approach. The data and codes are available:https://anonymous.4open.science/r/TFPS-D001.

968Multi-Resolution Decomposable Diffusion Model for Non-Stationary Time Series Anomaly Detection

[openreview] [pdf]

Abstract Recently, generative models have shown considerable promise in unsupervised time series anomaly detection. Nonetheless, the task of effectively capturing complex temporal patterns and minimizing false alarms becomes increasingly challenging when dealing with non-stationary time series, characterized by continuously fluctuating statistical attributes and joint distributions. To confront these challenges, we underscore the benefits of multi-resolution modeling, which improves the ability to distinguish between anomalies and non-stationary behaviors by leveraging correlations across various resolution scales. In response, we introduce aMulti-ResolutionDecomposable DiffusionModel (MODEM), which integrates a coarse-to-fine diffusion paradigm with a frequency-enhanced decomposable network to adeptly navigate the intricacies of non-stationarity. Technically, the coarse-to-fine diffusion model embeds cross-resolution correlations into the forward process to optimize diffusion transitions mathematically. It then innovatively employs low-resolution recovery to guide the reverse trajectories of high-resolution series in a coarse-to-fine manner, enhancing the model’s ability to learn and elucidate underlying temporal patterns. Furthermore, the frequency-enhanced decomposable network operates in the frequency domain to extract globally shared time-invariant information and time-variant temporal dynamics for accurate series reconstruction. Extensive experiments conducted across five real-world datasets demonstrate that our proposed MODEM achieves state-of-the-art performance and can be generalized to other time series tasks. The code will be publicly available upon acceptance.

969Multiple-Frequencies Population-Based Training

[openreview] [pdf]

Abstract Reinforcement Learning’s high sensitivity to hyperparameters is a source of instability and inefficiency, creating significant challenges for practitioners. Hyperparameter Optimization (HPO) algorithms have been developed to address this issue, among them Population-Based Training (PBT) stands out for its ability to generate hyperparameters schedules in a single training run. PBT trains a population of agents, each with its own hyperparameters, frequently ranking them and replacing the worst performers with mutations of the best agents. These intermediate selection steps can cause PBT to focus on short-term improvements, leading it to get stuck in local optima and eventually fall behind vanilla Random Search over longer timescales. This paper studies how this greediness issue is connected to the choice ofevolution frequency, the rate at which the selection is done. We propose Multiple-Frequencies Population-Based Training (MF-PBT), a novel HPO algorithm that addresses greediness by employing sub-populations, each evolving at distinct frequencies. MF-PBT introduces a migration process to transfer information between sub-populations, with an asymmetric design to balance short and long-term optimization. Extensive experiments on the Brax suite demonstrate that MF-PBT improves sample efficiency and long-term performance, even without tuning hyperparameters. Code will be released.

970Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation

[openreview] [pdf]

Abstract Offline-to-online Reinforcement Learning (O2O RL) aims to perform online fine-tuning on an offline pre-trained policy to minimize costly online interactions. Existing methods have used offline data or online data to generate new data for data augmentation, which has led to performance improvement during online fine-tuning. However, they have not fully analyzed and utilized both types of data simultaneously. Offline data helps prevent agents from settling too early on suboptimal policies by providing diverse data, while online data improves training stability and speeds up convergence. In this paper, we propose a data augmentation approach, Classifier-Free Diffusion Generation (CFDG). Considering the differences between offline data and online data, we use conditional diffusion to generate both types of data for augmentation in the online phase, aiming to improve the quality of sample generation. Experimental results show that CFDG outperforms replaying the two data types or using a standard diffusion model to generate new data. Our method is versatile and can be integrated with existing offline-to-online RL algorithms. By applying CFDG to the popular methods IQL, PEX, and APL, we achieve a notable 15% average improvement in empirical performance on D4RL benchmarks such as MuJoCo and AntMaze.
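
Classifier-free generation conditions the diffusion model on the data source (here, offline vs. online) and mixes conditional and unconditional noise predictions at sampling time. The standard combination is shown below as a sketch; CFDG's exact conditioning details are in the paper:

```python
def cfg_noise(eps_cond, eps_uncond, w=1.5):
    """Classifier-free guidance: amplify the conditional direction by w.
    eps_cond / eps_uncond are the model's noise predictions with and without
    the source-type condition; w = 0 recovers unconditional sampling."""
    return (1 + w) * eps_cond - w * eps_uncond
```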

971Auditing Data Controller Compliance with Data Withdrawal

[openreview] [pdf]

Abstract We study auditing total data withdrawal, the case in which a user requests the exclusion of their data from both the training and test data for some machine learning task. This approach is motivated by the need for comprehensive compliance with data privacy regulations and legal frameworks around the world. We conceptualize the task of auditing total data withdrawal as an optimization problem. Compliance verification is conducted under mild assumptions using a dedicated verification algorithm. We then evaluate this formulation over various datasets, architectures, and verification hyperparameters. Our verification algorithm serves as a tool for regulators to ensure auditable compliance and provides enhanced privacy guarantees for users.

972Real-World Benchmarks Make Membership Inference Attacks Fail on Diffusion Models

[openreview] [pdf]

Abstract Membership inference attacks (MIAs) on diffusion models have emerged as potential evidence of unauthorized data usage in training pre-trained diffusion models. These attacks aim to detect the presence of specific images in training datasets of diffusion models. Our study delves into the evaluation of state-of-the-art MIAs on diffusion models and reveals critical flaws and overly optimistic performance estimates in existing MIA evaluation. We introduce CopyMark, a more realistic MIA benchmark that distinguishes itself through the support for pre-trained diffusion models, unbiased datasets, and fair evaluation pipelines. Through extensive experiments, we demonstrate that the effectiveness of current MIA methods significantly degrades under these more practical conditions. Based on our results, we alert that MIA, in its current state, is not a reliable approach for identifying unauthorized data usage in pre-trained diffusion models. To the best of our knowledge, we are the first to discover the performance overestimation of MIAs on diffusion models and present a unified benchmark for more realistic evaluation.

973Online Decision Deferral under Budget Constraints

[openreview] [pdf]

Abstract Machine Learning (ML) models are increasingly used to support or substitute decision making. In applications where skilled experts are a limited resource, it is crucial to reduce their burden and automate decisions when the performance of an ML model is at least of equal quality. However, models are often pre-trained and fixed, while tasks arrive sequentially and their distribution may shift. In that case, the respective performance of the decision makers may change, and the deferral algorithm must remain adaptive. We propose a contextual bandit model of this online decision making problem. Our framework includes budget constraints and different types of partial feedback models. Beyond the theoretical guarantees of our algorithm, we propose efficient extensions that achieve remarkable performance on real-world datasets.

974Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales

[openreview] [pdf]

Abstract Reinforcement learning (RL) training is inherently unstable due to factors such as moving targets and high gradient variance. Reinforcement Learning from Human Feedback (RLHF) and Reinforcement Learning from AI Feedback (RLAIF) introduce additional challenges. For instance, diverse preferences complicate the alignment process, and prediction errors in a trained reward model can become more severe as the LLM generates unseen outputs. These RL challenges create confusion about whether the probability of an action for a given state should be increased or decreased, similar to the noise in labels for classification tasks. In this work, we enhance the stability of the RL training procedure by adapting reverse cross-entropy (RCE) from supervised learning for noisy data to define a symmetric RL loss. We demonstrate performance improvements across various tasks and scales. We conduct experiments in discrete action tasks (Atari games) and continuous action space tasks (MuJoCo benchmark and Box2D) using Symmetric A2C (SA2C) and Symmetric PPO (SPPO), with and without added noise. Notably, SPPO shows strong performance across different hyperparameters. Furthermore, we validate the benefits of the symmetric RL loss in the RLHF framework using PPO for natural language processing tasks, demonstrating improved performance in tasks such as IMDB positive sentiment and TL;DR summarization.
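
For intuition, the reverse cross-entropy recipe from noisy-label learning combines the usual cross-entropy with a term whose roles of prediction and label are swapped, with log(0) clamped to a finite constant. How the paper weights the two terms inside the A2C/PPO objectives is not stated in the abstract, so the coefficients below are assumptions.

```python
import torch
import torch.nn.functional as F

def symmetric_ce(logits, targets, alpha=1.0, beta=0.1, log_clip=-4.0):
    # Standard cross-entropy term.
    ce = F.cross_entropy(logits, targets)
    probs = F.softmax(logits, dim=-1)
    one_hot = F.one_hot(targets, logits.size(-1)).float()
    # Reverse cross-entropy: swap prediction and label; log(0) on the one-hot
    # "label distribution" is clamped so the term stays finite.
    rce = -(probs * torch.clamp(torch.log(one_hot), min=log_clip)).sum(-1).mean()
    return alpha * ce + beta * rce

logits = torch.randn(8, 4, requires_grad=True)   # e.g., policy logits over actions
actions = torch.randint(0, 4, (8,))              # sampled actions as (noisy) targets
loss = symmetric_ce(logits, actions)
loss.backward()
```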

975Gradient based Causal Discovery with Diffusion Model

[openreview] [pdf]

Abstract Causal discovery from observational data is an important problem in many applied sciences. Incorporating a recently proposed smooth characterization of acyclicity, gradient-based causal discovery approaches search for a Directed Acyclic Graph (DAG) by optimizing various neural models. Although they show inspiring results when certain assumptions are satisfied, their capability to model complex nonlinear causal generative functions remains unsatisfactory. Motivated by recent advances in deep generative models, we propose to use diffusion models for causal discovery and search for the DAG under continuous optimization frameworks. With flexible parameter configurations, the diffusion model can represent a wide range of functions, and the proposed causal discovery approach is able to recover graphs with satisfactory accuracy from observational data generated by either linear or nonlinear causal models. This is evidenced by empirical results on both synthetic and real data.

976Zero-Order Diffusion Guidance for Inverse Problems

[openreview] [pdf]

Abstract We propose zero-order diffusion guidance, a method that allows using a diffusion model to solve inverse problems without access to the gradients of the process we seek to invert. Our method employs a zero-order gradient estimator combined with a novel differentiable dimensionality reduction strategy to approximate true gradients during guidance while keeping the task computationally tractable in thousands of dimensions. We apply our method to model inversion and demonstrate how it can be used to reconstruct high-quality faces in a realistic scenario where the adversary has only black-box access to face embeddings. Across a range of inverse problems, including synthetic experiments and JPEG restoration, we show that access to gradients is not necessary for effective guidance. Our black-box method matches white-box performance, thus expanding the scope of inverse problems that can be solved with diffusion-based approaches.
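
A standard zero-order estimator of the kind described averages forward finite differences along random directions. In the sketch below, a fixed random projection stands in for the paper's learned differentiable dimensionality reduction; that substitution, and the toy objective, are assumptions.

```python
import numpy as np

def zero_order_grad(f, x, n_dirs=32, eps=1e-3, proj=None):
    # Approximate grad f(x) from forward differences along random directions,
    # optionally drawn in a low-dimensional subspace and lifted to full space.
    d = x.size
    grad = np.zeros(d)
    for _ in range(n_dirs):
        u = np.random.randn(proj.shape[1] if proj is not None else d)
        u = proj @ u if proj is not None else u
        u /= np.linalg.norm(u)
        grad += (f(x + eps * u) - f(x)) / eps * u   # directional estimate
    return grad / n_dirs

f = lambda x: -np.sum((x - 1.0) ** 2)               # black-box objective
x = np.zeros(64)
proj = np.random.randn(64, 8) / np.sqrt(8)          # 8-dim search subspace
for _ in range(200):
    x += 0.05 * zero_order_grad(f, x, proj=proj)    # ascend the estimated gradient
```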

977Extending Stability Analysis to Adaptive Optimization Algorithms Using Loss Surface Geometry

[openreview] [pdf]

Abstract Adaptive optimization algorithms, such as Adam (Kingma & Ba, 2015) and RMSProp (Tieleman & Hinton, 2012), have become integral to training deep neural networks, yet their stability properties and impact on generalization remain poorly understood (Wilson et al., 2017). This paper extends linear stability analysis to adaptive optimizers, providing a theoretical framework that explains their behavior in relation to loss surface geometry (Wu et al., 2022; Jastrzębski et al., 2019). We introduce a novel generalized coherence measure that quantifies the interaction between the adaptive preconditioner and the Hessian of the loss function. This measure yields necessary and sufficient conditions for linear stability near stationary points, offering insights into why adaptive methods may converge to sharper minima with poorer generalization. Our analysis leads to practical guidelines for hyperparameter tuning, demonstrating how to improve the generalization performance of adaptive optimizers. Through extensive experiments on benchmark datasets and architectures, including ResNet (He et al., 2016) and Vision Transformers (Dosovitskiy et al., 2020), we validate our theoretical predictions, showing that aligning the adaptive preconditioner with the loss surface geometry through careful parameter selection can narrow the generalization gap between adaptive methods and SGD (Loshchilov & Hutter, 2018).

978Adaptive Algorithm for Non-Stationary Online Convex-Concave Optimization

[openreview] [pdf]

Abstract This paper addresses the problem of Online Convex-Concave Optimization, an extension of Online Convex Optimization to two-player time-varying convex-concave games. Our objective is to minimize the dynamic duality gap (D-DGap), a key performance metric that evaluates the players’ strategies against arbitrary comparator sequences. Existing algorithms struggle to achieve optimal performance, particularly in stationary or predictable environments. We propose a novel, modular algorithm comprising three key components: an Adaptive Module that adjusts to varying levels of non-stationarity, a Multi-Predictor Aggregator that selects the optimal predictor from multiple candidates, and an Integration Module that seamlessly combines the strengths of both. Our algorithm guarantees a minimax optimal D-DGap upper bound, up to a logarithmic factor, while also achieving a prediction error-based D-DGap bound. Empirical results further demonstrate the effectiveness and adaptability of the proposed method.

979Four eyes see more than two: Dataset Distillation with Mixture-of-Experts

[openreview] [pdf]

Abstract The ever-growing size of datasets in deep learning presents a significant challenge in terms of training efficiency and computational cost. Dataset distillation (DD) has emerged as a promising approach to address this challenge by generating compact synthetic datasets that retain the essential information of the original data. However, existing DD methods often suffer from performance degradation when transferring distilled datasets across different network architectures (i.e., the model trained on the distilled dataset differs from the one used to perform the distillation). To overcome this limitation, we propose a novel mixture-of-experts framework for dataset distillation. Our goal is to promote diversity within the distilled dataset by distributing the distillation tasks to multiple expert models. Each expert specializes in distilling a distinct subset of the dataset, encouraging the experts to capture different aspects of the original data distribution. To further enhance diversity, we introduce a distance correlation minimization strategy that encourages the experts to learn distinct representations. Moreover, during the testing stage (where the distilled dataset is used to train a new model), a mixup-based fusion strategy is applied to better leverage the complementary information captured by each expert. Through extensive experiments, we demonstrate that our framework effectively mitigates the issue of cross-architecture performance degradation in dataset distillation, particularly in low-data regimes, leading to more efficient and versatile deep learning models trained on the distilled data.
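
The distance correlation penalty mentioned here has a compact empirical form (the standard biased estimator built from double-centered pairwise distance matrices). A hedged sketch follows; the expert feature tensors are hypothetical stand-ins, and minimizing the returned quantity pushes the two experts' representations toward statistical independence.

```python
import torch

def distance_correlation(x, y):
    # Biased empirical (squared) distance correlation between two batches.
    def centered_dist(z):
        d = torch.cdist(z, z)
        return d - d.mean(0, keepdim=True) - d.mean(1, keepdim=True) + d.mean()
    A, B = centered_dist(x), centered_dist(y)
    dcov2 = (A * B).mean()                      # distance covariance (squared)
    dvar_x, dvar_y = (A * A).mean(), (B * B).mean()
    return dcov2 / (dvar_x * dvar_y).sqrt().clamp_min(1e-12)

feat_a = torch.randn(64, 128, requires_grad=True)   # expert A representations
feat_b = torch.randn(64, 128, requires_grad=True)   # expert B representations
penalty = distance_correlation(feat_a, feat_b)      # add to the distillation loss
penalty.backward()
```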

980Capability Localization: Capabilities Can be Localized rather than Individual Knowledge

[openreview] [pdf]

Abstract Large-scale language models have achieved superior performance in natural language processing tasks; however, it is still unclear how model parameters affect performance improvement. Previous studies assumed that individual knowledge is stored in local parameters, yet the proposed storage forms (dispersed parameters, parameter layers, or parameter chains) are not unified. Through fidelity and reliability evaluation experiments, we found that individual knowledge cannot be localized. Afterwards, we constructed a dataset for decoupling experiments and discovered the potential for localizing data commonalities. To further reveal this phenomenon, this paper proposes a Commonality Neuron Localization (CNL) method, which successfully locates commonality neurons and achieves a neuron overlap rate of 96.42% on the GSM8K dataset. Finally, we demonstrate through cross-data experiments that commonality neurons are a collection of capability neurons that possess the capability to enhance performance.

981Decision Information Meets Large Language Models: The Future of Explainable Operations Research

[openreview] [pdf]

Abstract Operations Research (OR) is vital for decision-making in many industries. While recent OR methods have seen significant improvements in automation and efficiency through integrating Large Language Models (LLMs), they still struggle to produce meaningful explanations. This lack of clarity raises concerns about transparency and trustworthiness in OR applications. To address these challenges, we propose a comprehensive framework, Explainable Operations Research (EOR), emphasizing actionable and understandable explanations accompanying optimization. The core of EOR is the concept of Decision Information, which emerges from what-if analysis and focuses on evaluating how changes to complex constraints (or parameters) affect decision-making. Specifically, we utilize bipartite graphs to quantify the changes in the OR model and adopt LLMs to improve the explanation capabilities. Additionally, we introduce the first industrial benchmark to rigorously evaluate the effectiveness of explanations and analyses in OR, establishing a new standard for transparency and clarity in the field.

982Towards Infinite-Long Prefix in Transformer

[openreview] [pdf]

Abstract Prompting and context-based fine-tuning methods, which we call Prefix Learning, have been proposed to enhance the performance of language models on various downstream tasks. They are empirically efficient and effective, matching the performance of full parameter fine-tuning, but theoretical understanding of them is limited. In this paper, we aim to address this limitation by studying their ability from the perspective of prefix length. In particular, we provide a convergence guarantee for training an ultra-long prefix in a stylized setting using the Neural Tangent Kernel (NTK) framework. Based on this strong theoretical guarantee, we design and implement an algorithm that only needs to introduce and fine-tune a few extra trainable parameters, instead of an infinite-long prefix in each layer of a transformer, and can approximate the prefix attention to a guaranteed polynomially small error. Preliminary experimental results on vision, natural language, and math data show that our method achieves superior or competitive performance compared to existing methods such as full-parameter fine-tuning, P-Tuning V2, and LoRA. This demonstrates that our method is promising for parameter-efficient fine-tuning.

983Think Twice Before You Act: Improving Inverse Problem Solving With MCMC

[openreview] [pdf]

Abstract Recent studies demonstrate that diffusion models can serve as a strong prior for solving inverse problems. A prominent example is Diffusion Posterior Sampling (DPS), which approximates the posterior distribution of data given the measurement using Tweedie’s formula. Despite the merit of being versatile in solving various inverse problems without re-training, the performance of DPS is hindered by the fact that this posterior approximation can be inaccurate, especially at high noise levels. Therefore, we propose Diffusion Posterior MCMC (DPMC), a novel inference algorithm based on annealed MCMC to solve inverse problems with pretrained diffusion models. We define a series of intermediate distributions inspired by the approximated conditional distributions used by DPS. Through annealed MCMC sampling, we encourage the samples to follow each intermediate distribution more closely before moving to the next distribution at a lower noise level, and therefore reduce the accumulated error along the path. We test our algorithm on various inverse problems, including super-resolution, Gaussian deblurring, motion deblurring, inpainting, and phase retrieval. Our algorithm outperforms DPS with fewer function evaluations across nearly all tasks, and is competitive with existing approaches.
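
The annealed-MCMC idea can be illustrated with a toy Langevin sampler: at each noise level, several inner MCMC moves target an intermediate distribution before the noise is lowered. The quadratic prior score below is a stand-in for a pretrained diffusion model's score network, and the step-size schedule is an assumption.

```python
import numpy as np

def score(x, sigma, y, A):
    # Toy score of an intermediate posterior: Gaussian prior term plus a
    # measurement-matching term (stand-ins for the diffusion score and the
    # DPS-style conditional guidance, respectively).
    prior = -x / (1.0 + sigma ** 2)
    likelihood = A.T @ (y - A @ x) / 0.1
    return prior + likelihood

A = np.random.randn(4, 8)                    # toy linear measurement operator
x_true = np.random.randn(8)
y = A @ x_true                               # observed measurement
x = np.random.randn(8)                       # initialize from noise
for sigma in np.geomspace(10.0, 0.05, 30):   # annealing schedule, high -> low noise
    step = 0.5 * sigma ** 2 * 1e-2
    for _ in range(5):                       # inner Langevin moves per level
        noise = np.sqrt(2 * step) * np.random.randn(8)
        x = x + step * score(x, sigma, y, A) + noise
```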

984Grounding Video Models to Actions through Goal Conditioned Exploration

[openreview] [pdf]

Abstract Large video models, pretrained on massive quantities of Internet video, provide a rich source of physical knowledge about the dynamics and motions of objects and tasks. However, video models are not grounded in the embodiment of an agent, and do not describe how to actuate the world to reach the visual states depicted in a video. To tackle this problem, current methods use a separate vision-based inverse dynamics model trained on embodiment-specific data to map image states to actions. Gathering data to train such a model is often expensive and challenging, and this model is limited to visual settings similar to the ones in which data is available. In this paper, we investigate how to directly ground video models to continuous actions through self-exploration in the embodied environment, using generated video states as visual goals for exploration. We propose a framework that uses trajectory-level action generation in combination with video guidance to enable an agent to solve complex tasks without any external supervision, e.g., rewards, action labels, or segmentation masks. We validate the proposed approach on 8 tasks in Libero, 6 tasks in MetaWorld, 4 tasks in Calvin, and 12 tasks in iThor Visual Navigation. We show that our approach is on par with, or even surpasses, multiple behavior cloning baselines trained on expert demonstrations, while requiring no action annotations.

985Mitigating Overestimation in Offline Reinforcement Learning with Anomaly Detection

[openreview] [pdf]

Abstract Reinforcement Learning (RL) encounters substantial challenges in real-world applications, due to the time-consuming, costly, and risky nature of interacting with the environment. Offline Reinforcement Learning addresses this limitation by training models on static datasets, allowing an optimal policy to be learned from pre-collected data without requiring additional interactions with the environment. However, in this setting, querying actions outside the training data distribution can lead to overestimation of Q-values for out-of-distribution (OOD) actions, ultimately hindering policy optimization. Previous works attempted to address this problem using explicit constraints such as penalty terms or support restrictions, but these methods often fail to identify OOD actions or result in overly conservative Q-value estimates. We propose a novel solution that adjusts weights during training by using an anomaly detection model to identify the distribution of the offline dataset and employing anomaly scores to guide the offline RL process. Our method (RLAD) not only effectively mitigates the overestimation of OOD actions but also achieves near state-of-the-art performance on continuous D4RL tasks. Additionally, this framework is highly flexible, allowing for integration with various off-policy or offline RL algorithms and anomaly detection models to enhance performance.

986ENHANCING DIVERSITY AND ACCURACY IN PERSONALIZED TAG RECOMMENDATIONS: A HYBRID SEMANTIC AND CONTEXTUAL ANALYSIS APPROACH

[openreview] [pdf]

Abstract This paper introduces HYCOMB, a cascading Hybrid model that innovatively integrates Collaborative Filtering (CF), Content-Based Filtering (CB), and Context-Aware (CA) methods to address the challenge of data sparsity in tag recommendation systems. Unlike traditional models that rely heavily on user-item interactions, HYCOMB enhances recommendation diversity and interpretability by utilizing semantic clustering in CF to extract and analyze user sentiment from tags, adding a layer of nuanced understanding often missing in conventional systems. The CB component advances this by applying sophisticated NLP techniques to refine these recommendations based on item attributes, while the CA component incorporates movie synopses for deeper contextual understanding. Developed and tested on the MovieLens 20M dataset, our model significantly outperforms baseline methods in terms of precision and recall, achieving scores of 0.813 and 0.364 respectively. Further, a newly introduced Overall Total Similarity metric underscores its ability to deliver relevant and diverse recommendations. HYCOMB’s strategic amalgamation of CF, CB, and CA not only mitigates the effects of sparse data but also improves the precision and diversity of tag recommendations, reflecting a more accurate alignment with user preferences.

987FEDERATED COMPOSITIONAL OPTIMIZATION: THE IMPACT OF TWO-SIDED LEARNING RATES ON COMMUNICATION EFFICIENCY

[openreview] [pdf]

Abstract Compositional optimization (CO) has recently gained popularity due to its applications in distributionally robust optimization (DRO), meta-learning, reinforcement learning, and many other machine learning applications. The large-scale and distributed nature of data necessitates efficient federated learning (FL) algorithms for CO, but the compositional structure of the objective poses significant challenges. Current methods either rely on large batch gradients (which are impractical) or suffer from suboptimal communication efficiency. To address these challenges, we propose efficient FedAvg-type algorithms for solving non-convex CO in the FL setting. We first establish that standard FedAvg fails to solve the federated CO problem due to data heterogeneity, which amplifies bias in local gradient estimates. Our analysis establishes that either additional communication or two-sided learning-rate-based algorithms are required to control this bias. To this end, we develop two algorithms for solving the federated CO problem. First, we propose FedDRO, which utilizes the compositional problem structure to design a communication strategy that allows FedAvg to control the bias in the estimation of the compositional gradient, achieving \mathcal{O}(\epsilon^{-2}) sample and \mathcal{O}(\epsilon^{-3/2}) communication complexity. Then we propose DS-FedDRO, a two-sided learning-rate algorithm that eliminates the need for additional communication and achieves the optimal \mathcal{O}(\epsilon^{-2}) sample and \mathcal{O}(\epsilon^{-1}) communication complexity, highlighting the importance of two-sided learning-rate algorithms for solving federated CO problems. The proposed algorithms avoid the need for large batch gradients and achieve linear speedup with the number of clients. We corroborate our theoretical findings with empirical studies on large-scale DRO problems.

988Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model

[openreview] [pdf]

Abstract Recent advancements in robot learning have used imitation learning with large models and extensive demonstrations to develop effective policies. However, these models are often limited by the quantity, quality, and diversity of demonstrations. This paper explores improving offline-trained imitation learning models through online interactions with the environment. We introduce Policy Decorator, which uses a model-agnostic residual policy to refine large imitation learning models during online interactions. By implementing controlled exploration strategies, Policy Decorator enables stable, sample-efficient online learning. Our evaluation spans eight tasks across two benchmarks (ManiSkill and Adroit) and involves two state-of-the-art imitation learning models (Behavior Transformer and Diffusion Policy). The results show that Policy Decorator effectively improves the offline-trained policies and preserves the smooth motion of imitation learning models, avoiding the erratic behaviors of pure RL policies. See our project page for videos.
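
The residual-policy construction is simple to sketch: a frozen base imitation model proposes an action and a small trainable head adds a bounded correction. The tanh bounding and scale below are assumptions standing in for the paper's controlled exploration strategies, which the abstract does not detail.

```python
import torch
import torch.nn as nn

class ResidualWrapper(nn.Module):
    def __init__(self, base_policy, obs_dim, act_dim, alpha=0.1):
        super().__init__()
        self.base = base_policy                 # frozen large imitation policy
        for p in self.base.parameters():
            p.requires_grad_(False)
        self.residual = nn.Sequential(          # small trainable correction head
            nn.Linear(obs_dim, 64), nn.ReLU(), nn.Linear(64, act_dim))
        self.alpha = alpha                      # bound on the residual's magnitude

    def forward(self, obs):
        with torch.no_grad():
            a_base = self.base(obs)             # base action, never updated
        # Bounded additive correction keeps the refined policy close to the
        # smooth base behavior while allowing online improvement.
        return a_base + self.alpha * torch.tanh(self.residual(obs))

policy = ResidualWrapper(nn.Linear(17, 6), obs_dim=17, act_dim=6)
action = policy(torch.randn(1, 17))             # base action plus learned residual
```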

989Balanced Hyperbolic Embeddings Are Natural Out-of-Distribution Detectors

[openreview] [pdf]

Abstract Out-of-distribution recognition forms an important and well-studied problem in computer vision, with the goal to filter out samples that do not belong to the distribution on which a network has been trained. The conclusion of this paper is simple: a good hierarchical hyperbolic embedding is preferred for discriminating in- and out-of-distribution samples. We introduce Balanced Hyperbolic Learning. We outline a hyperbolic class embedding algorithm that jointly optimizes for hierarchical distortion and balancing between shallow and wide subhierarchies. We can then use the class embeddings as hyperbolic prototypes for classification on in-distribution data. We outline how existing out-of-distribution scoring functions can be generalized to operate with hyperbolic prototypes. Empirical evaluations across 13 datasets and 13 scoring functions show that our hyperbolic embeddings outperform existing out-of-distribution approaches when trained on the same data with the same backbones. We also show that our hyperbolic embeddings outperform other hyperbolic approaches and naturally enable hierarchical out-of-distribution generalization.
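
The basic operation behind hyperbolic prototype classification is the distance on the Poincaré ball. A minimal sketch follows, with a nearest-prototype score as one simple instance of the hyperbolic OOD scoring functions the abstract generalizes; the prototype placement here is random and purely illustrative.

```python
import torch
import torch.nn.functional as F

def poincare_dist(x, y, eps=1e-5):
    # Distance on the Poincaré ball: acosh(1 + 2|x-y|^2 / ((1-|x|^2)(1-|y|^2))).
    sq = ((x - y) ** 2).sum(-1)
    nx = (1 - (x ** 2).sum(-1)).clamp_min(eps)
    ny = (1 - (y ** 2).sum(-1)).clamp_min(eps)
    return torch.acosh(1 + 2 * sq / (nx * ny))

protos = 0.9 * F.normalize(torch.randn(10, 16), dim=-1)  # class prototypes near boundary
z = 0.5 * F.normalize(torch.randn(4, 16), dim=-1)        # embedded test samples
d = poincare_dist(z.unsqueeze(1), protos.unsqueeze(0))   # (4, 10) pairwise distances
ood_score = -d.min(dim=1).values   # farther from every prototype => more OOD-like
```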

990Understanding Synthetic Context Extension via Retrieval Heads

[openreview] [pdf]

Abstract Long-context LLMs are increasingly desired for a broad set of applications such as retrieval-augmented generation. The high cost for pretraining LLMs over long contexts has led to exploration of fine-tuning LLMs with synthetically generated data in a post-training stage. However, it remains unclear how and why fine-tuning on synthetic data transfers to long-context performance on realistic tasks. In this paper, we investigate fine-tuning on synthetic data for three long-context tasks that require retrieval and reasoning. We explore synthetic data variants from the literature by varying the realism of the concept expression and context diversity of the data. We find that models trained on synthetic data fall short of the real data, but surprisingly, the mismatch can be interpreted and even predicted in terms of a special set of attention heads that are responsible for retrieval over long context, retrieval heads (Wu et al., 2024). The retrieval heads learned on synthetic data are mostly subsets of the retrieval heads learned on real data, and there is a strong correlation between the recall of heads learned and the downstream performance of a model. Furthermore, with attention knockout and activation patching, we mechanistically show that retrieval heads are not only necessary, but also provide fine-grained explanations for the performance gap between fine-tuning on synthetic and real data. Our results shed light on how to interpret the success and failure of synthetic data fine-tuning and how to create better synthetic data that can be transferred to realistic capabilities over long context.

991The Vital Role of Gradient Clipping in Byzantine-Resilient Distributed Learning

[openreview] [pdf]

Abstract Byzantine-resilient distributed machine learning seeks to achieve robust learning performance in the presence of misbehaving or adversarial workers. While state-of-the-art (SOTA) robust distributed gradient descent (Robust-DGD) methods were proven theoretically optimal, their empirical success has often relied on pre-aggregation gradient clipping. However, the currently considered static clipping strategy exhibits mixed results: improving robustness against some attacks while being ineffective or detrimental against others. We address this gap by proposing a principled adaptive clipping strategy, termed Adaptive Robust Clipping (ARC). We show that ARC consistently enhances the empirical robustness of SOTA Robust-DGD methods, while preserving the theoretical robustness guarantees. Our analysis shows that ARC provably improves the asymptotic convergence guarantee of Robust-DGD in the case when the model is well-initialized. We validate this theoretical insight through an exhaustive set of experiments on benchmark image classification tasks. We observe that the improvement induced by ARC is more pronounced in highly heterogeneous and adversarial settings.
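
The abstract does not give ARC's exact rule, so the following is only one plausible reading of adaptive pre-aggregation clipping: the largest gradient norms are clipped down to a threshold set by the rest of the population, with the cutoff tied to the tolerated number of Byzantine workers f. Both the choice of k and the threshold are assumptions.

```python
import numpy as np

def adaptive_clip(grads, f):
    # Clip the k largest-norm gradients to the (k+1)-th largest norm, so the
    # threshold adapts to the current population instead of being static.
    norms = np.linalg.norm(grads, axis=1)
    k = min(2 * f, len(grads) - 1)              # assumed choice of k
    tau = np.sort(norms)[-(k + 1)]              # (k+1)-th largest norm
    scale = np.minimum(1.0, tau / np.maximum(norms, 1e-12))
    return grads * scale[:, None]

grads = np.random.randn(10, 5)                  # one gradient per worker
grads[0] *= 100.0                               # a Byzantine outlier
clipped = adaptive_clip(grads, f=1)
robust_mean = np.mean(clipped, axis=0)          # then feed into a robust aggregator
```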

992Inference-Time Alignment of Diffusion Models with Direct Noise Optimization

[openreview] [pdf]

Abstract In this work, we focus on the alignment problem of diffusion models with a continuous reward function, which represents specific objectives for downstream tasks, such as increasing darkness or improving the aesthetics of images. The central goal of the alignment problem is to adjust the distribution learned by diffusion models such that the generated samples maximize the target reward function. We propose a novel alignment approach, named Direct Noise Optimization (DNO), that optimizes the injected noise during the sampling process of diffusion models. By design, DNO operates at inference time, and thus is tuning-free and prompt-agnostic, with the alignment occurring in an online fashion during generation. We rigorously study the theoretical properties of DNO and also propose variants to deal with non-differentiable reward functions. Furthermore, we identify that naive implementations of DNO occasionally suffer from the out-of-distribution reward-hacking problem, where optimized samples have high rewards but are no longer in the support of the pretrained distribution. To remedy this issue, we leverage classical high-dimensional statistics theory to design an effective probability regularization technique. We conduct extensive experiments on several important reward functions and demonstrate that the proposed DNO approach can achieve state-of-the-art reward scores within a reasonable time budget for generation.
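
The core loop is direct: treat the injected noise as the optimization variable, backpropagate a reward through a differentiable sampler, and regularize the noise to stay typical under the Gaussian prior. In this toy sketch the one-step sampler, the reward, and the norm-based regularizer are all illustrative assumptions rather than the paper's actual components.

```python
import torch

d = 256
sampler = lambda z: torch.tanh(z)       # dummy differentiable sampler stand-in
reward = lambda x: x.mean()             # toy reward, e.g. "increase brightness"

z = torch.randn(d, requires_grad=True)  # injected noise is the variable
opt = torch.optim.Adam([z], lr=1e-2)
for _ in range(300):
    opt.zero_grad()
    x = sampler(z)
    # Keep ||z|| near sqrt(d), where Gaussian mass concentrates, so the noise
    # stays plausible and the sample stays in-distribution (anti reward hacking).
    reg = (z.norm() - d ** 0.5) ** 2
    loss = -reward(x) + 1e-3 * reg
    loss.backward()
    opt.step()
```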

993On Learning Representations for Tabular Dataset Distillation

[openreview] [pdf]

Abstract Dataset distillation generates a small set of information-rich instances from a large dataset, resulting in reduced storage requirements, privacy or copyright risks, and computational costs for downstream modeling, though much of the research has focused on the image data modality. We study tabular data distillation, which brings in novel challenges such as the inherent feature heterogeneity and the common use of non-differentiable learning models (such as decision tree ensembles and nearest-neighbor predictors). To mitigate these challenges, we present TDColER, a tabular data distillation framework via column embeddings-based representation learning. To evaluate this framework, we also present a tabular data distillation benchmark, TDBench. Based on an elaborate evaluation on TDBench, resulting in 226,200 distilled datasets and 541,980 models trained on them, we demonstrate that TDColER is able to boost the distilled data quality of off-the-shelf distillation schemes by 0.5-143% across 7 different tabular learning models.

994Decoupling Backdoors from Main Task: Toward the Effective and Durable Backdoors in Federated Learning

[openreview] [pdf]

Abstract Federated learning, as a distributed machine learning method, enables multiple participants to collaboratively train a central model without sharing their private data. However, this decentralized mechanism introduces new privacy and security concerns. Malicious attackers can embed backdoors into local models, which are inherited by the central global model through the federated aggregation process. While previous studies have demonstrated the effectiveness of backdoor attacks, their effectiveness and durability often rely on unrealistic assumptions, such as a large number of attackers and scaled malicious contributions. These assumptions arise because a sufficient number of attackers can neutralize the contributions of honest participants, allowing the backdoor to be successfully inherited by the central model. In this work, we attribute these limitations to the coupling between the main and backdoor tasks. To address them, we propose a min-max backdoor attack framework that decouples backdoors from the main task, ensuring that the two tasks do not interfere with each other. The maximization phase employs the principle of universal adversarial perturbation to create triggers that amplify the performance disparity between poisoned and benign samples. These samples are then used to train a backdoor model in the minimization phase. We evaluate the proposed framework on both image classification and semantic analysis tasks. Comparisons with four backdoor attack methods under five defense algorithms show that our method achieves good attack performance even with a small number of attackers and without scaling the submitted model parameters. In addition, even if attackers are removed entirely from the training process, the implanted backdoors are not dramatically weakened by the contributions of other honest participants.

995Divergence-enhanced Knowledge-guided Context Optimization for Visual-Language Prompt Tuning

[openreview] [pdf]

Abstract Prompt tuning vision-language models like CLIP has shown great potential in learning transferable representations for various downstream tasks. The main issue is how to mitigate the over-fitting problem on downstream tasks with limited training samples. While knowledge-guided context optimization (Yao et al., 2023; 2024) has been proposed to handle catastrophic forgetting in the pre-trained backbone by constructing consistency constraints, it also introduces a potential bias toward pre-training. This paper proposes a novel and simple Divergence-enhanced Knowledge-guided Prompt Tuning (DeKg) method to address this issue. The key insight is that the bias toward pre-training can be alleviated by encouraging independence between the learnable and the crafted prompt. Specifically, DeKg employs the Hilbert-Schmidt Independence Criterion (HSIC) to regularize the learnable prompts, thereby reducing their dependence on prior general knowledge and enabling divergence induced by target knowledge. Comprehensive evaluations demonstrate that DeKg serves as a plug-and-play module that can seamlessly integrate with existing knowledge-guided methods and achieves superior performance on three challenging benchmarks.
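
HSIC itself has a short empirical form: the biased estimator built from two kernel matrices and a centering matrix. A sketch of the regularizer is below; the prompt feature tensors, kernel bandwidth, and how the penalty is weighted against the task loss are assumptions, since the abstract does not specify them.

```python
import torch

def gaussian_kernel(x, sigma=1.0):
    d2 = torch.cdist(x, x) ** 2
    return torch.exp(-d2 / (2 * sigma ** 2))

def hsic(x, y, sigma=1.0):
    # Biased empirical HSIC: trace(K H L H) / (n - 1)^2, with H the centering matrix.
    n = x.size(0)
    K, L = gaussian_kernel(x, sigma), gaussian_kernel(y, sigma)
    H = torch.eye(n) - torch.ones(n, n) / n
    return torch.trace(K @ H @ L @ H) / (n - 1) ** 2

learnable = torch.randn(16, 512, requires_grad=True)  # learnable prompt features
crafted = torch.randn(16, 512)                        # fixed hand-crafted prompt features
penalty = hsic(learnable, crafted)   # add to the task loss with a small weight
penalty.backward()                   # lowering HSIC encourages independence
```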

996Editable Concept Bottleneck Models

[openreview] [pdf]

Abstract Concept Bottleneck Models (CBMs) have garnered much attention for their ability to elucidate the prediction process through a human-understandable concept layer. However, most previous studies focused on cases where the data, including concepts, are clean. In many scenarios, we often need to remove training data or insert new concepts into trained CBMs for various reasons, such as privacy concerns, data mislabelling, spurious concepts, and concept annotation errors. Thus, the challenge of deriving efficient editable CBMs without retraining from scratch persists, particularly in large-scale applications. To address these challenges, we propose Editable Concept Bottleneck Models (ECBMs). Specifically, ECBMs support three different levels of data removal: concept-label-level, concept-level, and data-level. ECBMs enjoy mathematically rigorous closed-form approximations derived from influence functions that obviate the need for retraining. Experimental results demonstrate the efficiency and effectiveness of our ECBMs, affirming their adaptability within the realm of CBMs.
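
The influence-function style of closed-form editing can be illustrated on a ridge-regularized linear model, where removing one training point costs a single Hessian solve rather than retraining. The CBM-specific concept structure is omitted, so this is only a toy analogue of the paper's approximations.

```python
import numpy as np

def remove_point(theta, X, y, i, lam=1e-2):
    # First-order influence approximation for removing training point i:
    # theta' ~ theta + (1/n) H^{-1} grad_loss_i(theta), with H the Hessian
    # of the ridge-regularized average squared loss.
    n, d = X.shape
    H = X.T @ X / n + lam * np.eye(d)
    g_i = X[i][:, None] * (X[i] @ theta - y[i])   # gradient of the removed point
    return theta + np.linalg.solve(H, g_i.ravel()) / n

X = np.random.randn(100, 5)
y = X @ np.ones(5) + 0.1 * np.random.randn(100)
theta = np.linalg.solve(X.T @ X / 100 + 1e-2 * np.eye(5), X.T @ y / 100)
theta_edit = remove_point(theta, X, y, i=0)       # edited parameters, no retraining
```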

997Diffusion Attacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak

[openreview] [pdf]

Abstract Large Language Models can generate harmful content when prompted with carefully crafted inputs, a vulnerability known as LLM jailbreaking. As LLMs become more powerful, studying jailbreaking becomes a critical aspect of enhancing security and human value alignment. Currently, jailbreaking is usually implemented by adding suffixes or using prompt templates, approaches that suffer from low attack diversity. Inspired by diffusion models, this paper introduces DiffusionAttacker, an end-to-end generative method for jailbreak rewriting. Our approach employs a seq2seq text diffusion model as a generator, conditioning on the original prompt and guiding the denoising process with a novel attack loss. This method preserves the semantic content of the original prompt while producing harmful content. Additionally, we leverage the Gumbel-Softmax technique to make the sampling process from the diffusion model’s output distribution differentiable, thereby eliminating the need for an iterative token search. Through extensive experiments on AdvBench and HarmBench, we show that DiffusionAttacker outperforms previous methods on various evaluation metrics, including attack success rate (ASR), fluency, and diversity.
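
The Gumbel-Softmax step is worth making concrete: it replaces hard token sampling with a differentiable straight-through relaxation, so a loss computed downstream can reach the generator's logits. The vocabulary size, embedding table, and stand-in loss below are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

logits = torch.randn(1, 20, 32000, requires_grad=True)  # (batch, seq_len, vocab)
# hard=True yields one-hot samples in the forward pass but uses the soft
# relaxation's gradient in the backward pass (straight-through estimator).
one_hot_tokens = F.gumbel_softmax(logits, tau=0.5, hard=True)
embedding = torch.nn.Embedding(32000, 768)
token_embs = one_hot_tokens @ embedding.weight           # differentiable "lookup"
attack_loss = token_embs.pow(2).mean()                   # stand-in for the attack loss
attack_loss.backward()                                   # gradients flow to `logits`
```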

998Bad-PFL: Exploiting Backdoor Attacks against Personalized Federated Learning

[openreview] [pdf]

Abstract Data heterogeneity and backdoor attacks rank among the most significant challenges facing federated learning (FL). For data heterogeneity, personalized federated learning (PFL) enables each client to maintain a private personalized model that caters to client-specific knowledge. Meanwhile, vanilla FL has proven vulnerable to backdoor attacks. However, recent advancements in the PFL community have demonstrated a potential immunity against such attacks. This paper explores this intersection further, revealing that existing federated backdoor attacks fail in PFL because backdoors based on manually designed triggers struggle to survive in personalized models. To tackle this, we design Bad-PFL, which employs features from natural data as its trigger. As long as the model is trained on natural data, it inevitably embeds the backdoor associated with our trigger, ensuring its longevity in personalized models. Moreover, our trigger undergoes mutual reinforcement training with the model, further solidifying the backdoor’s durability and enhancing attack effectiveness. Large-scale experiments across three benchmark datasets demonstrate the superior performance of our attack against various PFL methods, even when they are equipped with state-of-the-art defense mechanisms.

999Efficient Predictive Counterfactual Regret Minimization+ Algorithm in Solving Extensive-Form Games

[openreview] [pdf]

Abstract Imperfect-information extensive-form games (IIGs) serve as a foundational model for capturing interactions among multiple agents in sequential settings with hidden information. A common objective of IIGs is to calculate a Nash equilibrium (NE). Counterfactual Regret Minimization (CFR) algorithms have been widely developed to learn an NE in two-player zero-sum IIGs. Among CFR algorithms, Predictive CFR+ (PCFR+) is powerful, usually achieving an extremely fast empirical convergence rate. However, PCFR+ suffers from the significant discrepancy between strategies represented by explicit accumulated counterfactual regrets across two consecutive iterations, which decreases the empirical convergence rate of PCFR+ in practice. To mitigate this significant discrepancy, we introduce a novel and effective variant of PCFR+, termed Pessimistic PCFR+ (P2PCFR+), minimizing the discrepancy between strategies represented by implicit and explicit accumulated regrets within the same iteration. We provide theoretical proof to show that P2PCFR+ exhibits a faster theoretical convergence rate than PCFR+. Experimental results demonstrate that P2PCFR+ outperforms other tested CFR variants.
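
For context, the predictive regret-matching+ update at the heart of PCFR+ can be written in a few lines for a two-action matrix game; the pessimistic modification that P2PCFR+ introduces is not reproduced here, so this sketch covers only the baseline it improves on.

```python
import numpy as np

def rm_plus_strategy(r):
    # Regret matching+: play proportionally to positive (predicted) regrets.
    pos = np.maximum(r, 0.0)
    s = pos.sum()
    return pos / s if s > 0 else np.full_like(r, 1.0 / len(r))

A = np.array([[1.0, -1.0], [-1.0, 1.0]])     # matching pennies payoffs
Rx = np.zeros(2); Ry = np.zeros(2)           # accumulated regrets (RM+ truncated)
mx = np.zeros(2); my = np.zeros(2)           # predictions of the next regret
for t in range(10000):
    x = rm_plus_strategy(Rx + mx)            # predictive step: act on R + prediction
    y = rm_plus_strategy(Ry + my)
    ux, uy = A @ y, -A.T @ x                 # per-action utilities for each player
    rx, ry = ux - x @ ux, uy - y @ uy        # instantaneous regrets
    Rx = np.maximum(Rx + rx, 0.0)            # RM+ keeps accumulated regrets nonnegative
    Ry = np.maximum(Ry + ry, 0.0)
    mx, my = rx, ry                          # predict next regret with the last one
```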

1000Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs

[openreview] [pdf]

Abstract Multimodal large language models (MLLMs) offer a powerful mechanism for interpreting visual information. However, they often suffer from hallucinations, which impede the real-world usage of these models. Existing methods attempt to alleviate this issue by designing special decoding strategies that penalize the summary tokens. However, these methods lack analysis of the relationship between hallucination and the summarization mechanism of LLMs. Interestingly, we find that penalizing summary tokens is not necessary: merely intervening on the variance of the query-key parameters, without costing extra inference time, still alleviates hallucinations. Specifically, we explore the causes of hallucinations by analyzing localized self-attention patterns called "anchor" tokens and define the attention localization degree of the model as token propagation probabilities. Our analysis reveals that over-propagation of anchor tokens occurs when the distribution of eigenvalues of the query and key matrices has a non-zero mean and a polarized variance, leading to excessive dependence on anchor tokens while neglecting visual information, which causes the model to describe image content with hallucinations. Based on this observation, we propose a versatile plug-and-play decoding strategy, the Dynamic Token Propagation Mechanism (TAME), to alleviate excessive propagation by dynamically intervening on the eigenspectrum variance of the attention weights, thereby alleviating hallucinations without relying on complex decoding strategies. Extensive experiments reveal a correlation between the eigenspectrum and hallucinations across various MLLMs, and show that TAME reduces the percentage of hallucinated objects.