Both Sora and Stable Diffusion 3 adopt diffusion transformers, but do we really need a super large DiT for all sampling steps for generation?
No. We found ~40% early timesteps of DiT-XL can be replaced with a 10x faster DiT-S without image quality drop!
Introduce Trajectory Stitching (T-Stitch), a training-free method that complements existing efficient sampling methods by dynamically allocating computation to different denoising steps.
Paper: Code:
GitHub - NVlabs/T-Stitch: Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching"
Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching" - NVlabs/T-Stitch
github.com
T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching
T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching
t-stitch.github.io