Diffusers documentation
Reinforcement learning training with DDPO
Get started
Tutorials
OverviewUnderstanding pipelines, models and schedulersAutoPipelineTrain a diffusion modelInference with PEFT
Using Diffusers
Loading & Hub
OverviewLoad pipelines, models, and schedulersLoad and compare different schedulersLoad community pipelines and componentsLoad safetensorsLoad different Stable Diffusion formatsLoad adaptersPush files to the Hub
Tasks
Techniques
Textual inversionDistributed inference with multiple GPUsImprove image quality with deterministic generationControl image brightnessPrompt weightingImprove generation quality with FreeU
Specific pipeline examples
OverviewStable Diffusion XLSDXL TurboKandinskyControlNetShap-EDiffEditDistilled Stable Diffusion inferencePipeline callbacksCreate reproducible pipelinesCommunity pipelinesContribute a community pipelineLatent Consistency Model-LoRALatent Consistency ModelStable Video Diffusion
Training
OverviewCreate a dataset for trainingAdapt a model to a new task
Models
Unconditional image generationText-to-imageStable Diffusion XLKandinsky 2.2WuerstchenControlNetT2I-AdaptersInstructPix2Pix
Methods
Taking Diffusers Beyond Images
Optimization
Conceptual Guides
PhilosophyControlled generationHow to contribute?Diffusers' Ethical GuidelinesEvaluating Diffusion Models
API
Main Classes
Loaders
Models
OverviewUNet1DModelUNet2DModelUNet2DConditionModelUNet3DConditionModelUNetMotionModelVQModelAutoencoderKLAsymmetricAutoencoderKLTiny AutoEncoderConsistencyDecoderVAETransformer2DTransformer TemporalPrior TransformerControlNet
Pipelines
OverviewAltDiffusionAnimateDiffAttend-and-ExciteAudio DiffusionAudioLDMAudioLDM 2AutoPipelineBLIP-DiffusionConsistency ModelsControlNetControlNet with Stable Diffusion XLCycle DiffusionDance DiffusionDDIMDDPMDeepFloyd IFDiffEditDiTInstructPix2PixKandinsky 2.1Kandinsky 2.2Kandinsky 3Latent Consistency ModelsLatent DiffusionMultiDiffusionMusicLDMPaint by ExampleParallel Sampling of Diffusion ModelsPix2Pix ZeroPixArt-αPNDMRePaintScore SDE VESelf-Attention GuidanceSemantic GuidanceShap-ESpectrogram Diffusion
Stable Diffusion
OverviewText-to-imageImage-to-imageInpaintingDepth-to-imageImage variationSafe Stable DiffusionStable Diffusion 2Stable Diffusion XLSDXL TurboLatent upscalerSuper-resolutionLDM3D Text-to-(RGB, Depth), Text-to-(RGB-pano, Depth-pano), LDM3D UpscalerStable Diffusion T2I-AdapterGLIGEN (Grounded Language-to-Image Generation)
Stable unCLIPStochastic Karras VEText-to-image model editingText-to-videoText2Video-ZerounCLIPUnconditional Latent DiffusionUniDiffuserValue-guided samplingVersatile DiffusionVQ DiffusionWuerstchenSchedulers
OverviewCMStochasticIterativeSchedulerConsistencyDecoderSchedulerDDIMInverseSchedulerDDIMSchedulerDDPMSchedulerDEISMultistepSchedulerDPMSolverMultistepInverseDPMSolverMultistepSchedulerDPMSolverSDESchedulerDPMSolverSinglestepSchedulerEulerAncestralDiscreteSchedulerEulerDiscreteSchedulerHeunDiscreteSchedulerIPNDMSchedulerKarrasVeSchedulerKDPM2AncestralDiscreteSchedulerKDPM2DiscreteSchedulerLCMSchedulerLMSDiscreteSchedulerPNDMSchedulerRePaintSchedulerScoreSdeVeSchedulerScoreSdeVpSchedulerUniPCMultistepSchedulerVQDiffusionScheduler
Internal classes
You are viewing v0.24.0 version. A newer version v0.38.0 is available.
Reinforcement learning training with DDPO
You can fine-tune Stable Diffusion on a reward function via reinforcement learning with the 🤗 TRL library and 🤗 Diffusers. This is done with the Denoising Diffusion Policy Optimization (DDPO) algorithm introduced by Black et al. in Training Diffusion Models with Reinforcement Learning, which is implemented in 🤗 TRL with the DDPOTrainer.
For more information, check out the DDPOTrainer API reference and the Finetune Stable Diffusion Models with DDPO via TRL blog post.