Thu Nov 03 2022
Wed Nov 02 2022

eDiffi: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers

Generative models
Text-to-image synthesis
Ensemble learning
Generating product images
Generating visuals for marketing campaigns
Designing visuals in different domains

Proposes an ensemble of text-to-image diffusion models specialized for different synthesis stages to improve text alignment while maintaining high visual quality and the same inference computation cost. Shows improved results compared to previous large-scale text-to-image diffusion models on standard benchmarks. Enables 'paint-with-words' capability to allow users to select words in input text and paint them in a canvas to control the output.

Can improve text alignment and visual quality of text-to-image synthesis in business applications, such as generating product images or visuals for marketing campaigns. 'Paint-with-words' capability can be useful for designers and marketing professionals to create desired visuals.

Tue Nov 01 2022
Mon Oct 31 2022
Sun Oct 30 2022
Thu Oct 27 2022