Stable Diffusion Models
We recommend you use Stable Diffusion with Hugging Face Diffusers library. You can also use the original CompVis code.
1. CompVis Stable Diffusion
CompVis Stable Diffusion
https://github.com/compvis/stable-diffusion
Stable Diffusion is a latent text-to-image diffusion model capable of generating photo-realistic images given any text input.
- CompVis/stable-diffusion-v-1-1-original
https://huggingface.co/CompVis/stable-diffusion-v-1-1-original
- CompVis/stable-diffusion-v-1-2-original
https://huggingface.co/CompVis/stable-diffusion-v-1-2-original
- CompVis/stable-diffusion-v-1-3-original
https://huggingface.co/CompVis/stable-diffusion-v-1-3-original
- CompVis/stable-diffusion-v-1-4-original
https://huggingface.co/CompVis/stable-diffusion-v-1-4-original
2. Hugging Face Diffusers
Hugging Face Diffusers
https://github.com/huggingface/diffusers
Stable Diffusion is a latent text-to-image diffusion model capable of generating photo-realistic images given any text input.
- CompVis/stable-diffusion-v1-1
https://huggingface.co/CompVis/stable-diffusion-v1-1
- CompVis/stable-diffusion-v1-2
https://huggingface.co/CompVis/stable-diffusion-v1-2
- CompVis/stable-diffusion-v1-3
https://huggingface.co/CompVis/stable-diffusion-v1-3
- CompVis/stable-diffusion-v1-4
https://huggingface.co/CompVis/stable-diffusion-v1-4
3. Contrastive Language-Image Pre-Training (CLIP)
- openai/clip-vit-large-patch14
https://huggingface.co/openai/clip-vit-large-patch14
- CLIP
https://github.com/openai/CLIP
CLIP (Contrastive Language-Image Pre-Training) is a neural network trained on a variety of (image, text) pairs.
References
[1] Yongqiang Cheng, https://yongqiang.blog.csdn.net/