Sampler: takes the main Stable Diffusion model as an input, takes both positive and negative prompts encoded by CLIP model + a latent image (can be blank)
sampler takes this input latent image, adds noise to it and then denoises it using the main model
prompts and negative prompts are passed to model at each sampling step
sampler outputs the denoised image
VAE: translates an image from latent space to pixel space
Prompting: (word:1.5) means it is 1.5 more effective