Paella: Simple & Efficient Text-To-Image generation
https://github.com/dome272/Paella
Paella is an easy-to-use text-to-image model that can turn text into pictures. It was inspired by earlier models but has simpler code for training and sampling. During training, it "noises" images by randomly replacing visual elements with others from a library, and then tries to predict the original elements. During sampling, the model creates a distribution over each element and then selects one at random to build up the final image. Paella is designed to make text-to-image models more accessible to non-experts in the field by simplifying the technical components.
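The noising and sampling steps described above can be illustrated with a minimal sketch. It is not the code from the Paella repository: it assumes the "visual elements" are integer indices into a fixed-size library (called `codebook_size` here), and the names `noise_tokens`, `sample_tokens`, and `noise_ratio` are hypothetical. The trained model that would produce the per-element scores is left out entirely.

```python
import math
import random


def noise_tokens(tokens, codebook_size, noise_ratio, rng):
    """Replace each token with a random library index with probability noise_ratio.

    Assumption: an image is a flat list of integer tokens in [0, codebook_size).
    Returns the noised tokens and a per-position flag marking which were replaced.
    """
    noised, replaced = [], []
    for tok in tokens:
        if rng.random() < noise_ratio:
            noised.append(rng.randrange(codebook_size))
            replaced.append(True)
        else:
            noised.append(tok)
            replaced.append(False)
    return noised, replaced


def softmax(logits):
    """Turn raw scores into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_tokens(logits_per_position, rng):
    """Pick one token per position at random from softmax(logits).

    Assumption: logits_per_position stands in for the model's output,
    one list of codebook_size scores per image position.
    """
    sampled = []
    for logits in logits_per_position:
        probs = softmax(logits)
        sampled.append(rng.choices(range(len(probs)), weights=probs, k=1)[0])
    return sampled


if __name__ == "__main__":
    rng = random.Random(0)
    original = [3, 1, 4, 1, 5, 9, 2, 6]  # toy "image" of 8 tokens
    noised, replaced = noise_tokens(original, codebook_size=16, noise_ratio=0.5, rng=rng)
    print("noised:", noised)
    print("replaced:", replaced)
    # Stand-in for model output: uniform scores over a 16-entry library.
    fake_logits = [[0.0] * 16 for _ in original]
    print("sampled:", sample_tokens(fake_logits, rng))
```

In training, the model would receive `noised` and be scored on how well it predicts `original`. In sampling, `sample_tokens` would be applied to the model's actual output rather than to the placeholder scores used here.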