Paella is an easy-to-use text-to-image model that can turn text into pictures. It was inspired by earlier models but has simpler code for training and sampling. During training, it "noises" images by randomly replacing visual elements with others from a library, and then tries to predict the original elements. During sampling, the model creates a distribution over each element and then selects one at random to build up the final image. Paella is designed to make text-to-image models more accessible to non-experts in the field by simplifying the technical components.
FoldFold allExpandExpand allAre you sure you want to delete this link?Are you sure you want to delete this tag?
The personal, minimalist, super-fast, database free, bookmarking service by the Shaarli community