OpenFlamingo-9B Demohttps://7164d2142d11.ngrok.app/
OpenFlamingo is a new tool that helps computers learn how to understand pictures and words together.
The OpenFlamingo project aims to develop a multimodal system capable of processing and reasoning about images, videos, and text, with the ultimate goal of matching the power and versatility of GPT-4 in handling visual and text input. The project is creating an open-source version of DeepMind's Flamingo model, which is a LMM trained on large-scale web corpora containing interleaved text and images. The OpenFlamingo model implements the same architecture as Flamingo, but is trained on open-source datasets, with the released OpenFlamingo-9B checkpoint trained on 5M samples from the Multimodal C4 dataset and 10M samples from LAION-2B.