SCL is a vision-language model pre-trained on COCO, VG, CC3M, SBU.
The code of SCL can be found at https://github.com/IIGROUP/SCL.
We have uploaded pre-trained model weights.
GLSCL-100k: pre-training with MLM, CL, ITM, MGSC, MLTC
MGSC-100k: pre-training with MLM, CL, ITM, MGSC