Date

4 years ago

Dataset organization

Release URL

github.com

Paper URL

arxiv.org

License

Other

Tags

Image captioning

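CC12M (Conceptual 12M) is a dataset of image-text pairs built specifically for vision-and-language pre-training. It contains 12 million image-text pairs. Compared with CC3M, pre-training on CC12M gives better results on multiple downstream tasks, particularly in long-tail visual recognition.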
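As a rough illustration of working with image-text pairs, the sketch below reads pairs from tab-separated text. The file layout (one image URL and one caption per line, separated by a tab) is an assumption made for this example and is not stated on this page; the names `ImageTextPair` and `iter_pairs` and the sample URLs are hypothetical.

```python
import csv
import io
from typing import Iterable, Iterator, NamedTuple


class ImageTextPair(NamedTuple):
    """One record: where the image lives and the text that describes it."""
    image_url: str
    caption: str


def iter_pairs(lines: Iterable[str]) -> Iterator[ImageTextPair]:
    """Yield pairs from tab-separated lines (assumed layout: URL, then caption)."""
    # QUOTE_NONE keeps quotation marks inside captions as literal characters.
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != 2:
            # Skip lines that do not match the assumed two-column layout.
            continue
        url, caption = row
        yield ImageTextPair(url.strip(), caption.strip())


# Tiny in-memory sample standing in for the real file (hypothetical data).
sample = io.StringIO(
    "https://example.com/a.jpg\ta dog running on a beach\n"
    "malformed line without a tab\n"
    "https://example.com/b.jpg\ta red umbrella on a rainy street\n"
)
pairs = list(iter_pairs(sample))
for pair in pairs:
    print(pair.image_url, "|", pair.caption)
```

Because the function takes any iterable of lines, an open file handle can be passed in place of the in-memory sample, so a file with millions of rows is streamed rather than loaded at once.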

Citation

@inproceedings{changpinyo2021cc12m,
  title = {{Conceptual 12M}: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts},
  author = {Changpinyo, Soravit and Sharma, Piyush and Ding, Nan and Soricut, Radu},
  booktitle = {CVPR},
  year = {2021},
}

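This dataset was contributed by community users and is provided for educational and informational purposes only. If any content infringes copyright, please contact us at [email protected] and we will promptly review and remove it.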