COCO2017数据集,官方地址。
COCO2017包含训练集118287张,验证集5000张,80类。此外,该仓库还提供了一个训练子集(25504张)。
实例分割。
目录结构
COCO2017train
├── annotations
│ └── instances_train2017.json
└── train2017
COCO2017subtrain
├── annotations
│ └── instances_subtrain2017.json
└── subtrain2017
COCO2017val
├── annotations
│ └── instances_val2017.json
└── val2017COCO的实例分割标注字段如下,完整的细节可参考这里:
{
"images": [image],
"annotations": [annotation],
"categories": [category]
}
image = {
"id": int,
"width": int,
"height": int,
"file_name": str,
}
annotation = {
"id": int,
"image_id": int,
"category_id": int,
"segmentation": RLE or [polygon],
"area": float,
"bbox": [x,y,width,height],
"iscrowd": 0 or 1,
}
categories = [{
"id": int,
"name": str,
"supercategory": str,
}]本仓库目前提供了训练集、验证集和一个训练子集的下载。
from modelscope.msdatasets import MsDataset
from modelscope.utils.constant import DownloadMode
dataset_train = MsDataset.load('COCO2017_Instance_Segmentation', split='train',
download_mode=DownloadMode.FORCE_REDOWNLOAD)
dataset_subtrain = MsDataset.load('COCO2017_Instance_Segmentation', split='subtrain',
download_mode=DownloadMode.FORCE_REDOWNLOAD)
dataset_val = MsDataset.load('COCO2017_Instance_Segmentation', split='validation',
download_mode=DownloadMode.FORCE_REDOWNLOAD)
print(dataset_train.config_kwargs)
print(dataset_subtrain.config_kwargs)
print(dataset_val.config_kwargs)本数据集遵循Creative Commons Attribution 4.0 License,更多的版权、授权使用信息请参考这里。
@inproceedings{lin2014microsoft,
title={Microsoft coco: Common objects in context},
author={Lin, Tsung-Yi and Maire, Michael and Belongie, Serge and Hays, James and Perona, Pietro and Ramanan, Deva and Doll{\'a}r, Piotr and Zitnick, C Lawrence},
booktitle={European conference on computer vision},
pages={740--755},
year={2014},
organization={Springer}
}