Skip to content

DEV Community

Super Kai (Kazuya Ito)

Posted on Nov 21 • Edited on Dec 18

Datasets for Computer Vision (2)

#python #pytorch #dataset #computervision

Buy Me a Coffee☕

*Memos:

My post explains MNIST, EMNIST, QMNIST, ETLCDB, Kuzushiji and Moving MNIST.
My post explains Oxford-IIIT Pet, Oxford 102 Flower, Stanford Cars, Places365, Flickr8k and Flickr30k.
My post explains ImageNet, LSUN and MS COCO.
My post explains PASCAL VOC, SUN Database, Kinetics Dataset and Cityscapes.
My post explains Image Classification(Recognition), Object Localization, Object Detection and Image Segmentation.
My post explains Keypoint Detection(Landmark Detection), Image Matching, Object Tracking, Stereo Matching, Video Prediction, Optical Flow, Image Captioning.

(1) Fashion-MNIST(2017):

has the 70,000 fashion images each connected to the label from 10 classes: *Memos:
- 60,000 for train and 10,000 for test.
- Each image has 28x28 pixels.
is FashionMNIST() in PyTorch. *My post explains FashionMNIST().
is used for Image Classification.

(2) Caltech 101(2003):

has the 8,677 object images each connected to the label from 101 categories(classes). *Each image has roughly 300x200 pixels.
is used for Image Classification.
is Caltech101() in PyTorch. *My post explains Caltech101().

(3) Caltech 256(2007):

has the 30,607 object images connected to the label from 257 categories(classes). *Actually, it has 257 categories(classes) against the name Caltech 256.
is used for Image Classification.
is Caltech256() in PyTorch. *My post explains Caltech256().

(4) CelebA(Large-scale CelebFaces Attributes)(2015):

has the 202,599 celebrity face images each connected to the label from 8192 unique identities(classes) and each connected to 40 attributes: *Memos:
- 162,770 for train, 19,867 for validation and 19,962 for test.
- Each image has 5 landmarks.
- Directly downloading it from Google Drive is recommended because downloading it with Google Drive API from Google Drive is too crowded.
is used for Keypoint Detection and Fine-Grained Image Classification.
is CelebA() in PyTorch. *My post explains CelebA().

(5) CIFAR-10(Canadian Institute For Advanced Research-10)(2009):

has the 60,000 vehicle and animal images each connected to the label from 10 classes: *Memos:
- 50,000 for train and 10,000 for test.
- Each image has 32x32 pixels.
is used for Image Classification.
is CIFAR10() in PyTorch. *My post explains CIFAR10().

(6) CIFAR-100(Canadian Institute For Advanced Research-100)(2009):

has the 60,000 object images each connected to the label from 100 classes: *Memos:
- 50,000 for train and 10,000 for test.
- Each image has 32x32 pixels.
is used for Image Classification.
is CIFAR100() in PyTorch. *My post explains CIFAR100().

Top comments (0)

Subscribe

Read next

Best open source Image to Video CogVideoX1.5-5B-I2V is pretty decent and optimized for low VRAM

Furkan Gözükara - Dec 25

Merge PDFs Recursively Using Python

abbazs - Dec 24

Python Tutorial - 3 Data Structure

nadirbasalamah - Dec 23

Advent of Code 2024 - Day 21: Keypad Conundrum

Grant Riordan - Dec 22