MNIST

MNIST:

This dataset contains grayscale images for handwritten digits in which half of the training set and half of the test set are collected among Census Bureau employees and the second half of each training and test sets are collected among high school students. The dataset is a subset of images from two parent datasets NIST’s Special Database 3 and Special Database 1.

Here is some information regarding the MNIST dataset:

  • Number of images in the dataset: 70,000 (60,000 images for the training set: 30,000 from NIST’s Special Database 3 and 30,000 from NIST’s Special Database 1. 10,000 images for the test set: 5000 from Special Database 3 and 5000 from Special Database 1)

  • Image size: 28×28

  • Number of classes: 10 (0, 1, 2, 3, 4, 5, 6, 7, 8, 9)

4 data files are available for download from http://yann.lecun.com/exdb/mnist/ which contain training set images, training set labels, test set images and test set labels. Please note that the images in this dataset do not have the image format and the user is supposed to write a short code to read them. The details about the file format is available on the mentioned address.

keywords: Vision, Image, Classification, Handwritten Digits