gitmyhub

fashion-mnist

Python ★ 13k updated 4y ago

A MNIST-like fashion product database. Benchmark :point_down:

A dataset of 70,000 clothing images in 10 categories designed as a harder drop-in replacement for the classic MNIST handwritten digit dataset. Loads directly in PyTorch, TensorFlow, and Keras with no manual download needed.

PythonPyTorchTensorFlowKerasNumPysetup: easycomplexity 2/5

Fashion-MNIST is a dataset created by the research team at Zalando, the European fashion retailer. It contains 70,000 small grayscale images of clothing items, split into 60,000 for training machine learning models and 10,000 for testing them. Each image is 28 by 28 pixels and belongs to one of ten categories: T-shirt or top, trouser, pullover, dress, coat, sandal, shirt, sneaker, bag, or ankle boot.

The dataset was built as a direct replacement for a much older and widely used dataset called MNIST, which contains handwritten digit images in the exact same format. The Zalando team's argument is that MNIST has become too easy: modern machine learning methods can classify handwritten digits with near-perfect accuracy, which makes it a poor test for comparing different approaches. They argue that clothing images are harder to distinguish, making Fashion-MNIST a more meaningful benchmark while keeping the same file format and size so that existing code written for MNIST can load it without any changes.

Because it mirrors the MNIST format exactly, Fashion-MNIST is built into many popular machine learning libraries already. You can load it directly in PyTorch, TensorFlow, Keras, Hugging Face Datasets, and several others without downloading anything manually. If you prefer to download the raw files yourself, the README links to them directly.

The repository also includes scripts for loading the data with NumPy, visualizing the images, and running benchmark comparisons. The README lists results from many different approaches that have been tested on this dataset, providing a reference table researchers can use to see how their own methods compare.

This is a dataset and benchmarking resource, not an application. Its audience is machine learning researchers and practitioners who want a standard test case that is more demanding than handwritten digits but still simple enough to iterate on quickly.

Where it fits