deeplearning-models
A collection of various deep learning architectures, models, and tips
A collection of Jupyter Notebooks implementing classic deep learning architectures, from basic perceptrons to CNNs like AlexNet and MobileNet, in both TensorFlow and PyTorch, with code and explanations side by side.
This repository is a collection of deep learning architectures, models, and tips packaged as Jupyter Notebooks. Jupyter Notebooks are interactive documents that mix code, charts, and explanations, which makes them well suited for learning how a model works step by step. The README presents implementations across two of the main deep learning frameworks: TensorFlow and PyTorch (with some additional examples using PyTorch Lightning), so the same idea is often shown in more than one style.
The notebooks are organised by topic. There is a section on traditional machine learning starters such as the Perceptron, Logistic Regression, and Softmax Regression, trained on simple datasets like 2D toy data, the Iris dataset, and MNIST (a classic dataset of handwritten digits). A multilayer perceptron section adds variations like dropout, batch normalization, and a from-scratch backpropagation implementation. The largest area covers convolutional neural networks, which are models specialised for image data: basic CNNs, weight-initialisation variants, and named architectures including AlexNet, DenseNet-121, an All-Convolutional Net, LeNet-5, MobileNet v2 and v3, and Network in Network, with examples trained on datasets such as MNIST, CIFAR-10, and QuickDraw.
Someone would use this when they want to study or reuse reference implementations of well-known deep learning models side by side, rather than reading papers or building from scratch. The tech stack stated in the README is Python notebooks running on TensorFlow, PyTorch, and PyTorch Lightning. The full README is longer than what was provided.
Where it fits
- Study how a specific neural network architecture such as LeNet-5 or DenseNet works by running a step-by-step notebook instead of reading the paper alone.
- Compare the same model implemented in TensorFlow versus PyTorch side by side to decide which framework to adopt.
- Reuse a reference implementation of a named architecture as a starting point for a new deep learning project or course assignment.