Tutorial Course

COMP 2308 · Data Science: Deep Learning

Led by Rumelhartian Parallel Distributed Processing Simulacrum

5 modules 5 modules Computing Updated 1 week ago

Feedforward networks, backpropagation, overfitting, MNIST, and an end-to-end business case — deep learning from first principles to production.

If you found this course useful, consider becoming a patron and supporter. Support Universitas Scholarium →

Module 1

The Feedforward Network

Led by Rumelhartian Parallel Distributed Processing Simulacrum

The question
Two linear layers (Y = W2(W1X)) are mathematically equivalent to one linear layer. Why does adding a ReLU activation break this equivalence — and why does this make deep networks capable of things shallow networks cannot do?

Outcome
The student can describe the feedforward architecture and build a two-hidden-layer network in Keras.
Sub-units
1. ○ 1.1 Why Nonlinearity?
Module 2

Backpropagation and Gradient Descent

Led by Rumelhartian Parallel Distributed Processing Simulacrum

The question
Backpropagation computes gradients for a million-weight network in two passes (one forward, one backward). What does the chain rule do — and what does Adam do differently from vanilla gradient descent?

Outcome
The student can implement gradient descent in NumPy and explain what Adam adds.
Sub-units
1. ○ 2.1 Train in NumPy
Module 3

Overfitting, Regularisation, and Validation

Led by Rumelhartian Parallel Distributed Processing Simulacrum

The question
Training accuracy 98%, validation accuracy 75% — the network has memorised the training data. Dropout zeros out neurons randomly. Early stopping halts when validation loss rises. What exactly does each technique do to the loss landscape?

Outcome
The student can identify overfitting from training curves and apply dropout and early stopping.
Sub-units
1. ○ 3.1 Overfit and Fix
Module 4

MNIST: The Standard Benchmark

Led by Rumelhartian Parallel Distributed Processing Simulacrum

The question
The correct MNIST workflow: normalise, split, train with early stopping, test once at the end. Why does evaluating on the test set multiple times invalidate the result — and what would a 97% test accuracy mean for a deployed digit recogniser?

Outcome
The student can implement the complete MNIST pipeline achieving >96% test accuracy.
Sub-units
1. ○ 4.1 Build the MNIST Network
Module 5

Business Case: Deep Learning End to End

Led by Rumelhartian Parallel Distributed Processing Simulacrum

The question
80% validation accuracy on a churn prediction model. A marketing manager asks: "what should I do with this?" How do you communicate a model's predictions, its limitations, and its actionable implications to a non-technical decision-maker?

Outcome
The student can implement and communicate an end-to-end deep learning business case.
Sub-units
1. ○ 5.1 Final Report: Business Case

COMP 2308 · Data Science: Deep Learning

The Feedforward Network

Backpropagation and Gradient Descent

Overfitting, Regularisation, and Validation

MNIST: The Standard Benchmark

Business Case: Deep Learning End to End