Part 1: Fundamentals

Module 01

What is Artificial Intelligence?

01 · Foundations

Defining AI

Artificial Intelligence (AI) is the ability of a computer system to perform tasks that, when done by humans, would require intelligence — things like recognising speech, understanding language, making decisions, or spotting patterns in data.

The word "artificial" simply means it's built by humans. The word "intelligence" is where it gets interesting. Researchers disagree on exactly what intelligence means, but for practical purposes, AI is any system that can perceive its environment, reason about it, and act in ways that achieve some goal.

Key distinction: AI does not need to "think" the way humans do. It just needs to produce useful outputs — a good translation, a correct diagnosis, a recommended song.

The term was coined in 1956 by computer scientist John McCarthy at the Dartmouth Conference, but the ideas behind it go back centuries — to philosophers asking what it means to think, and to mathematicians like Alan Turing who asked whether machines could ever be intelligent.

01 · Foundations

A Simple Analogy

Think of teaching a child to recognise a dog. You don't write a rule book ("four legs, fur, barks"). You show hundreds of examples: "this is a dog, this is a cat, this is a dog…" The child learns the pattern on their own.

Modern AI works almost the same way. Instead of being hand-programmed with rules, it's shown millions of examples and finds the patterns itself. That's the core idea behind machine learning (ML) — the dominant form of AI today.

Figure 1: Traditional programming requires humans to write the rules explicitly. Machine learning discovers the rules automatically from data.

Module 02

How Machines Learn

02 · Core Concept

The Learning Loop

Machine learning is the process by which AI systems improve through experience. Instead of following hand-written rules, an ML model finds patterns in data. Here's how the cycle works:

Figure 2: A typical supervised machine learning pipeline — from raw data collection through to a deployed model. The "Label Data" step is specific to supervised learning; unsupervised approaches skip it.

The key ingredient is data. The more good-quality examples a model sees, the better it learns. This is why tech companies invest so heavily in collecting data — it's the raw material of AI.

Three main types of learning:

Supervised learning — The model learns from labelled examples (input + correct answer). Most common. Used for image recognition, spam filtering, disease diagnosis.

Unsupervised learning — The model finds hidden patterns in unlabelled data. Used for customer segmentation, anomaly detection.

Reinforcement learning — The model learns by trial and error, receiving rewards for good actions. Used for game-playing AI (like AlphaGo) and robotics.

Figure 3: Supervised learning uses labelled data; unsupervised learning discovers structure on its own.

02 · Supervised ML

Supervised Learning — A Closer Look

Supervised learning is by far the most widely used form of ML today. The word "supervised" refers to the fact that a human has provided the correct answers — the labels — for every training example. The algorithm's job is to learn the mapping from inputs to those labels well enough to predict labels on new, unseen inputs.

Think of it like studying with an answer key. You practise on thousands of exam questions where you already know the correct answer. After enough practice you can answer new questions you've never seen before.

The core assumption of supervised learning: The patterns that exist in the labelled training data also exist in the real world. If your training data is unrepresentative, biased, or out of date, the model will inherit those flaws — no matter how sophisticated the algorithm.

Labels, Features & Examples — The Core Vocabulary

Every supervised learning problem is built from the same three building blocks. Understanding these terms precisely will make every other concept in this course easier to follow:

Label (y)

The thing you're trying to predict. For spam detection it's spam / not spam. For house prices it's the sale price. For weather forecasting it's tomorrow's temperature. Every training row has a known label; in production the model must predict it.

Features (x₁, x₂ … xₙ)

The input variables the model uses to make its prediction. For spam: word count, sender address, links present. For house prices: square footage, bedrooms, neighbourhood. The model learns which features matter most and by how much.

Example

A single row of data — one instance with its features and (during training) its label. Labelled examples train the model. Unlabelled examples are what the deployed model must predict on in the real world.

Table 1: A supervised learning dataset. Each row is an example. Green columns are features; the gold column is the label. The last row is unlabelled — what the trained model must predict.

Framing questions to ask before building any ML model:
• What exactly is the label — and can I collect it reliably?
• What features are available at prediction time (not just at training time)?
• Is this regression (predicting a number) or classification (predicting a category)?
• How much labelled data do I have — is it enough?

Regression vs Classification

The two most common supervised learning task types — differing only in what kind of label you're predicting:

REGRESSION

Predict a continuous number

Examples: house price, tomorrow's temperature, expected exam score. Output is a real number on a continuous scale. Evaluated with metrics like MSE or MAE (Mean Absolute Error).

CLASSIFICATION

Predict a category

Examples: spam/not spam, cat/dog/bird, disease present/absent. Output is one of a fixed set of classes. Binary = 2 classes; multi-class = 3+. Evaluated with accuracy, precision, recall.

How the Supervision Signal Works

During training, the model makes a prediction on each labelled example. It then compares its prediction to the true label and receives a supervision signal — a measure of how wrong it was. This signal is used to adjust the model's internal parameters so the next prediction is a little better. Repeat this millions of times and the model gradually improves.

Figure 4: The supervised learning feedback loop. The model predicts, the loss measures error against the true label, and the weights are updated. This cycle repeats until loss is minimised.

Real-World Supervised Learning Examples

Spam Detection

Label: spam / not spam
Features: words in subject, sender, links present
Training data: millions of emails manually tagged by users clicking "Mark as spam"

Medical Image Diagnosis

Label: tumour present / absent
Features: pixel intensities in an X-ray or MRI
Training data: scans labelled by radiologists over years of clinical practice

Machine Translation

Label: correct translation in target language
Features: words and context in source language
Training data: millions of human-translated document pairs

Music Recommendation

Label: did the user play / skip this song?
Features: listening history, song tempo, genre, time of day
Training data: billions of play/skip events from real users

Speech Recognition

Label: the correct text transcript
Features: audio waveform frequencies over time
Training data: thousands of hours of recorded speech paired with human transcripts

Self-Driving Perception

Label: object type (car / pedestrian / sign)
Features: camera pixels, LiDAR point clouds
Training data: millions of manually annotated driving video frames

Data quality trumps data quantity. Google's ML course emphasises that a model trained on 10,000 carefully cleaned, representative, correctly-labelled examples will almost always outperform one trained on 1,000,000 noisy, mislabelled, or biased ones. Garbage in, garbage out — this is ML's most reliable law. (Google ML Crash Course, Data Preparation module)

When Supervised Learning Works Best

Supervised learning is a good fit when:

Labels are available and reliableYou have historical data where the correct answer is known — past sales, medical records with confirmed diagnoses, emails users already tagged.
The input-output relationship is stableThe patterns in training data will still hold when the model is deployed. A model trained on pre-2020 customer behaviour may not predict post-pandemic behaviour well.
Enough labelled examples existSimple models may need thousands of examples; deep learning models may need millions. Rare events (rare diseases, rare fraud types) are harder to learn because labelled cases are scarce.
Watch out: label leakageIf a feature in your training data accidentally encodes the answer (e.g., a "date of death" field in a mortality prediction model), the model learns a shortcut that won't exist at deployment time.

02 · Unsupervised ML

Unsupervised Learning — Finding Hidden Structure

In unsupervised learning there are no labels. The algorithm is given raw data and asked to find structure on its own — groupings, patterns, compressions, or anomalies — without being told what to look for. This mirrors how humans often learn: by observing the world and finding categories without an explicit teacher.

Unsupervised learning is harder to evaluate (there's no "correct answer" to check against), but it's enormously useful when labelling is expensive, impossible, or when you want to discover something genuinely unknown in your data.

The Four Main Unsupervised Techniques

CLUSTERING

Group similar things together

Algorithms like K-Means partition data into K groups so that items in the same group are more similar to each other than to items in other groups. Used for: customer segmentation, document grouping, image compression, gene expression analysis.

DIM. REDUCTION

Compress data while preserving structure

Techniques like PCA and t-SNE take high-dimensional data (e.g. 1,000 features) and represent it in 2–3 dimensions while preserving meaningful relationships. Used for: data visualisation, noise removal, preprocessing before supervised learning.

ANOMALY DETECTION

Find the unusual

Learn what "normal" looks like from unlabelled data, then flag anything that deviates significantly. Used for: credit card fraud detection, network intrusion detection, manufacturing defect spotting, medical outlier detection.

GENERATIVE

Learn the data distribution

Models like VAEs (Variational Autoencoders) and GANs (Generative Adversarial Networks) learn the underlying distribution of training data well enough to generate new examples. The foundation of AI image generation, text generation, and drug molecule design.

Figure 5: K-Means clustering demo. Click "Next Step" to walk through each stage — placing centroids, assigning points, and moving centroids until clusters are discovered.

Dimensionality Reduction — Seeing the Invisible

Real datasets often have hundreds or thousands of features. Dimensionality reduction compresses this into 2–3 dimensions so humans (and other algorithms) can understand it. The key insight is that most high-dimensional data actually lies on a much lower-dimensional manifold — the apparent complexity is mostly redundancy.

Figure 6: Dimensionality reduction with PCA or t-SNE. 1,000 features are compressed to 2 dimensions, making hidden cluster structure immediately visible.

Supervised vs Unsupervised — when to use which:

Use supervised learning when you have labelled data and a clear prediction target — classifying emails, forecasting sales, diagnosing diseases.

Use unsupervised learning when labels don't exist or are too expensive to create — exploring a new dataset, discovering customer segments, detecting novel fraud patterns, compressing data for visualisation.

In practice, many real systems use both: unsupervised techniques to explore and preprocess data, followed by supervised learning for the final prediction task.

Module 08

Types of AI Systems

08 · Taxonomy

Narrow AI vs General AI

AI systems are often categorised by their breadth — the range of tasks they can handle.

TODAY

Narrow AI (ANI)

Designed for one specific task. Excellent at it, but can't do anything else. Every AI you use today is narrow AI — chess engines, spam filters, image classifiers, voice assistants, language models.

HYPOTHETICAL FUTURE

General AI (AGI)

A system that can learn and perform any intellectual task a human can. Does not yet exist. Researchers debate whether it is possible, and if so, when it might arrive.

FAR FUTURE / THEORETICAL

Super Intelligence (ASI)

An AI that surpasses the best human intellect in every domain. A theoretical concept, not something that exists today. Often the subject of philosophical and safety debates.

CURRENT FRONTIER

Large Language Models

AI systems trained on vast amounts of text. They can write, translate, summarise, code, reason about language. Examples: GPT-4, Claude, Gemini. A type of narrow AI, but with surprisingly broad language capabilities.

Important: Despite what movies suggest, AI today has no desires, feelings, consciousness, or goals of its own. An AI that writes emotional poetry is not feeling those emotions — it's producing statistically likely text based on training data.

08 · By Technique

Key AI Techniques You'll Encounter

NLP

Natural Language Processing (NLP) helps computers understand and generate human language. Powers search engines, chatbots, translation services, autocomplete, and document summarisation.

Computer Vision

Enables machines to interpret images and video. Used in facial recognition, medical imaging, self-driving cars, quality control in manufacturing, and satellite analysis.

Generative AI

Creates new content — text, images, music, video, code. Models like DALL·E, Stable Diffusion, and ChatGPT are examples. Learns distributions in data and samples from them.

Reinforcement Learning

An agent learns by taking actions in an environment and receiving rewards. Achieved superhuman performance in chess, Go, and many video games. Also used in robotics and recommendation systems.

Recommendation

Predict what a user will like based on past behaviour and similar users. Power Netflix, Spotify, Amazon, and TikTok. Use techniques like collaborative filtering, matrix factorisation, and increasingly, deep learning.

Anomaly Detection

Identifies unusual patterns that deviate from the norm. Used for fraud detection, network intrusion, manufacturing defects, and medical outliers. Trains on "normal" data, then flags anything statistically surprising.

Module 09

AI in Your Daily Life

09 · Applications

AI Is Already Everywhere

You interact with AI dozens of times every day — often without realising it. Here's a map of where AI shows up in a typical day:

7:00 AM

Your phone alarm wakes you. Snooze or not?

Smartphone features like wake word detection ("Hey Siri") use neural networks trained on millions of voice samples.

7:15 AM

You check social media

The content you see is curated by recommendation algorithms that model your past behaviour and predict what will keep you engaged.

8:00 AM

You navigate to work

Google Maps and Waze use ML to predict real-time traffic, estimate arrival times, and reroute you dynamically.

9:00 AM

You check email

Spam filters use Bayesian classifiers and neural networks to block ~99.9% of spam before it reaches your inbox. (Google Workspace, 2023)

12:00 PM

You shop online

"Customers also bought…" recommendations are powered by collaborative filtering algorithms that analyse millions of purchase patterns.

8:00 PM

You stream a movie

Netflix, Spotify, and YouTube use sophisticated recommendation engines that learn your taste and serve personalised content.

11:00 PM

Your bank monitors for fraud

In the background, ML models analyse every transaction you make in milliseconds, flagging anomalies that suggest fraud.

Key insight: Most AI you encounter is invisible. It's not a robot or a chatbot — it's a layer of prediction and optimisation woven into software you already use.

Module 10

Using AI Responsibly

10 · Ethics & Safety

Why AI Can Go Wrong

AI systems are only as good as the data they're trained on and the goals they're given. Several well-documented failure modes deserve your attention as an AI user and citizen.

Bias in AI

If training data reflects historical inequalities, the model will perpetuate them. A famous example: early facial recognition systems had much higher error rates for darker skin tones because training datasets were skewed toward lighter-skinned faces.

Lighter skin ♂

94% acc.

Lighter skin ♀

87% acc.

Darker skin ♂

68% acc.

Darker skin ♀

35% acc.

Figure 7: Illustrative data based on MIT Media Lab's "Gender Shades" study (Buolamwini & Gebru, 2018). Early commercial facial recognition systems showed large disparities in accuracy.

Hallucinations

Large language models can generate confident-sounding but factually wrong information — a phenomenon called hallucination. They predict likely-sounding text, not necessarily true text. Always verify important claims from an AI chatbot using authoritative sources.

Privacy & Data

Many AI services are trained on or learn from user data. When you use a free AI product, consider: what data is collected, who owns it, how it's stored, and whether it's used to train future models.

10 · Best Practices

Your Responsible AI Checklist

Here are practical habits for anyone using AI tools today:

Verify important outputsAI-generated text, code, medical or legal information should always be cross-checked. Use AI as a first draft or starting point, not a final source.
Understand the limitsKnow what the AI you're using is designed to do. A language model is not a search engine; a recommendation system is not an objective advisor.
Protect your dataDon't share sensitive personal information (passwords, financial details, medical records) with AI chatbots unless you've reviewed the platform's privacy policy.
Watch for biasWhen using AI for hiring, healthcare, credit decisions, or any high-stakes domain, ask whether the system has been audited for bias and what redress exists if it makes a mistake.
Be transparent about AI useIn academic, professional, or creative work, disclose when AI tools contributed substantially. This builds trust and supports honest norms around AI.
Think about wider impactAI systems have environmental costs (energy for training), economic effects (job displacement), and societal implications. Informed citizens engage with these policy questions.

Module 11

The Future of AI

11 · Looking Ahead

What's Coming — and What's Already Here

AI is developing faster than almost any technology in history. Here's a look at where things stand and where they're heading:

2012

Deep Learning Revolution

AlexNet wins ImageNet challenge by a massive margin, sparking the deep learning era.

2017

Transformer Architecture

Google publishes "Attention Is All You Need", the paper behind modern language models.

2020

GPT-3 & Foundation Models

Large language models demonstrate surprisingly broad capability from a single pre-trained model.

2022–23

AI Goes Mainstream

ChatGPT reaches 100M users in 2 months. (Reuters, Feb 2023) DALL·E, Stable Diffusion, Midjourney democratise image generation.

2024–25

Agents & Reasoning

AI systems can plan multi-step tasks, browse the web, write and execute code, and collaborate with other AI agents.

Near future

AI in Science & Medicine

AlphaFold already solved protein folding. AI drug discovery, personalised medicine, and climate modelling are accelerating.

Open questions

Alignment & Safety

How do we ensure AI systems reliably pursue the goals we actually want? This is one of the most important research areas today.

The most important thing you can do: Stay informed, stay curious, and engage with AI as an active and thoughtful participant — not just a passive user. The decisions being made now about how to build and regulate AI will shape society for decades.

Test Yourself

Part 1 Quiz

5 questions · 20 points each · 100 total.

Ready?

5 questions · 20 points each

Well done!

You have covered AI fundamentals. Ready to go deeper?

Part 2: How Models Work →