DEV Community

Gary Jackson profile picture

Gary Jackson

I'm a software engineer based in Brisbane. I like understanding things at the implementation level, which usually means building them from scratch to see how they actually work. Right now that's look

Location Brisbane Joined Joined on  Personal website https://www.garyjackson.dev/ github website
Chapter 10: Multi-Head Attention and the MLP Block

Chapter 10: Multi-Head Attention and the MLP Block

Comments
7 min read
Chapter 9: Single-Head Attention - Tokens Looking at Each Other

Chapter 9: Single-Head Attention - Tokens Looking at Each Other

Comments
9 min read
Chapter 8: RMS Normalisation and Residual Connections

Chapter 8: RMS Normalisation and Residual Connections

Comments
4 min read
Chapter 7: The Training Loop and Adam Optimiser

Chapter 7: The Training Loop and Adam Optimiser

Comments
7 min read
Chapter 6: Embeddings, the Forward Pass, and the Loss Function

Chapter 6: Embeddings, the Forward Pass, and the Loss Function

Comments
7 min read
Chapter 5: Linear Transformation and Softmax

Chapter 5: Linear Transformation and Softmax

Comments
4 min read
Chapter 4: The Bigram Model - Simplest Possible Language Model

Chapter 4: The Bigram Model - Simplest Possible Language Model

Comments
5 min read
Chapter 3: The Tokenizer - Text to Numbers and Back

Chapter 3: The Tokenizer - Text to Numbers and Back

Comments
2 min read
Chapter 2: Backward - Automatic Gradient Computation

Chapter 2: Backward - Automatic Gradient Computation

Comments
7 min read
Chapter 1: The Value Class - Recording the Forward Pass

Chapter 1: The Value Class - Recording the Forward Pass

Comments
10 min read
Chapter 0: Project Setup

Chapter 0: Project Setup

1
Comments
4 min read
Building a GPT From Scratch in C# - Introduction

Building a GPT From Scratch in C# - Introduction

1
Comments
3 min read
loading...