Example Curriculum
Available in
days
days
after you enroll
Available in
days
days
after you enroll
Available in
days
days
after you enroll
- DistilBert vs. Bert Differences (4:46)
- Embeddings In A Continuous Vector Space (7:40)
- Introduction To Positional Encodings (5:13)
- Positional Encodings - Part 1 (4:14)
- Positional Encodings - Part 2 (Even and Odd Indices) (10:10)
- Why Use Sine and Cosine Functions (5:08)
- Understanding the Nature of Sine and Cosine Functions (9:52)
- Visualizing Positional Encodings in Sine and Cosine Graphs (9:24)
- Solving the Equations to Get the Values for Positional Encodings (18:07)
Available in
days
days
after you enroll
- Introduction to Attention Mechanism (3:02)
- Query, Key and Value Matrix (18:10)
- Getting Started with Our Step by Step Attention Calculation (6:53)
- Calculating Key Vectors (20:05)
- Query Matrix Introduction (10:20)
- Calculating Raw Attention Scores (21:24)
- Understanding the Mathematics Behind Dot Products and Vector Alignment (13:32)
- Visualizing Raw Attention Scores in 2D (5:42)
- Converting Raw Attention Scores to Probability Distributions with Softmax (9:16)
- Normalization (3:19)
- Understanding the Value Matrix and Value Vector (9:07)
- Calculating the Final Context Aware Rich Representation for the Word "River" (10:45)
- Understanding the Output (1:58)
- Understanding Multi Head Attention (11:55)
- Multi Head Attention Example and Subsequent Layers (9:51)
- Masked Language Learning (2:29)
Available in
days
days
after you enroll