Fundamentals of Apache Spark and PySpark
Introduction
Introduction (7:29)
Exercise: Meet Your Classmates and Instructor
Course Resources
Setup and Useful Resources
[Optional] UNIX CLI Commands
[Optional] Using Windows
Installing Software for the Course
[Optional] What Is a Virtualenv? (6:36)
Big Data Processing with Apache Spark
Apache Spark (3:43)
How Spark Works (4:23)
Spark Application (7:40)
DataFrames (6:42)
Installing Spark (5:50)
Installing Spark on Linux
Inside Airbnb Data (7:01)
Writing Your First Spark Job (7:04)
Lazy Processing (2:16)
[Exercise] Basic Functions (1:28)
[Exercise] Basic Functions - Solution (6:41)
Aggregating Data (3:59)
Joining Data (4:39)
Aggregations and Joins with Spark (6:09)
Complex Data Types (5:08)
[Exercise] Aggregate Functions (0:49)
[Exercise] Aggregate Functions - Solution (5:53)
User Defined Functions (3:25)
Data Shuffle (6:13)
Data Accumulators (3:41)
Optimizing Spark Jobs (7:38)
Submitting Spark Jobs (4:28)
Other Spark APIs (5:15)
Spark SQL (4:32)
[Exercise] Advanced Spark (2:10)
[Exercise] Advanced Spark - Solution (5:25)
Summary (3:07)
Where To Go From Here?
Let's Keep Learning Together! (1:05)
Review This Byte!
How Spark Works