Week 11: Spark
- Lecture notes:
-
Spark - Lecture notes
-
Other parallel computing frameworks - Lecture slides
(6 per page)
- Supplemental notes:
-
- Spark Documentation
-
Links to an extensive list of documentation, tutorial videos, exercises, and trainng materials.
- Intro to Apache Spark
-
ITAS Workshop: introduction, examples, software development lifecycle, case studies
- Hands-on Tour of Apache Spark in 5 Minutes
-
Hortonworks tutorial
- Buzzwords:
-
Reslilient Distributed Data (RDD), transformations, actions, job, task, executor, worker, cluster manager