Structured Streaming
Apache Spark Streaming Overview; Creating Streaming DataFrames; Transforming DataFrames; Executing
Review: Apache Spark on a Cluster; RDD Partitions; Example: Partitioning
Writing a Spark Application; Building and Running an Application; Application
Datasets and DataFrames; Creating Datasets; Loading and Saving Datasets; Dataset
Writing and Passing Transformation Functions; Transformation Execution; Converting Between RDDs
RDD Overview; RDD Data Sources; Creating and Saving RDDs; RDD
Querying DataFrames Using Column Expressions; Grouping and Aggregation Queries; Joining
Creating DataFrames from Data Sources; Saving DataFrames to Data Sources
What is Apache Spark?; Starting the Spark Shell; Using the
YARN Architecture; Working With YARN
Apache Hadoop Cluster Components; HDFS Architecture; Using HDFS
Apache Hadoop Overview; Data Processing; Introduction to the Hands-On Exercises