Apache Spark Fundamentals
Duration
24 hours
Location
Online
Language
English
Code
EAS-017
€ 550 *
Training for 7-8 or more people? Customize trainings for your specific needs
Description
This training covers the key concepts and methods for developing data processing applications with Apache Spark. We’ll look at the RDD-based framework for automated distributed code execution and at companion projects in different paradigms: Spark SQL, Spark Streaming, MLlib, Spark ML, and GraphX.
After completing the course, a certificate is issued on the Luxoft Training form.
Objectives
- Understand core Spark concepts and architecture
- Write data processing pipelines using simple and pair RDDs
- Write data processing programs using DataFrames
- Write stream processing programs using DStreams
- Utilize pre-packaged machine learning and graph analysis algorithms
- Move data between Spark and external systems (Kafka, Cassandra)
Target Audience
- Software developers
- Software architects
Roadmap
- Spark concepts and architecture
- Programming with RDDs: transformations and actions
- Using key/value pairs
- Loading and storing data
- Accumulators and broadcast variables
- Spark SQL, DataFrames, Datasets
- Spark Streaming
- Machine Learning using MLlib and Spark ML
- Graph analysis using GraphX