Hadoop Fundamentals
This training covers the key concepts and methods for developing data processing applications with Apache Hadoop.
Duration
24 hours
Location
Online
Language
English
Code
EAS-015
Schedule and prices
€ 500
Training for 7-8 or more people? Customize trainings for your specific needs

Description

Apache Hadoop is an open-source framework for storing and processing large datasets efficiently. It clusters multiple computers so that huge datasets can be analyzed in parallel. We'll look at HDFS, the de facto standard for robust, large-scale, long-term data storage; the MapReduce framework for automated distributed code execution; and companion projects from the Hadoop ecosystem.
After you complete the course, a certificate is issued on the Luxoft Training form.
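
As a rough illustration of the MapReduce model mentioned in the description, the following plain-Java sketch imitates the map, shuffle, and reduce phases of a word count inside a single process. It does not use the Hadoop API: the class name WordCountSketch and its method names are hypothetical and chosen only for this example. A real Hadoop job would run these phases across the machines of a cluster.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical single-process sketch of the MapReduce model (no Hadoop dependency).
class WordCountSketch {

    // Map phase: emit a (word, 1) pair for every word in one input line.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String word : line.toLowerCase().split("\\s+")) {
            if (!word.isEmpty()) {
                pairs.add(Map.entry(word, 1));
            }
        }
        return pairs;
    }

    // Shuffle phase: group all emitted values by key, with keys kept in sorted order.
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>())
                   .add(pair.getValue());
        }
        return grouped;
    }

    // Reduce phase: sum all values collected for one key.
    static int reduce(String word, List<Integer> counts) {
        int sum = 0;
        for (int count : counts) {
            sum += count;
        }
        return sum;
    }

    // Driver: run map over every line, then shuffle, then reduce each group.
    static Map<String, Integer> run(List<String> lines) {
        List<Map.Entry<String, Integer>> emitted = new ArrayList<>();
        for (String line : lines) {
            emitted.addAll(map(line));
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> group : shuffle(emitted).entrySet()) {
            result.put(group.getKey(), reduce(group.getKey(), group.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        // prints {big=2, cluster=1, data=2, node=1}
        System.out.println(run(List.of("big data big cluster", "data node")));
    }
}
```

In this sketch the three phases are ordinary method calls. In Hadoop, the framework itself handles input splitting, the shuffle between machines, and retries of failed tasks, so the developer supplies only the map and reduce logic.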

Objectives

  • Understand core Hadoop concepts and architecture
  • Design data models for Hadoop
  • Write HiveQL queries using basic types and collections
  • Access Hadoop from Java programs
  • Be aware of ORM-like libraries/frameworks for Hadoop

Target Audience

  • Software developers
  • Software architects
  • Database designers
  • Database administrators

Prerequisites

  • Basic Java programming skills
  • Unix/Linux shell familiarity
  • Experience with databases is optional

Roadmap

  • Core Hadoop concepts
  • Installing and configuring Hadoop locally and in the cloud
  • HDFS architecture, replication, reads and writes
  • HDFS commands
  • MapReduce (MRv1) program structure
  • Data formats for MapReduce
  • YARN architecture
  • Job execution in MRv1 and in YARN
  • Distributed cache and counters
  • Hadoop Streaming
  • Hadoop Ecosystem and Vendors
  • Introduction to Pig
  • Introduction to Hive
  • Introduction to Sqoop
  • Introduction to Flume
  • Introduction to Spark
  • Introduction to Mahout
Courses you may be interested in
Data Warehouse Fundamentals
Understand current approaches to designing data warehouses and using them in heterogeneous enterprise information systems.
Modern Data Management Approaches
This training provides an overview of modern methods for data storage, including key-value stores, document-oriented database management systems, and distributed data storage and processing systems.
BigData SQL Hive
This training is aimed at developers and covers the full range of technical features, architecture, and performance tuning. Apache Hive supports analysis of large datasets stored in Hadoop's HDFS and compatible file systems. It provides an SQL-like language with schema on read and transparently converts queries to MapReduce jobs.
Your benefits
Expertise
Our trainers are industry experts involved in software development projects
Live training
Facilitated online so that you can interact with the trainer and other participants
Practice
A focus on helping you practice your new skills