Code: EAS-019
Duration: 8 hours
Duration: 8 hours
Description
This is a training about Impala for developers covering the full stack of technical features, architecture and performance tuning. Impala supports analysis of large datasets stored in HDFS and compatible file systems, providing an SQL-like language. Other features of Impala include:- Indexing to provide acceleration
- Different storage types such as plain text, RCFile, HBase, ORC, and others.
- Metadata storage in an RDBMS
- Operating on compressed data stored into the Hadoop ecosystem
- SQL-like queries
Roadmap
- What is Impala
- Architecture
- Impala services
- Impala DDL
- Data Types
- “Select” Queries
- DML / Load data
- Hive UDFs types
- Indexes
- Performance tuning
- Hive vs Impala
Objectives
- Developing expertise in the area of Big Data
- Data modal design in Impala
- Developing SQL scripts
- Practical experience in queries and performance tuning
Target Audience
- Developers
- QA
- Analysts
Prerequisites
- Hadoop fundamentals
- ANSI SQL 92