Overview
The Apache Impala training course prepares you to query your organization’s big data storage, whether in HDFS or HBase, efficiently and with quick turnaround times.
Objectives
At the end of the Apache Impala training course, participants will be able to:
Prerequisites
- Knowledge of the Apache Hadoop ecosystem and SQL is required.
- A basic understanding of database administration is helpful.
Course Outline
- What is Impala
- Benefits of Impala
- Exploratory Business Intelligence
- Impala Installation
- Starting and Stopping Impala
- Data Storage
- Managing Metadata
- Controlling Access to Data
- Impala Shell Commands and Interface
- Querying with Hive and Impala
- SQL Language Statements
- DDL Statements
- DML Statements
- CREATE DATABASE
- CREATE TABLE
- CREATE TABLE – Examples
- Internal and External Tables
- Loading Data into Impala Table
- ALTER TABLE
- DROP TABLE
- DROP DATABASE
- DESCRIBE Statement
- EXPLAIN Statement
- SHOW TABLES Statement
- INSERT Statement
- INSERT Statement – Examples
- SELECT Statement
- Data Types
- Operators
- Functions
- CREATE VIEW in Impala
- Hive and Impala Query Syntax
- Partitioning Tables
- SQL Statements for Partitioned Tables
- File Format and Performance Considerations
- Choosing File Type and Compression Technique
- Impala Architecture
- Impala Daemon
- Impala Statestore
- Impala Catalog Service
- Query Execution Flow in Impala
- User-Defined Functions
- Hive UDFs with Impala
- Demo – UDF in Impala
- Improving Impala Performance