- Introduction to Hive
- Hive Data Types and DDL
- Hive DML and HiveQL
- Hive Queries and Views
- Hive Indexing and Tuning
- Compression with Hive
- UDFs in Hive
- Customizing Hive File and Record Formats
- Hive Storage Handlers and NoSQL
1. Introduction to Hive
- What is Big data
- What is Apache Hadoop
- HDFS in Hadoop
- Data Processing Framework In Hadoop
- Hive Use cases @ Twitter
- Hive vs Mapreduce
- What is Hive
- Advantages of hive
- Hive Architecture
2. Hive Data Types and DDL
- Hive Data Types
- Hive QL
- DDL on Databases
- DDL on Tables
- Different Tables in Hive
- Different Tables in Hive Contd
- Advanced DDL on tables
- File Formats in Hive
3. Hive DML and HiveQL
- DML - Loading Data in tables
Managing Output
Hive QL Queries - Operators and Functions in hive
- Hive Clauses
4. Hive Queries and Views
- Joins in Hive
- Wordcount Example using Hive
- Hive View
- Clauses in Hive
5. Hive Indexing and Tuning
- Hive Indexing
- Indexing with Additional Properties
- Tuning
- Executive Hive Queries in Different Modes
- Hadoop Tuning Parameters
6. Compression with Hive
- Compression with Hive
- Choosing a compression codec
- Sequence Files
- Sequence Files with Different Compression Types
- Running Compression with Hive Queries
7. UDFs in Hive
- Functions in Hive
- Aggregation, Calling, table generating Functions
- Different Kinds of UDFs
- Creating and Calling UDFs in Hive
- Hive UDAFs
8. Customizing Hive File and Record Formats
- TextFile and Sequence File Formats
- RCFile Format
- CSV and JSON SerDe
- Hive Avro SerDe
9. Hive Storage Handlers and NoSQL
- Hive Storage Handlers
- HBase