Bucketing in Hive
With Bucketing in Hive, we can group similar kinds of data and write it to…
Alter Table Partitions in Hive
We have created partitioned tables and inserted data into them. Now, we will learn how to…
Static vs Dynamic Partitioning in Hive
Hive supports static and dynamic partitions. Let us understand the difference between them and…
Partitioning in Hive
Using partitioning, we can increase Hive query performance. But if we do not choose partitioning…
Adding Custom Schema to Spark Dataframe
We will learn how to specify our custom schema with column names and data types…
Reading data from a file in Spark
We will learn how to load data from JSON, CSV, TSV, Pipe Delimited or any…
Hive Data Manipulation – Loading Data to Hive Tables
We will learn how to load data into Hive tables. We will also…
Create, Alter, Delete Tables in Hive
We will learn how to create Hive tables, alter table columns, add comments and…
Creating Database in Hive
We will learn how to create databases in Hive with simple operations like listing databases,…
Data Types in Hive
Like SQL, Hive supports multiple data types. On top of that, there are multiple complex…
External vs Internal (Managed) Tables in Hive
Hive has two types of tables, external and managed. In this blog, we will learn…
Set, Get and Delete AWS S3 bucket policies
In this blog, we are going to learn how to get, put and delete S3…