Partitioning in Hive
Using partitioning, we can increase Hive query performance. But if we do not choose partitioning…
Adding Custom Schema to Spark DataFrame
We will learn how to specify our custom schema with column names and data types…
Reading data from a file in Spark
We will learn how to load data from JSON, CSV, TSV, pipe-delimited or any…
Hive Data Manipulation – Loading Data to Hive Tables
We will learn how to load and populate data into a Hive table. We will also…
Create, Alter, Delete Tables in Hive
We will learn how to create Hive tables, alter table columns, add comments and…
Creating Database in Hive
We will learn how to create databases in Hive with simple operations like listing databases,…
Data Types in Hive
Like SQL, Hive supports multiple data types. On top of that, there are multiple complex…
External vs. Internal (Managed) Tables in Hive
Hive has two types of tables, external and managed. In this blog, we will learn…
Set, Get and Delete AWS S3 bucket policies
In this blog, we are going to learn how to get, put and delete S3…