Using Partitioning, We can increase hive query performance. But if we do not choose partitioning column correctly it can create small file issue.
We will learn how to specify our custom schema with column names and data types for Spark data frames.
We will learn how to load data from JSON, CSV, TSV, Pipe Delimited or any other type for delimited file to spark Dataframe.
We will learn how to load and populate data to hive table. We will also learn how to copy data to hive tables from local …
We will learn how to create Hive tables, also altering table columns, adding comments and table properties and deleting Hive tables.
We will learn how to create databases in Hive with simple operations like listing database, setting database location in HDFS & deleting database.
Hive supports multiple data types like SQL. On top of that, there are multiple complex data types in hive which makes it easy to process …
Hive has two types of tables, external and managed. In this blog, we will learn about them and decide which use case is suitable for …
In this blog, we are going to learn how to get, put and delete S3 bucket policies suing S3 Console as well as programmatically using …