Reading Parquet and ORC data in Spark
In this blog, we are going to learn about reading parquet and orc data in Spark. Both file formats are columnar and store schema information, making it easy to work with them.
In this blog, we are going to learn about reading parquet and orc data in Spark. Both file formats are columnar and store schema information, making it easy to work with them.
We will learn about reading JSON data in Spark. We will also go through most used options provided by spark while working with JSON data.
In this blog, we are going to lean on how to read CSV data in Spark. We will also go through options to deal with common pitfalls while reading CSVs.