Analyticshut
  • Home
  • AWS
    • IAM
    • S3
  • Big Data
    • Spark
    • Hive
    • Sqoop
    • HDFS
  • Kafka

dataframe

Spark

Where and Filter in Spark Dataframes

By Mahesh Mogal | October 22, 2020 | November 25, 2024

In this blog, we will learn how to filter rows from a Spark DataFrame using the where and filter functions.

Spark

Distinct Rows and Distinct Count from Spark Dataframe

By Mahesh Mogal | October 20, 2020 | November 25, 2024

Getting distinct values from columns or rows is one of the most commonly used operations. We will learn how to get distinct values as well as the count of distinct values.

Spark

Sorting in Spark Dataframe

By Mahesh Mogal | October 13, 2020 | November 25, 2024

In this blog, we will learn how to sort rows in a Spark DataFrame based on the values of one or more columns.

Spark

Removing White Spaces From Data in Spark

By Mahesh Mogal | October 9, 2020 | November 25, 2024

White spaces can be a headache if they are not removed before processing data. We will learn how to remove spaces from data in Spark using built-in functions.

Spark

Adding White Spaces to Data in Spark Dataframe

By Mahesh Mogal | October 6, 2020 | November 25, 2024

In this blog, we will learn how to use the rpad and lpad functions to add padding to data in a Spark DataFrame.

Spark

Add, Rename, Drop Columns in Spark Dataframe

By Mahesh Mogal | September 14, 2020 | November 25, 2024

We will go through common column operations such as adding, renaming, listing, selecting, and dropping columns in a Spark DataFrame.

© 2025 Analyticshut
