Adding White Spaces to Data in Spark Dataframe

You may have a use-case where you want to make value in column either string or number to have the same length. we can use “lpad” and “rpad” functions to format strings & numbers properly.

For example, you might need numbers to have the same number of digits like for month should have 2 digits and add 0 if the month has only one digit.

lpad()

lpad function is used to add padding from the left side to string or number. This is useful in the example mentioned above where we would like to add 0 to the left of the month if it has one digit only.

lpad example
Lpad Example

In the above example, we have added 0 to the left side of the number to make it of 4 digits long in each case.

rpad()

In same way we can use rpad to add digits to right side of string or number.

rpad_example
Rpad Example

I hope you found this useful. If you have any questions do let me know. See you later.

ADDING SPACES DATA IN SPARK DATAFRAME

Similar Posts

Leave a Reply

Your email address will not be published. Required fields are marked *