I have a data frame in pyspark with over 300 columns. There are several null columns in these columns.
For instance:
Column_1 column_2 null null null null 234 null 125 124 365 187 and so on
When I want to make the sum of column_1, I get zero as a result instead of 724.
Now I want to replace the zero in all columns of the data frame with empty space. Therefore, when I try to make the sum of these columns, I do not get a null value, but I get a numerical value.
How can we achieve this in pyspark
user7543621
source share