Assuming you want to add a column named new to your DataFrame df , created by calling numpy.random.normal several times, you can do:
import numpy from pyspark.sql.functions import UserDefinedFunction from pyspark.sql.types import DoubleType udf = UserDefinedFunction(numpy.random.normal, DoubleType()) df_with_new_column = df.withColumn('new', udf())
source share