pyspark.sql.functions.array_repeat
Collection function: creates an array containing a column repeated count times.
New in version 2.4.0.
Parameters
col : Column or str
    column name or column that contains the element to be repeated
count : Column or str or int
    column name, column, or int containing the number of times to repeat the first argument
Examples
>>> from pyspark.sql.functions import array_repeat
>>> df = spark.createDataFrame([('ab',)], ['data'])
>>> df.select(array_repeat(df.data, 3).alias('r')).collect()
[Row(r=['ab', 'ab', 'ab'])]
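
The per-row behavior described above can be modeled in plain Python, which may help when reasoning about the difference between a fixed integer count and a count taken from another column. This is only an illustrative sketch: the helper `array_repeat_model` and the sample column `n` are hypothetical names introduced here, rows are represented as plain dicts, and nothing below reflects how Spark implements the function.

```python
def array_repeat_model(rows, col, count):
    """Plain-Python model of array_repeat (hypothetical helper, not a Spark API).

    rows  : list of dicts, one dict per row
    col   : key of the element to repeat
    count : an int (same repeat count for every row) or the key of
            another field that holds a per-row repeat count
    """
    result = []
    for row in rows:
        # An int is used as-is; a string is treated as a column name.
        n = count if isinstance(count, int) else row[count]
        result.append([row[col]] * n)
    return result


# Sample data with an assumed extra column "n" holding per-row counts.
rows = [{"data": "ab", "n": 3}, {"data": "cd", "n": 1}]

fixed = array_repeat_model(rows, "data", 2)      # same count for every row
per_row = array_repeat_model(rows, "data", "n")  # count read from column "n"

print(fixed)    # [['ab', 'ab'], ['cd', 'cd']]
print(per_row)  # [['ab', 'ab', 'ab'], ['cd']]
```

Assuming a DataFrame df with columns data and n, the corresponding PySpark calls would be array_repeat(df.data, 2) and array_repeat(df.data, df.n).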