pyspark.sql.functions.
slice
Collection function: returns an array containing all the elements in x from index start (array indices start at 1, or from the end if start is negative) with the specified length.
New in version 2.4.0.
Changed in version 3.4.0: Supports Spark Connect.
Column
column name or column containing the array to be sliced
column name, column, or int containing the starting index
column name, column, or int containing the length of the slice
a column of array type. Subset of array.
Examples
>>> df = spark.createDataFrame([([1, 2, 3],), ([4, 5],)], ['x']) >>> df.select(slice(df.x, 2, 2).alias("sliced")).collect() [Row(sliced=[2, 3]), Row(sliced=[5])]