Overview
Getting Started
User Guides
API Reference
Development
Migration Guides
Spark SQL
Pandas API on Spark
Structured Streaming
MLlib (DataFrame-based)
Spark Streaming (Legacy)
MLlib (RDD-based)
Spark Core
Resource Management
Errors
pyspark.streaming.DStream.groupByKey
¶
DStream.
groupByKey
(
numPartitions
:
Optional
[
int
]
=
None
)
→ pyspark.streaming.dstream.DStream
[
Tuple
[
K
,
Iterable
[
V
]
]
]
[source]
¶
Return a new DStream by applying groupByKey on each RDD.
pyspark.streaming.DStream.glom
pyspark.streaming.DStream.groupByKeyAndWindow