pyspark.Broadcast#
- class pyspark.Broadcast(sc=None, value=None, pickle_registry=None, path=None, sock_file=None)[source]#
A broadcast variable created with
SparkContext.broadcast()
. Access its value throughvalue
.Examples
>>> b = spark.sparkContext.broadcast([1, 2, 3, 4, 5]) >>> b.value [1, 2, 3, 4, 5] >>> spark.sparkContext.parallelize([0, 0]).flatMap(lambda x: b.value).collect() [1, 2, 3, 4, 5, 1, 2, 3, 4, 5] >>> b.unpersist()
>>> large_broadcast = spark.sparkContext.broadcast(range(10000))
Methods
destroy
([blocking])Destroy all data and metadata related to this broadcast variable.
dump
(value, f)Write a pickled representation of value to the open file or socket.
load
(file)Read a pickled representation of value from the open file or socket.
load_from_path
(path)Read the pickled representation of an object from the open file and return the reconstituted object hierarchy specified therein.
unpersist
([blocking])Delete cached copies of this broadcast on the executors.
Attributes
Return the broadcasted value