Contents:
pyspark.SparkContext
Main entry point for Spark functionality.
pyspark.RDD
A Resilient Distributed Dataset (RDD), the basic abstraction in Spark.
pyspark.sql.SQLContext
Main entry point for DataFrame and SQL functionality.
pyspark.sql.DataFrame
A distributed collection of data grouped into named columns.
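As a rough illustration of how these four classes relate, the sketch below creates a SparkContext, builds an RDD from a local list, and converts it to a DataFrame through a SQLContext. It assumes a local Spark installation with Java available. The function name `run_demo`, the app name, and the sample words are hypothetical and chosen only for this example. Newer Spark releases generally favor SparkSession as the entry point, so treat this as a sketch of the classes listed above rather than a recommended setup.

```python
def run_demo():
    """Return the sample words longer than three characters, sorted."""
    # Imports live inside the function so this module loads even where
    # Spark is not installed (an assumption made for this sketch).
    from pyspark import SparkContext
    from pyspark.sql import SQLContext

    # SparkContext: entry point for core Spark functionality.
    # "local[1]" runs Spark in-process with a single worker thread.
    sc = SparkContext("local[1]", "contents-demo")
    try:
        # RDD: distribute a small local list and pair each word with its length.
        words = sc.parallelize(["spark", "rdd", "dataframe"])
        pairs = words.map(lambda w: (w, len(w)))

        # SQLContext: entry point for DataFrame and SQL functionality.
        sql_context = SQLContext(sc)

        # DataFrame: the same data, now grouped into named columns.
        df = sql_context.createDataFrame(pairs, ["word", "length"])
        rows = df.filter(df["length"] > 3).select("word").collect()
        return sorted(row.word for row in rows)
    finally:
        # Stop the context so a later SparkContext can be created.
        sc.stop()
```

On a machine where Spark starts successfully, calling `run_demo()` should return `["dataframe", "spark"]`, since "rdd" has only three characters.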
pyspark package