spark-fundamentals
RDD in Spark
An RDD (Resilient Distributed Dataset) is the fundamental data structure of Apache Spark and the primary data abstraction provided by Spark Core.
Posted August 31, 2022 by Rohith ‐ 5 min read
⌖ apache spark bigdata distributed-system spark-fundamentals rdd
Parallelize() In Spark
parallelize() is the SparkContext method used to create an RDD from a list of elements.
Posted August 31, 2022 by Rohith ‐ 3 min read
⌖ apache spark bigdata distributed-system spark-fundamentals
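To make the idea behind parallelize() concrete, here is a minimal conceptual sketch in plain Python of how a local list might be cut into contiguous partitions. It is not Spark's own code: the function name split_into_partitions and its parameters are hypothetical, and the even-split rule is an assumption chosen for illustration. In PySpark itself the real call is sc.parallelize(data, numSlices), which returns an RDD rather than a list of lists.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def split_into_partitions(items: Sequence[T], num_partitions: int) -> List[List[T]]:
    """Split a local sequence into contiguous, roughly equal partitions.

    Hypothetical helper for illustration only; Spark's actual partitioning
    happens inside SparkContext.parallelize and is not reproduced here.
    """
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")

    n = len(items)
    partitions: List[List[T]] = []
    for i in range(num_partitions):
        # Integer arithmetic keeps the slices contiguous and non-overlapping,
        # and guarantees every element lands in exactly one partition.
        start = (i * n) // num_partitions
        end = ((i + 1) * n) // num_partitions
        partitions.append(list(items[start:end]))
    return partitions


if __name__ == "__main__":
    # Five elements spread over two partitions.
    print(split_into_partitions([1, 2, 3, 4, 5], 2))  # prints [[1, 2], [3, 4, 5]]
```

Each inner list stands in for one partition of the resulting RDD. Asking for more partitions than there are elements simply leaves some partitions empty in this sketch.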