Learning Spark (3): A First Look at RDDs

2019-11-10  mumu_cola

The main abstraction Spark provides is a resilient distributed dataset (RDD), which is a collection of elements partitioned across the nodes of the cluster that can be operated on in parallel.
The above is how the official Spark website describes an RDD. Now let's step into the world of RDDs!
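
To make the quoted definition concrete, here is a minimal sketch of the idea of "a collection partitioned and operated on in parallel". It is a toy model in plain Python, not Spark's API: the helper names `partition` and `parallel_map_reduce` are hypothetical and exist only for this illustration, and threads on one machine stand in for the nodes of a cluster. In Spark itself the corresponding calls would be `sc.parallelize(data, numSlices)` followed by `map` and `reduce` on the resulting RDD.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def partition(data, num_partitions):
    """Split data into num_partitions contiguous chunks of near-equal size.

    Hypothetical helper: stands in for how a dataset is spread over nodes.
    """
    size, extra = divmod(len(data), num_partitions)
    chunks, start = [], 0
    for i in range(num_partitions):
        # The first `extra` chunks each take one additional element.
        end = start + size + (1 if i < extra else 0)
        chunks.append(data[start:end])
        start = end
    return chunks


def parallel_map_reduce(chunks, map_fn, reduce_fn):
    """Apply map_fn to every element, one worker per chunk, then combine.

    Hypothetical helper: each thread plays the role of one cluster node.
    """
    def process(chunk):
        return [map_fn(x) for x in chunk]

    with ThreadPoolExecutor(max_workers=len(chunks)) as pool:
        mapped = list(pool.map(process, chunks))

    # Flatten the per-chunk results and fold them into a single value.
    return reduce(reduce_fn, (y for chunk in mapped for y in chunk))


# Ten elements spread over four partitions, doubled, then summed.
chunks = partition(list(range(1, 11)), 4)
total = parallel_map_reduce(chunks, lambda x: x * 2, lambda a, b: a + b)
print(chunks)  # [[1, 2, 3], [4, 5, 6], [7, 8], [9, 10]]
print(total)   # 110
```

The sketch only covers the "partitioned" and "in parallel" parts of the definition. The "resilient" part, meaning Spark's ability to recompute lost partitions, is not modelled here.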
