组件: Spark core

2015-06-02  本文已影响59人  并肩走天涯

Spark can create distributed datasets from any file stored in the Hadoop distributed filesystem (HDFS) or other storage systems supported by the Hadoop APIs (including your local filesystem, Amazon S3, Cassandra, Hive, HBase, etc.).

Spark supports text files, SequenceFiles, Avro, Parquet, and any other Hadoop InputFormat.

上一篇 下一篇

猜你喜欢

热点阅读