Map Reduce笔记

2017-03-24  本文已影响33人  开水的杯子

MapReduce: Simplified Data Processing on Large Clusters acm pdf

What is Map Reduce?

MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to generate a set of intermediate key/value pairs, and a reduce function that merges all intermediate values associated with the same intermediate key.

..in particular, it “runs on a large cluster of** commodity machines** and is highly scalable.”

Who built it?

Google, for their search indexing.

Why was Map Reduce successful?

Easy to use, expressive (to an extent), scaleable implementation.

What were the takeaways from Map Reduce?

What is map reduce good for?

What is it not good for?

What influences did MR have on later systems and usage today?

上一篇下一篇

猜你喜欢

热点阅读