Google Search Architecture

2017-07-03  本文已影响0人  James_Qiu

Paper Cited: Web Search For A Planet: The Google Cluster Architecture

Google Search handles the largest volume of data in the world. Upon each query, the engine will manipulate the whole database to search for the suitable answer. Therefore, it must use efficient enough method to support faster data retrievement. How is the architecture of the system?

Most imported factors that influence its design:

Google provides reliability in the environment at the software level

Serving a Google Query

Query Execution can be divided into two phases

Phase 1

Phase 2

Both phases can be load balanced

Other Work

Other ancillary tasks upon receiving a query, such as sending the query to a spell-checking system and to an ad-serving system to generate relevant advertisements.

When all the phases are completed, a GWS generates the appropriate HTML page and returns it to the browser.

The CPU speed of the individual index servers does not directly influence the search's overall performance, because we can increase the number of shards to accommodate slower CPUs, and vice versa. Consequently, our hardware selection process focuses on machines that offer an excellent request throughput for our application, rather than machines that offer the highest single-thread performance.

Google cluster key design principles

For Google

上一篇 下一篇

猜你喜欢

热点阅读