Big Data Patterns | Mechanisms | Coordination Engine

Coordination Engine

A distributed Big Data solution that needs to run on multiple servers relies on the coordination engine to ensure operational consistency across all of the participating servers. Coordination engines make it possible to develop highly reliable, highly available distributed Big Data solutions that can be deployed in a cluster.

The processing engine will often use the coordination engine to coordinate data processing across a large number of servers. This way, the processing engine does not require its own coordination logic.

The coordination engine can also be used for the following purposes (Figure 1 illustrates the shared queue case):

to support distributed locks
to support distributed queues
to establish a highly available registry for obtaining configuration information
to provide reliable asynchronous communication between processes that are running on different servers
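The lock and registry uses listed above can be illustrated with a minimal, single-process sketch. It is only a stand-in under stated assumptions: the `ToyCoordinator` class, its method names, and the keys `job/output` and `job/batch_size` are hypothetical and do not come from any particular product. A real coordination engine would run as a separate, replicated service so that it stays available when a server fails.

```python
import threading


class ToyCoordinator:
    """In-process stand-in for a coordination engine (hypothetical, illustrative only)."""

    def __init__(self):
        self._guard = threading.Lock()   # protects the two tables below
        self._lock_owners = {}           # lock name -> id of the process holding it
        self._registry = {}              # configuration key -> value

    def try_acquire(self, lock_name, owner):
        """Grant the named lock to owner if it is free; return True on success."""
        with self._guard:
            if lock_name in self._lock_owners:
                return False
            self._lock_owners[lock_name] = owner
            return True

    def release(self, lock_name, owner):
        """Release the named lock, but only if owner actually holds it."""
        with self._guard:
            if self._lock_owners.get(lock_name) == owner:
                del self._lock_owners[lock_name]
                return True
            return False

    def put_config(self, key, value):
        """Store a configuration value in the shared registry."""
        with self._guard:
            self._registry[key] = value

    def get_config(self, key, default=None):
        """Read a configuration value from the shared registry."""
        with self._guard:
            return self._registry.get(key, default)


coordinator = ToyCoordinator()
coordinator.put_config("job/batch_size", 500)

first = coordinator.try_acquire("job/output", owner="node-a")   # lock is free
second = coordinator.try_acquire("job/output", owner="node-b")  # lock is already held
coordinator.release("job/output", owner="node-a")
third = coordinator.try_acquire("job/output", owner="node-b")   # lock was released
```

Because every participant asks the same coordinator, none of them needs its own logic for deciding who holds a lock or where the current configuration lives.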

Figure 1 – Two nodes in a cluster need to write to a shared queue as part of executing a job, and both send a write request at the same time (1a, 1b). The coordination engine coordinates the two requests by serializing them: one request is written to the queue first (2), and the other is written only afterwards (3).
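The scenario in Figure 1 can be approximated with two threads standing in for the two nodes. This is a sketch under assumptions, not a description of any specific engine: `QueueCoordinator`, `run_job`, and the node names are hypothetical, and a single in-process lock plays the role that a replicated coordination service would play in a cluster.

```python
import threading
from collections import deque


class QueueCoordinator:
    """Hypothetical coordinator that serializes writes to one shared queue."""

    def __init__(self):
        self._guard = threading.Lock()   # only one write request is handled at a time
        self._next_seq = 1               # order in which requests were admitted
        self.shared_queue = deque()

    def write(self, node, payload):
        """Admit one write request and append it to the shared queue."""
        with self._guard:
            seq = self._next_seq
            self._next_seq += 1
            self.shared_queue.append((seq, node, payload))
            return seq


def run_job():
    coordinator = QueueCoordinator()
    start = threading.Barrier(2)         # release both nodes at the same moment (1a, 1b)

    def node(name):
        start.wait()
        coordinator.write(name, f"result from {name}")

    threads = [threading.Thread(target=node, args=(n,)) for n in ("node-a", "node-b")]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return list(coordinator.shared_queue)


entries = run_job()
```

Which node is admitted first is not deterministic, but the queue always ends up holding both entries with sequence numbers 1 and 2, which corresponds to steps (2) and (3) in the figure.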

This mechanism is covered in BDSCP Module 2: Big Data Analysis & Technology Concepts.

For more information regarding the Big Data Science Certified Professional (BDSCP) curriculum, visit www.arcitura.com/bdscp.