CS174 Spring 98 Lecture 15 Summary

 

Distributed Computing: Choice Coordination

Choice coordination is one of a class of problems that arise in distributed computing. The problem sounds very easy: There are n processors and m choices, and all the processors have to agree on one of the choices. E.g. there may be m parallel programs that can be run, all of which require all n processors. The processors have to decide which one to run first. There are many other similar cases. You want the processors to work together, but in order to do that, they need to all agree on what to do.

But the solution is not so obvious. It's tempting to take a vote, or to let one processor pick, but in classical schemes you end up relying on one processor to choose or to tally the votes, and there is a risk of corruption, or of failure if there is a bug in that processor. A good distributed protocol should be truly distributed; that is, it shouldn't rely on special action by any particular processor.

Model

The model of computation is shown below. There is a set of m registers that all the processors can read or write.

[Figure: n processors, each able to access any of the m shared registers]

You don't need to take the model too literally. The shared registers don't all have to be in one place; they may be distributed among the processors. The picture shows the logical, not the physical, structure.

Every processor can write to any register. Since this may cause contention, there is a locking mechanism attached to each register so that if multiple processors try to access it, only one of them is allowed to. The others wait until the first processor’s lock is released, and then contend for access again.
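The per-register locking described above can be modeled with one mutex per register. Here is a minimal, hypothetical sketch in Python (the class name SharedRegister and the method names are mine, not part of the protocol):

```python
import threading

class SharedRegister:
    """A register any processor may read or write, guarded by its own lock."""
    def __init__(self, value=0):
        self.value = value
        self._lock = threading.Lock()

    def read(self):
        with self._lock:       # only one processor holds the lock at a time
            return self.value

    def write(self, value):
        with self._lock:       # others block here until the lock is released
            self.value = value

registers = [SharedRegister() for _ in range(4)]  # say m = 4 choices
registers[2].write(1)
print(registers[2].read())  # -> 1
```

A processor that finds the lock held simply waits and contends again once it is released, exactly as the paragraph above describes.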

At the end of the protocol, there should be a unique register with the special symbol √ in it. We measure the complexity of a choice coordination protocol as the number of read-write operations that it requires.

Lemma: Any deterministic algorithm that solves the choice coordination problem requires Ω(n^(1/3)) operations.

But randomized protocols don't have that limitation. In fact they are much faster on average. We will describe a protocol such that for any constant c, the probability that agreement is reached within c steps is at least 1 − 2^(−Ω(c)). That is, if X is the running time to agreement, then X is a geometric random variable (or more correctly, there is a geometric random variable whose probability distribution bounds X from above).

Synchronous Case, m=n=2

There are two registers C0 and C1, and two processors P0 and P1. The registers are assumed to be initialized to zero. Each processor has a local variable Bi, which is also zero initially.

Here’s some pseudo-code for the algorithm:

Input: Registers C0 and C1 initially zero

Output: Exactly one of the registers has the value √

  0. Pi is initially scanning register Ci.
  1. Read the current register to obtain a bit Ri.
  2. Select one of the following cases:
     (a) Ri = √: halt.
     (b) Ri = 0, Bi = 1: write √ into the current register and halt.
     (c) Otherwise: pick a random bit for Bi and write it into the current register.
  3. Pi exchanges its current register with P1-i and returns to step 1.
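The protocol is small enough to simulate directly. Below is a hypothetical Python sketch (the function name and trial harness are mine); the distinguished symbol √ is represented as a string. Within a round the two processors always scan different registers, so executing them one after the other inside the loop faithfully models the synchronous lockstep:

```python
import random

CHECK = "√"  # the distinguished symbol

def sync_choice_coordination(rng=random):
    """One run of the synchronous protocol; returns (chosen register, rounds)."""
    C = [0, 0]               # shared registers C0, C1
    B = [0, 0]               # local bits B0, B1
    cur = [0, 1]             # Pi starts at register Ci
    halted = [False, False]
    rounds = 0
    while not all(halted):
        rounds += 1
        for i in (0, 1):
            if halted[i]:
                continue
            Ri = C[cur[i]]
            if Ri == CHECK:               # case (a): the choice is made
                halted[i] = True
            elif Ri == 0 and B[i] == 1:   # case (b): registers differ
                C[cur[i]] = CHECK
                halted[i] = True
            else:                         # case (c): try a fresh random bit
                B[i] = rng.randrange(2)
                C[cur[i]] = B[i]
        cur[0], cur[1] = cur[1], cur[0]   # exchange registers for next round
    assert C.count(CHECK) == 1            # exactly one register is chosen
    return C.index(CHECK), rounds
```

Running many trials with a seeded generator, every run ends with exactly one √, and the average number of rounds comes out close to 4, matching the analysis below.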

To understand the algorithm, first notice that the two processors alternate between the registers and are always scanning opposite registers: P1 accesses register 1 first, then register 0 on the next iteration, then register 1 again, while P0 does the reverse, starting at register 0.

Now notice that if a processor doesn't halt, it goes through step 2(c). That means it picks a random bit for Bi and writes it into its current register. Each time we start an iteration, since the processors have swapped registers, Ri contains the bit that the other processor wrote, which is still the contents of B1-i. Step 2(b) is therefore the case where the current register (read into Ri) contains a 0 and the other register (whose value is saved in Bi) contains a 1. The registers contain different values, so it's safe to write a √ into the current register. The other processor will see Ri = 1 and Bi = 0, so it will not write a √.

When neither processor halts, both of them write a random bit into the registers. If two different bits are written, then one of the processors will halt on the next step, and the other on the step after that. The probability of writing different bits on a given step is exactly ½. The running time is therefore a geometric random variable (with parameter ½) plus a constant, and the expected running time is 4 steps (an expected 2 steps to get different bits, and two more for both processors to halt). The probability that it takes more than c + 2 steps to halt is at most 2^(−c).

Asynchronous Case, m=n=2

The main difference in this case is the addition of time stamps, which allow for the fact that the two processors are not synchronized. The time stamp isn't really a measure of exact time but of "iteration number", and even that can differ between the two processors.

Each processor keeps two time variables. Ti is like processor i’s clock. ti is the time that processor i reads from its current register, which will be the other processor’s time. The algorithm follows:

Input: Registers C0 and C1 initialized to <0,0>

Output: Exactly one of the two registers has the value √

  0. Pi is initially scanning a randomly chosen register. Thereafter it switches to the other register at the end of each iteration. The variables Ti and Bi are initialized to 0.
  1. Pi obtains a lock on its current register and reads <ti, Ri>.
  2. Pi executes one of the following cases:
     (a) Ri = √: halt.
     (b) Ti < ti: set Ti ← ti and Bi ← Ri.
     (c) Ti > ti: write √ into the current register and halt.
     (d) Ti = ti, Ri = 0, Bi = 1: write √ into the current register and halt.
     (e) Otherwise: set Ti ← Ti + 1 and ti ← ti + 1, assign a random (unbiased) bit to Bi, and write <ti, Bi> into the current register.
  3. Pi releases the lock on its current register, moves to the other register, and returns to step 1.
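The asynchronous protocol can also be simulated. In this hypothetical sketch (function name and scheduler are mine), each call to step() models one complete lock-read-act-release iteration, so the per-register lock is captured by making each step atomic; a random scheduler decides which processor moves next, standing in for arbitrary asynchrony:

```python
import random

CHECK = "√"  # the distinguished symbol

def async_choice_coordination(rng=random):
    """One run of the asynchronous protocol under a random interleaving.
    Returns the index of the register that ends up holding √."""
    C = [(0, 0), (0, 0)]                 # registers hold pairs <timestamp, bit>
    T = [0, 0]                           # Ti: processor i's own time stamp
    B = [0, 0]                           # Bi: processor i's local bit
    cur = [rng.randrange(2), rng.randrange(2)]  # random starting registers
    halted = [False, False]

    def step(i):
        reg = cur[i]
        if C[reg] == CHECK:              # case (a): the choice has been made
            halted[i] = True
            return
        ti, Ri = C[reg]
        if T[i] < ti:                    # case (b): other processor is ahead
            T[i], B[i] = ti, Ri
        elif T[i] > ti:                  # case (c): we are ahead; decide now
            C[reg] = CHECK
            halted[i] = True
            return
        elif Ri == 0 and B[i] == 1:      # case (d): register contents differ
            C[reg] = CHECK
            halted[i] = True
            return
        else:                            # case (e): advance clock, new random bit
            T[i] += 1
            B[i] = rng.randrange(2)
            C[reg] = (T[i], B[i])
        cur[i] = 1 - cur[i]              # release lock; scan the other register

    while not all(halted):
        live = [i for i in (0, 1) if not halted[i]]
        step(rng.choice(live))
    assert C.count(CHECK) == 1           # exactly one register is chosen
    return C.index(CHECK)
```

Over many seeded trials, every run terminates with a unique √, even though the interleaving differs from run to run.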

 

The analysis of the asynchronous version is very similar to that of the synchronous version. In particular, when Ti = ti, cases 2(d) and 2(e) of the asynchronous algorithm correspond to cases 2(b) and 2(c) of the synchronous algorithm.

The new cases are 2(b) and 2(c) of the asynchronous algorithm. For step 2(b) the other processor must be at least one step ahead (ti > Ti). Moving Ri into Bi ensures that at the end of this round, Bi will be equal to the contents of the register that we just left. We don’t write anything new into this register because it has already been read by the other processor in the ti round.

Step 2(c) takes a short-cut to completing the protocol. Since processor Pi is at least one time step ahead, if it writes the symbol √ into the current register, the other processor will read it the next time it visits that register and halt.