TD 4 - Synchronization 2 - Thread Management

In TD 3b we saw how to write versions of the queue data structure that allowed concurrent reads and writes, but we used the simplifying assumption of a single consumer. We used it to implement task graphs to display multiple images line by line.

Preparation

If you did everything right, you will see a new class Test4 in the default package, some new classes (which we will cover below), and two new images in the imgs directory.

Images and Messages

In TD 3b we only had one kind of message: image lines. In this TD, we will add three more kinds of messages:

You do not need to understand the structure of these messages to finish this TD, but it is useful to look at the implementation of util.PixelBuffer.

Finally, we add one new kind of filtering node, ColorSelectors, to select particular colors in a message. As with the distorters above, we restrict the instances to the following: ColorSelector.red, ColorSelector.green, and ColorSelector.blue.

Bounded Blocking Queues

Condition Variables

Active waiting wastes processor cycles; in the modern world of mobile computing, wasted cycles amount to wasted battery time and excess heat, both undesirable.

Write the class data.BlockingBoundedQueue, which is like data.BoundedQueue but is thread-safe and whose add() and remove() methods are both (potentially) blocking. More precisely:

The test in the while loop should be the negation of the semantics of the condition variable. For instance, if the condition variable represents emptiness, then you would have:
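As a minimal sketch of this pattern, consider a hypothetical class PendingCounter (not part of the TD sources) whose condition variable empty stands for "the count is zero". The loop test in awaitEmpty() is therefore the negation, count != 0. All names here are assumptions chosen for illustration.

```java
import java.util.concurrent.locks.Condition;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical illustration class; not part of the TD sources.
class PendingCounter {
    private final ReentrantLock lock = new ReentrantLock();
    // Condition variable whose meaning is "count == 0" (emptiness).
    private final Condition empty = lock.newCondition();
    private int count = 0;

    void increment() {
        lock.lock();
        try {
            count++;
        } finally {
            lock.unlock();
        }
    }

    void decrement() {
        lock.lock();
        try {
            count--;
            if (count == 0) {
                empty.signalAll(); // the condition now holds: wake every waiter
            }
        } finally {
            lock.unlock();
        }
    }

    // Blocks the caller until count == 0.
    void awaitEmpty() {
        lock.lock();
        try {
            while (count != 0) {              // negation of "empty"
                empty.awaitUninterruptibly(); // releases the lock while asleep
            }
        } finally {
            lock.unlock();
        }
    }

    int get() {
        lock.lock();
        try {
            return count;
        } finally {
            lock.unlock();
        }
    }
}
```

The test is a while rather than an if because a woken thread must re-check the state after reacquiring the lock: the wake-up may be spurious, or another thread may have changed the state in the meantime.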

We use awaitUninterruptibly() instead of await() to avoid dealing with thread interruptions (the InterruptedException exception). If your threads are interrupted during a queue operation, the behavior is unspecified and you may handle the failure in any way you wish.

Recall from Amphi 4 that the await() and awaitUninterruptibly() methods release the lock associated with the condition variable and put the calling thread to sleep; when a signal on the variable re-awakens the thread, it reacquires the lock before returning.

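To signal a condition, call conditionVar.signal(), which wakes one waiting thread, or conditionVar.signalAll(), which wakes all of them. In both cases the calling thread must hold the lock associated with the condition variable.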

Test your implementation by uncommenting one of the following in Test4.main(): Test4.setupSplit(), Test4.setupDistort() or Test4.setupMultiDistort1(). Remember to change Test4.getQueue() as well.

Multiple Consumers

A Fixed Number of Threads Per Processor

As you saw in TD 3b, the single thread in a processor node can get overwhelmed, leading to either unbounded space consumption in the input queue or a slowing down of the entire task graph. In this exercise, you will create a variant of nodes.Processor that will allow for a fixed number (≥ 1) of threads.
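The idea of several threads draining one input queue can be sketched as follows. This is only an illustration under stated assumptions: FixedWorkers is a hypothetical class, java.util.concurrent.LinkedBlockingQueue stands in for the TD's own queue, and "processing" a message just adds it to a running sum. The actual nodes.Processor interface may look different.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical stand-in for a processor node served by n worker threads.
class FixedWorkers {
    // Sentinel value telling one worker to stop; real messages must not use it.
    private static final Integer STOP = Integer.MIN_VALUE;

    private final BlockingQueue<Integer> input = new LinkedBlockingQueue<>();
    private final AtomicInteger sum = new AtomicInteger();
    private final Thread[] workers;

    FixedWorkers(int n) {
        if (n < 1) {
            throw new IllegalArgumentException("need at least one thread");
        }
        workers = new Thread[n];
        for (int i = 0; i < n; i++) {
            workers[i] = new Thread(this::workLoop);
            workers[i].start();
        }
    }

    // Every worker runs the same loop on the shared input queue.
    private void workLoop() {
        try {
            while (true) {
                Integer msg = input.take(); // blocks while the queue is empty
                if (msg.equals(STOP)) {
                    return;
                }
                sum.addAndGet(msg);         // placeholder for the real processing
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    void submit(int msg) {
        input.add(msg);
    }

    // Sends one sentinel per worker, waits for all of them, and returns the sum.
    int shutdownAndGet() {
        for (int i = 0; i < workers.length; i++) {
            input.add(STOP);
        }
        for (Thread t : workers) {
            try {
                t.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
        return sum.get();
    }
}
```

Because the queue is FIFO, every real message is taken before any sentinel, so no message is lost at shutdown. Note that with more than one worker, messages are no longer guaranteed to finish processing in the order in which they arrived.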

Thread Pools using Executors

Sometimes we may not want to create and run all the threads at the same time. Instead, we may want simply to bound the concurrency of the processor, allowing it to run at most n threads. This would mean that we need to be able to spawn new threads when needed, as long as we do not exceed the concurrency limit.

Java provides a convenient framework for managing such thread pools using Executors, which are implementations of java.util.concurrent.Executor. The most important method in an executor is the execute() method that takes an instance of a Runnable and schedules its computation. For example, here is how we can schedule a computation that prints "bonjour":
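A minimal sketch of such a call, assuming a single-thread executor obtained from Executors.newSingleThreadExecutor(); the class name BonjourExecutor and the helper schedule() are hypothetical and only serve the illustration.

```java
import java.util.concurrent.Executor;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

// Hypothetical illustration class.
class BonjourExecutor {
    // Hands a Runnable that prints "bonjour" to the given executor.
    static void schedule(Executor executor) {
        executor.execute(new Runnable() {
            @Override
            public void run() {
                System.out.println("bonjour");
            }
        });
    }

    public static void main(String[] args) {
        ExecutorService executor = Executors.newSingleThreadExecutor();
        schedule(executor);
        // Already submitted tasks still run; no new tasks are accepted.
        executor.shutdown();
    }
}
```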

This creates an anonymous class that is an instance of Runnable; like all instances of Runnable, there must be an implementation of the run() method, which is the computation that the executor will run. Note the similarity to the Thread class: the above code should (eventually) have the same effect as running:
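For comparison, a sketch of the Thread-based equivalent; the class name BonjourThread and the helper start() are hypothetical.

```java
// Hypothetical illustration class.
class BonjourThread {
    // Creates and starts a dedicated thread that prints "bonjour".
    static Thread start() {
        Thread t = new Thread(new Runnable() {
            @Override
            public void run() {
                System.out.println("bonjour");
            }
        });
        t.start();
        return t;
    }

    public static void main(String[] args) throws InterruptedException {
        start().join(); // wait for the thread to finish
    }
}
```

The difference is in who decides where the Runnable runs: here a new thread is always created, whereas an executor is free to reuse an existing thread or to delay the task.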

Where and when the computation runs depends on what kind of executor it is. In this exercise, you should focus on the following four executors:

Create the class nodes.ProcessorPool that has the same interface as ProcessorN and uses an executor of your choice. Here is a skeleton that you can start from:

In the run() method, submit a suitable Runnable instance that will process msg. You can make use of processMessage() inherited from Processor.
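The shape of such a submission can be sketched as follows. This is not the TD skeleton: PoolSketch is a hypothetical class, its processMessage() merely upper-cases a string, and the choice of Executors.newFixedThreadPool() is one option among several.

```java
import java.util.List;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

// Hypothetical stand-in; the real Processor API in the TD may differ.
class PoolSketch {
    private final ExecutorService executor;
    // Thread-safe collection of results, since several tasks write to it.
    final ConcurrentLinkedQueue<String> output = new ConcurrentLinkedQueue<>();

    PoolSketch(int maxThreads) {
        // At most maxThreads tasks run at once; extra tasks wait in the pool's queue.
        executor = Executors.newFixedThreadPool(maxThreads);
    }

    // Placeholder for the work done on one message.
    void processMessage(String msg) {
        output.add(msg.toUpperCase());
    }

    // Submits one task per message instead of processing it on the calling thread.
    void run(List<String> messages) {
        for (final String msg : messages) {
            executor.execute(new Runnable() {
                @Override
                public void run() {
                    processMessage(msg);
                }
            });
        }
    }

    // Stops accepting tasks and waits for the submitted ones to finish.
    boolean shutdownAndWait() {
        executor.shutdown();
        try {
            return executor.awaitTermination(10, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }
}
```

With more than one thread in the pool, the order in which results appear in output is not guaranteed to match the order of submission.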

Fork/Join Thread Pools

Executors are well suited for an unorganized pool of threads, but sometimes our threads have a hierarchy. You saw this in the Fibonacci example in TD 2. Java provides an alternative for these cases called java.util.concurrent.ForkJoinPool, which we are now going to use to process an image by divide and conquer.
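As a rough sketch of divide and conquer on a ForkJoinPool, the following hypothetical InvertTask works on a plain int array standing in for pixel data (it does not use the TD's PixelBuffer). A range no larger than the threshold is processed directly; a larger one is split in two halves that are run as subtasks.

```java
import java.util.concurrent.ForkJoinPool;
import java.util.concurrent.RecursiveAction;

// Hypothetical task: replaces each value v in pixels[from, to) by 255 - v.
class InvertTask extends RecursiveAction {
    private final int[] pixels;
    private final int from;
    private final int to;
    private final int threshold;

    InvertTask(int[] pixels, int from, int to, int threshold) {
        this.pixels = pixels;
        this.from = from;
        this.to = to;
        this.threshold = threshold;
    }

    @Override
    protected void compute() {
        if (to - from <= threshold) {
            // Small enough: do the work sequentially.
            for (int i = from; i < to; i++) {
                pixels[i] = 255 - pixels[i];
            }
            return;
        }
        // Too large: split in two halves and run both, waiting for completion.
        int mid = from + (to - from) / 2;
        invokeAll(new InvertTask(pixels, from, mid, threshold),
                  new InvertTask(pixels, mid, to, threshold));
    }

    // Runs the inversion on a pool with the given number of threads.
    static void invert(int[] pixels, int threshold, int parallelism) {
        if (threshold < 1) {
            throw new IllegalArgumentException("threshold must be at least 1");
        }
        ForkJoinPool pool = new ForkJoinPool(parallelism);
        try {
            pool.invoke(new InvertTask(pixels, 0, pixels.length, threshold));
        } finally {
            pool.shutdown();
        }
    }
}
```

The two halves touch disjoint parts of the array, so no locking is needed. A very small threshold creates many tiny tasks, which is a useful way to observe the scheduling overhead.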

You already have a processor called data.TileDistorter that is intended to perform a distortion on PixelBuffers. Your job is to write the processTile() method. You will perform the following steps:

Test your code by running Test4.setupDistortFrame() and Test4.setupDistortAnimation() in Test4.main(). Also experiment with different thresholds (definitely try a threshold of 1 pixel), and with the number of threads assigned to the ForkJoinPool, which you can set through its constructor argument. If you are feeling adventurous, try running this on some large images, such as those from:

Extra

Optional: Atomic Bounded Queues

Write a lock-free version of LockedBoundedQueue called AtomicBoundedQueue that uses atomic primitives instead of locks, condition variables, or blocking. This queue should not block in add().

HINTS

Use two global atomic counters to count the number of successful add() and remove() steps respectively. These counters should strictly increase.
For each cell in the queue, you need to store a sequence number that also strictly increases. You can use a separate AtomicIntegerArray or just create a special class for queue cells with an AtomicInteger.
The \(n\)th add() in a cell with sequence number \(k\) can succeed if and only if \(n = k\), in which case \(k\) should be incremented. If \(n > k\), then the cell still holds an element that has not been removed yet, so there is no more space in the queue. If \(n < k\), then some other add() snuck in and added something before you (and incremented \(k\)), so you need to try again.
Similarly, the \(n\)th remove() should succeed for a cell with sequence number \(k\) if and only if \(n + 1 = k\), in which case \(k\) should be set to \(n\) plus the size of the queue (so that the future add() that wraps around to this entry can succeed). If \(n + 1 > k\), then there is nothing to remove from the queue, so the calling thread must wait. If \(n + 1 < k\), then some other remove() already took what was there, so you need to try again.
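One possible reading of these hints, in the style of Dmitry Vyukov's bounded multi-producer multi-consumer queue, is sketched below. The class name AtomicBoundedQueueSketch, the non-blocking poll() helper, and the convention that cell \(i\) starts with sequence number \(i\) are assumptions of this sketch. A cell with sequence number \(k\) accepts the \(n\)th add() when \(k = n\) (then \(k\) becomes \(n + 1\)) and the \(n\)th remove() when \(k = n + 1\) (then \(k\) becomes \(n\) plus the queue size).

```java
import java.util.Objects;
import java.util.concurrent.atomic.AtomicLong;
import java.util.concurrent.atomic.AtomicLongArray;
import java.util.concurrent.atomic.AtomicReferenceArray;

// Hypothetical sketch of a lock-free bounded queue; null elements are not allowed.
class AtomicBoundedQueueSketch<T> {
    private final int size;
    private final AtomicReferenceArray<T> items;
    private final AtomicLongArray seq;                   // per-cell sequence numbers k
    private final AtomicLong adds = new AtomicLong();    // number of claimed add() steps
    private final AtomicLong removes = new AtomicLong(); // number of claimed remove() steps

    AtomicBoundedQueueSketch(int size) {
        this.size = size;
        items = new AtomicReferenceArray<>(size);
        seq = new AtomicLongArray(size);
        for (int i = 0; i < size; i++) {
            seq.set(i, i); // cell i first accepts add number i
        }
    }

    // Never blocks: returns false if the queue is full.
    boolean add(T item) {
        Objects.requireNonNull(item);
        while (true) {
            long n = adds.get();
            int cell = (int) (n % size);
            long k = seq.get(cell);
            if (k == n) {
                if (adds.compareAndSet(n, n + 1)) { // claim the nth add
                    items.set(cell, item);
                    seq.set(cell, n + 1);           // publish: cell is ready for remove n
                    return true;
                }
            } else if (k < n) {
                return false; // cell not yet freed by a remove: queue is full
            }
            // Otherwise another add() got there first: retry with a fresh counter.
        }
    }

    // Never blocks: returns null if there is nothing to remove.
    T poll() {
        while (true) {
            long n = removes.get();
            int cell = (int) (n % size);
            long k = seq.get(cell);
            if (k == n + 1) {
                if (removes.compareAndSet(n, n + 1)) { // claim the nth remove
                    T item = items.get(cell);
                    items.set(cell, null);
                    seq.set(cell, n + size);           // cell is ready for add n + size
                    return item;
                }
            } else if (k < n + 1) {
                return null; // cell not yet filled by an add: queue is empty
            }
            // Otherwise another remove() got there first: retry.
        }
    }

    // Waits (by spinning) until an element is available.
    T remove() {
        T item;
        while ((item = poll()) == null) {
            Thread.yield();
        }
        return item;
    }
}
```

The sketch uses long counters so that wrap-around of the sequence numbers is not a practical concern. A full or empty verdict can be momentarily pessimistic while another thread has claimed a cell but not yet published its new sequence number; callers that retry are unaffected.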

Introduction