Shared variables and critical sections
Problem 1: A shared counter
In this problem we will consider the following Java program:
class Counter implements Runnable { private int counter = 0; private final int rounds = 100000; public void run () { // try { for(int i = 0; i < rounds; i++) { counter++; } // } catch (InterruptedException e) { // System.out.println ("Interrupted!"); // } } public static void main (String[] args) { try { Counter c = new Counter (); // Create two threads that run our run () method. Thread t1 = new Thread (c, "thread1"); Thread t2 = new Thread (c, "thread2"); t1.start (); t2.start (); // Wait for the threads to finish. t1.join (); t2.join (); // Print the counter System.out.println(c.counter); } catch (InterruptedException e) { System.out.println ("Interrupted!"); } } }
The main method creates a Counter
object and runs two threads on it, both
running its run()
method, which updates the shared counter. After both threads
have updated the counter 100000 times, they finish and its value is printed. The
expected output is 200000. Compile and run the program.
The results that you get may depend on the hardware, operating system and the version of the Java environment. If you are running it on a multi-core machine, it is quite likely that at least some of the runs will give different results than 200000.
The reason for this is that incrementing the counter is in fact not one basic
operation, but two. Code counter++
gets compiled into something like this:
int tmp = counter; counter = tmp + 1;
So while one thread is about to write the new value to the shared counter variable the other one might update it several times (or even very many times) in the mean time. The Java standard does not guarantee that the threads will run synchronized in any way. And in fact they often are completely unsynchronized and execute in a very surprising order. Whenever a particular order of executing code in different threads causes the program to make an error, we call it a race condition. In this case it's a race condition between two threads reading and updating the same counter.
How to fix this problem? One possible way is to restrict the way in which operations of the two threads may be ordered. In particular, we don't want one thread to read and change the counter when the other one has read it but not updated it yet.
In short, we want to make sure that at most one thread at a time is executing
the code that reads and updates the counter. This property is called mutual
exclusion and one way to achieve it is to use the synchronized
keyword in
Java. Replace the line counter++
with this code:
synchronized (this) { counter++; }
What we have added is a synchronized
block, which makes sure that no other
synchronized
block from the same object (this
) executes at the same time. In our
case, this means that the other thread, which tries to enters the synchronized
block on the same Counter
object, will have to wait until the first thread
leaves the block.
Recompile and run the example, and now the printed value should always be
- We haven't changed the actual operations performed by the program, but by controling the order in which operations are performed we made sure that the result will be correct.
Now comes the main part of the exercise. Your job will be to find out what could be the least value of the counter reported by the original, unsynchronized, program. You will also have to demonstrate that happening. But instead of running the program 1000000000 times and hoping to get the very unlikely result you will be able to pause the threads in strategic moments to get a good chance of getting the results.
Start by taking the original program (without the synchronized block) and
replace counter++
with
int tmp = counter; counter = tmp + 1;
Now you will add a number of lines like this to the loop:
if (id== ?? && i== ??) Thread.sleep(??)
Each line will delay one thread on a specific iteration of the loop by a
specific number of milliseconds. To be able to tell which thread you are in, you
have to add this code to the beginning of the run()
method:
int id; String name = Thread.currentThread().getName(); if (name.equals("thread1")) id = 1; else id = 2;
Note that in normal programs you should identify threads by using your own data
(for example integers) as identifiers. Thread.getName()
is good for
debugging. Also, because Thread.sleep()
is interruptible, it may throw a
InterruptedException
, which we have to catch here. This exception is used to for
safe thread cancellation, and we will not deal with this topic in this
exercise. Instead, we will just catch it and ignore it. For that, you need to
uncomment the try-catch block in the run()
method.
class Counter implements Runnable { private int counter = 0; private final int rounds = 100000; public void run () { try { int id; String name = Thread.currentThread().getName(); if (name.equals("thread1")) id = 1; else id = 2; for(int i = 0; i < rounds; i++) { // Delay here? int tmp = counter; // Perhaps here? counter = tmp + 1; // Or here? } } catch (InterruptedException e) { System.out.println ("Interrupted!"); } } public static void main (String[] args) { try { Counter c = new Counter (); // Create two threads that run our run () method. Thread t1 = new Thread (c, "thread1"); Thread t2 = new Thread (c, "thread2"); t1.start (); t2.start (); // Wait for the threads to finish. t1.join (); t2.join (); // Print the counter System.out.println(c.counter); } catch (InterruptedException e) { System.out.println ("Interrupted!"); } } }
Find out what is the least number the program could report and demonstrate this by inserting delays.
Here are conclusions that we would like to draw from the above exercise. First, we demonstrated that even simple data manipulation can give corrupted results if it is performed by many threads concurrently without proper synchronization. Data corruption is a common problem with buggy concurrent code. Secondly, sometimes concurrent programs can give very unexpected results if a very specific (and unlikely) ordering of concurrent actions takes place. We were able to fix the concurrency problem by restricting different possible orderings of actions performed by different threads using synchronization.
Problem 2: An information display
In this example we will consider a display of the kind commonly found in airports, where information about upcoming flights is given:
We will study the programming of such a display in Java. This will enable us to study simple situations where concurrent threads interfere in their use of a shared data structure. The Java exercise should also be possible to do on any platform, given that Java 1.5 is available.
First download the file displayJava.zip to a suitable working directory. Then decompress:
$ unzip displayJava.zip Archive: displayJava.zip creating: java/ inflating: java/HighLevelDisplay.java inflating: java/HWDisplay.java inflating: java/JDisplay.java inflating: java/JDisplay2.java inflating: java/Main1.java inflating: java/Main2.java inflating: java/Main3.java $ ls java HWDisplay.java JDisplay2.java Main3.java HighLevelDisplay.java Main1.java JDisplay.java Main2.java
Unfortunately, we do not have access to a real 10 square meter hardware display, so we have here instead a complete Java implementation, where the display is visualized using the Java Swing library. However, we should think of this as a simulation of the programming of a hardware device in the following way.
The hardware is accessed using the following simple API, found in the file
HWDisplay.java
:
public interface HWDisplay { public int getRows(); public int getCols(); public void write(int row, int col, char c); }
The two first methods return the number of rows and columns, respectively, in the display (presumably the device comes in different sizes). The third is used to write one character in the specified row and column. Following Java practice, we start counting rows and columns from 0.
The file JDisplay.java
contains an implementation of this API using Swing. You
do not need to look into this file for this exercise. It contains a lot of
low-level detail specific to Java's GUI libraries, which are not essential for
this course. Instead, look at Main1.java for a simple main program that writes
an A and a B at different positions in the display, then waits for three seconds
and finally removes the A. After that, the program just waits for further input
and you have to kill it using Ctrl-C. To see this in action, do as follows:
$ javac *.java $ java Main1
You may note the convenient function nap
for future use.
However, it is clear that the interface HWDisplay
is much too low-level for
actually programming the display. Therefore, the manufacturer provides also
HighLevelDisplay.java
, which is much more convenient to use:
public interface HighLevelDisplay { public void clear(); public void addRow(String str); public void deleteRow(int row); }
The method clear removes all text from the display. addRow(str)
adds the
string str
as the new last row, after the last existing line. Finally,
deleteRow(r)
deletes row r
, shifting all rows below up one slot. Also, the
intention is that one can add rows also when the display is fully used; this row
will not be visible on screen until enough rows above has been deleted. Finally,
the last visible change is highlighted by flashing a few times. All this should
of course be more fully documented, preferrably using javadoc
, but we have
intentionally left this out so as not to clutter the code.
he display manufacturer also provides an implementation of this interface in the
file JDisplay2.java
. You should study and understand this class, but first you
may want to run a simple demo:
$ java Main2
We will not have time to try to think of a full-fledged program for the airport
administration using this API, but just note that it is conceivable that the
display will be accessed from a multi-threaded program, with several airport
officials concurrently updating the display. To test the class JDisplay2
under
such conditions, your task now is to write a simple multi-threaded program. A
skeleton is found in the file Main3.java
, which shows the structure of a
simple main program that creates a display d
and starts two threads, one
executing the static procedure addProc(d)
, the other executing
deleteProc(d)
. You must complete the bodies of these two procedures. Fill
addProc
with a sequence of addRow
commands, interspersed with suitable
naps. Similarly, fill deleteProc
with calls to deleteRow(0)
. To allow for a
not too boring simulation, naps should be in the order of seconds (or fractions
of seconds) and not minutes as in a real airport.
If you do this and run your program, you will probably see some un-intended
behaviour. You should make sure that you understand how these problems can
occur. In fact, the class JDisplay2
is not thread safe; it does not guarantee
correct behaviour when its methods are accessed by concurrent threads.
We will now fix this problem in two different ways:
The airport IT department, developing the application
Main3
, does not have access toJDisplay2
, but only to the interface and an object file. Thus they have to solve the problem without modifyingJDisplay2
. One way to do this is to identify the critical sections inMain3
and protect them using a semaphore. Do this! You should use an instance ofjava.util.concurrent.Semaphore
for this purpose. (java.util.concurrent
is a new package from Java 1.5.) Note that the methodacquire()
may throw anInterruptedException
, so you will have to use atry
/catch
construction. Test your program again to see that it seems to behave correctly. We say "seems", since you should now have an initial feeling for the difficulties in testing a concurrent program.The IT department has also complained to the display manufacturer, requesting a thread safe implementation of
JDisplay2
. Fulfill this request. It is easy: just add the modifiersynchronized
to the header of each method. We will explain this later in the course, but for the moment you should just check that your initialMain3
(without the semaphore) together with your thread safeJDisplay2
gives a correct program. As a final remark, we just note that it is an important but non-trivial design decision whether to make a class thread safe. For multi-threaded use it is essential, but it does imply a performance penalty. An example of a large library that is not thread safe is the Swing library for GUI's. One possibility, used in the Collections framework, is to provide both unsynchronized base implementations and synchronized wrapper classes. You may want to think of still another fix to the original program: instead of using semaphores, the IT department could have produced a simple synchronized wrapper class forJDisplay2.java
.