On some machines, Test 8 shows periods during which different
threads see operand A take on the same sequence of values.
Such a period can last for as many as 20 or so loop
executions. On other machines these convoys of values do not
appear. What in the hardware design causes convoys to appear
on some machines but not on others?
Convoys are of length 14 or 15: A 4-way Sun Sparc 630
Convoys are of length 3: A KSR-1
No convoys were visible:
A NUMAchine
A 2-way Sun Sparc 20
A 2-way Sun Sparc Ultra-2
A 2-way Intergraph TD-400
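
One way a convoy length like those listed above could be measured from
recorded data is sketched below. This is only an illustration, not the
method Test 8 itself uses. It assumes each thread logs the value of
operand A it observes on every loop execution, and it treats a convoy as
the longest contiguous run of values that appears in both logs. The
function name longest_convoy and the sample logs are hypothetical.

```python
def longest_convoy(seen_by_t1, seen_by_t2):
    """Return the length of the longest contiguous run of values
    that appears in both observation logs (assumed convoy measure)."""
    best = 0
    # prev[j] holds the length of the common run that ends at the
    # previous element of seen_by_t1 and at seen_by_t2[j - 1].
    prev = [0] * (len(seen_by_t2) + 1)
    for a in seen_by_t1:
        cur = [0] * (len(seen_by_t2) + 1)
        for j, b in enumerate(seen_by_t2, start=1):
            if a == b:
                # Extend the run that ended one step earlier in both logs.
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


# Hypothetical logs: both threads see 10, 11, 12 in a row.
t1_log = [5, 9, 10, 11, 12, 40]
t2_log = [7, 10, 11, 12, 33]
convoy_length = longest_convoy(t1_log, t2_log)
```

The sketch does not require the two threads to be at the same loop
index when they see the shared values. If the values of A can repeat
during a run, short accidental matches are possible, so only runs well
above that background level would suggest a real convoy.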