Steven Gottlieb, Fermilab and Indiana University
QCD is the theory of the strong interaction. We can write down the
theory, but there is no known analytic solution.
A numerical approach uses a regular grid of lattice points in spacetime
Quarks are represented by 3-component complex vectors at each grid point
Gluons are represented by 3 X 3 complex matrices on each link between points
![]() Domain decomposition is used to parallelize the code This Spring the MILC code was modified to improve cache performance, by going from a "site major" to a "field major" organization. In site major approach all physical observables at each grid point are stored together. In field major, each observable gets its own array. This results in excellent improvement on problems that don't fit in cache. The benchmarks presented here were run on grids of size L4.
For multinode benchmarks we use L4 grid points per cpu.  On the IBM SP we have results on up to 256 CPUs with both site major and field major inverters.  These were obtained on the Indiana University SP using 4-way SMP nodes.  The field major code results in a substantial increase in speed for L > 8, but for reasons not yet understood, it underperforms the site major code for small L and larger numbers of CPUs.
About 200,000 cpu hours on the IU SP have been used over the past 3-4
months to study B-meson decay constants.
|