This subroutine calculates the stresses and strains for each pixel held by each processor. MPI calls are needed to sum the per-processor contributions onto the root (master) node, as in the C calculation. The 12 macroscopic stresses and strains are then broadcast to every node, although only the root processor uses these values, and only for output.
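The communication pattern described here (local sums, a sum-reduction onto root, then a broadcast) can be illustrated with a minimal serial sketch. This is not the subroutine itself: it emulates the processors as entries in a Python list so that it runs without an MPI library, and every function and variable name in it is hypothetical. It also assumes that the 12 values are 6 stress and 6 strain components, and that the macroscopic values are the totals divided by the global pixel count; neither detail is stated above. In a real MPI program the two emulated steps would typically correspond to `MPI_Reduce` with `MPI_SUM` followed by `MPI_Bcast`.

```python
# Serial emulation of the reduce-then-broadcast pattern (no MPI library used).
# All names are hypothetical; each "rank" is simply one entry in a list.

NCOMP = 12  # assumed: 6 stress components + 6 strain components


def local_sums(pixels):
    """Sum the 12 per-pixel values over the pixels owned by one rank."""
    totals = [0.0] * NCOMP
    for values in pixels:
        for k in range(NCOMP):
            totals[k] += values[k]
    return totals


def reduce_to_root(partials):
    """Emulate a sum-reduction: root ends up with the element-wise total."""
    totals = [0.0] * NCOMP
    for part in partials:
        for k in range(NCOMP):
            totals[k] += part[k]
    return totals


def broadcast(root_values, nranks):
    """Emulate a broadcast: every rank receives its own copy of root's values."""
    return [list(root_values) for _ in range(nranks)]


def macroscopic_averages(pixels_by_rank):
    """Return, for each rank, the 12 macroscopic values after the broadcast."""
    npix = sum(len(p) for p in pixels_by_rank)           # global pixel count
    partials = [local_sums(p) for p in pixels_by_rank]   # work done on each rank
    root_totals = reduce_to_root(partials)               # sum onto root
    # Assumption: macroscopic value = total / number of pixels.
    root_avg = [t / npix for t in root_totals]
    return broadcast(root_avg, len(pixels_by_rank))      # send to all ranks


if __name__ == "__main__":
    # Two emulated ranks: the first owns two pixels, the second owns one.
    demo = [[[1.0] * NCOMP, [3.0] * NCOMP], [[5.0] * NCOMP]]
    per_rank = macroscopic_averages(demo)
    print(per_rank[0])  # rank 0 plays the role of root and would write the output
```

Because every rank ends up holding the same 12 numbers, a single all-reduce call would give the same result; the two-step form is shown only because it mirrors the description above.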