Sun HPC ClusterTools 3.0 Administrator's Guide: With CRE

Troubleshooting Tips

CRE RPC timeouts in user code are generally not recoverable. The job might continue to run, but processes probably won't be able to communicate with each other. There are two ways to deal with this:

/tmp/.hpcshm_mmap.jid.*

/tmp/.hpcshm_acf.jid.*

The Sun MPI shared memory protocol module uses these files for interprocess communication on the same node. These files consume swap space.