C H A P T E R  1

Quick Reference

This list is a summary of the key performance tips found in this document. They are organized under the following categories:


Compilation and Linking

Compilation and linking are discussed in Chapter 6.

See Compiler Version.

See The mp* Utilities.

See The -fast Switch.

 

See The -xarch Switch.

See The -g Switch.

See Other Useful Switches.


MPProf


Analyzer Profiling

Use of the Performance Analyzer with Sun MPI programs is discussed in Chapter 7.

 

Here, the following techniques have been used:


Job Launch on a Multinode Cluster

See Running on a Dedicated System.

- Run on one node if possible.

- Place heavily communicating processes on the same node as one another.

See Minimizing Communication Costs.

- Run on one node if possible.

- Otherwise, spread over many nodes.

- For example, spread jobs that use multiple I/O servers.

See Controlling Bisection Bandwidth.

or use the following commands:


% cat a.csh#!/bin/cshmprun -n -np 4 a.out% a.csh

See Running Jobs in the Background.

% limit coredumpsize 0 (for csh)

$ ulimit -c 0 (for sh)

See Limiting Core Dumps.

% mprun -np 32 -Zt 4 a.out

or

% mprun -np 32 -Z 4 a.out

See Collocal Blocks of Processes.

See Multithreaded Job.

See Round-Robin Distribution of Processes.

See Detailed Mapping.


MPI Programming Tips

See Reducing Message Volume.

See Reducing Serialization and Load Balancing.

See Synchronization.

See Buffering.

See Nonblocking Operations.

See Polling.

See Sun MPI Collectives.

See Contiguous Data Types.

See Special Considerations for Message Passing Over TCP.