10.5. Fundamentals

The mapping of processes to nodes can be defined not just with general policies but also, if necessary, using arbitrary mappings that cannot be described by a simple policy. Supported directives, given on the command line via the --mapby option, include:

SEQ: (often accompanied by the file=<path> qualifier) assigns one process to each node specified in the file. The sequential file is to contain an entry for each desired process, one per line of the file.
RANKFILE: (often accompanied by the file=<path> qualifier) assigns one process to the node/resource specified in each entry of the file, one per line of the file.

For example, using the hostfile below:

$ cat myhostfile
aa slots=4
bb slots=4
cc slots=4

The command below will launch three processes, one on each of nodes aa, bb, and cc, respectively. The slot counts don’t matter; one process is launched per line on whatever node is listed on the line.

$ prun --hostfile myhostfile --mapby seq ./a.out

Impact of the ranking option is best illustrated by considering the following hostfile and test cases where each node contains two packages (each package with two cores). Using the --mapby ppr:2:package option, we map two processes onto each package and utilize the --rankby option as show below:

$ cat myhostfile
aa
bb

Command	Ranks on `aa`	Ranks on `bb`
`--rankby core`	0 1 ! 2 3	4 5 ! 6 7
`--rankby package`	0 2 ! 1 3	4 6 ! 5 7
`--rankby package:SPAN`	0 4 ! 1 5	2 6 ! 3 7

Ranking by slot provides the identical result as ranking by core in this case — a simple progression of ranks across each node. Ranking by package does a round-robin ranking across packages within each node until all processes have been assigned a rank, and then progresses to the next node. Adding the :SPAN qualifier to the ranking directive causes the ranking algorithm to treat the entire allocation as a single entity — thus, the process ranks are assigned across all packages before circling back around to the beginning.

The binding operation restricts the process to a subset of the CPU resources on the node.

The processors to be used for binding can be identified in terms of topological groupings — e.g., binding to an l3cache will bind each process to all processors within the scope of a single L3 cache within their assigned location. Thus, if a process is assigned by the mapper to a certain package, then a --bindto l3cache directive will cause the process to be bound to the processors that share a single L3 cache within that package.

To help balance loads, the binding directive uses a round-robin method, binding a process to the first available specified object type within the object where the process was mapped. For example, consider the case where a job is mapped to the package level, and then bound to core. Each package will have multiple cores, so if multiple processes are mapped to a given package, the binding algorithm will assign each process located to a package to a unique core in a round-robin manner.

Binding can only be done to the mapped object or to a resource located within that object.

An object is considered completely consumed when the number of processes bound to it equals the number of CPUs within it. Unbound processes are not considered in this computation. Additional processes cannot be mapped to consumed objects unless the OVERLOAD qualifier is provided via the “–bindto” command line option.

Default process mapping/ranking/binding policies can also be set with MCA parameters, overridden by the command line options when provided. MCA parameters can be set on the prte command line when starting the DVM (or in the prterun command line for a single-execution job), but also in a system or user mca-params.conf file or as environment variables, as described in the MCA section below. Some examples include:

`prun` option	MCA parameter key	Value
`--mapby core`	`mapby`	`core`
`--mapby package`	`mapby`	`package`
`--rankby core`	`rankby`	`core`
`--bindto core`	`bindto`	core`
`--bindto package`	`bindto`	`package`
`--bindto none`	`bindto`	`none`