16.1. PRRTE v3.x series

This file contains all the NEWS updates for the PRRTE v3.0 series, in reverse chronological order.

16.1.1. 3.0.1 – 27 Sep 2023

Important

This release contains a major revision of the PRRTE documentation infrastructure. Accordingly, the various documents may contain some number of errors and should be regarded as a work-in-progress. We apologize in advance for any inconvenience this may create, and will continue to improve and correct the documents in future releases.

  • PR #1810: Blacklist the HWLOC GL component to avoid deadlock

  • PR #1807: Multiple commits
    • src/docs/show-help-files/Makefile.am: fix syntax

    • docs: Fix up several minor bugs that showed up via CI

    • Prototype Github Actions for PRRTE CI

  • PR #1800: Keep trying to find the solution to the docs problem

  • PR #1798: Keep trying to fix this Python stuff

  • PR #1796: Don’t ignore failure to create directory

  • PR #1793: build-dummy-ini-files: Be safe for Python 2

  • PR #1791: Update to track master branch

  • PR #1787: Add support for dynamic definition of process sets

  • PR #1784: Multiple commits
    • docs: fix HTML word wapping in table cells

    • Support fix for OMPI Github issue #11532

    • Fix binding to multiple pe’s

    • Fix typos in the –enable-dlopen support

    • Fix typo of variable name (PRRTE->PRTE)

    • Remove Doxygen

  • PR #1766: Multiple commits
    • Update ignores

    • mca/rmaps: impute process count when np is not set

    • Complete cleanup of map/bind default settings

  • PR #1762: v3.0: .github/workflows: update actions versions

  • PR #1752: [v3.0] rmaps/base: remove call to hwloc_bitmap_andnot() in bind_generic()

  • PR #1747: Respect “–” marker

  • PR #1741: [v3.0] RTD: Add .readthedocs.yaml file

  • PR #1737: Multiple commits
    • Add debug output in fence upcall

    • Silence TSan data race warning.

  • PR #1734: v3.0: backport fixes for ompi v5.0 submodule update

  • PR #1727: Protect against bad ppr directives

  • PR #1724: Fix one more place

  • PR #1721: Just disable the xml map output for ancient hwloc

  • PR #1718: Protect against ancient hwloc versions

  • PR #1714: Multiple commits
    • Parsable output in an XML format

    • Support odd topologies and relax pe-binding rules

    • Cleanup some formatting and unnecessary debug

  • PR #1705: Allow specifying ppr as the default mapping policy

  • PR #1702: Multiple commits
    • Recoverable jobs may report more than one proc error over the lifetime

    • Fix double-counting of failed procs in recoverable jobs

  • PR #1700: Update headers for rc2

  • PR #1696: Always set waitpid-fired flag when proc exits

  • PR #1694: Fix passing of envars for spawn

  • PR #1692: Multiple commits
    • Plug small memory leaks.

    • Plug memory leaks.

    • Adhere to project’s variable declaration convention.

    • Cleanup initial implementation of allocate and session ctrl

    • Protect one more place for Session_control

    • Fix the routing for non-tree-spawn launch

    • Cleanup prte_info output

    • Ensure we exit cleanly when a daemon fails to start

    • Store prted URI under correct name

    • Remove duplicate computation of routing tree

    • Minor cleanup of verbose output

    • Fix typo

    • Send direct to HNP if we don’t know a route

    • Cleanup some mangled formatting

    • Cleanup some debug output

    • Update the pmix server integration

  • PR #1686: Update config/oac submodule

  • PR #1682: Multiple commits
    • Remove wrapper definitions

    • Extend example to check for sessionID and jobID

  • PR #1678: Multiple commits
    • add –debug-daemon to prted options

    • Add the rml and routed types to prte_info array

  • PR #1674: Ensure cmd line errors provide error messages

  • PR #1672: Multiple commits
    • Correct default binding for map-by node/slot

    • Provide better error message for cross-package binding

    • Add missing command line directives

    • Fix minor issues found in tests.

    • Silence compiler warning in test/connect.

    • Silence compiler warnings in examples.

    • Use exit macros in example dmodex.

    • Remove antiquated code that can cause problems

    • Update sphinx requirements

    • Remove the PMIX_SIZE_ESTIMATE support

  • PR #1660: Correct function name when using older hwloc versions

  • PR #1658: Multiple commits
    • Properly handle overload-allowed

    • Define MIN for environments that lack it

    • Fix a typo in the README, fixes #1645

    • docs: fix Sphinx parsing error

  • PR #1644: Multiple commits
    • Add a spawn_multiple test

    • Provide an estimate of the size of registration data

    • Provide size estimates for collective operations that return data

    • Add an option to display the available CPUs

    • Fix generate_dist type

    • Protect against missing attribute definition

    • Set distances array type

    • Add “parseable” display qualifier

    • Generate distances by default

    • Show help message when mapping too low

    • Fix round-robin by obj with multiple cpus/rank

    • Fix map-by slot and node for multiple cpus/rank

  • PR #1633: v3.0.x: First cut of Sphinx/ReadTheDocs docs

  • PR #1626: Remove unused variable

  • PR #1622: Multiple commits
    • Update version to 3.0.1

    • Extend query support to include allocation info

    • Remove debug

    • Include topology in allocated node info

    • Minor touchups of the “fault” example

    • Designate the framework project as PRTE

    • Begin adding support for new server module function pointer

    • Silence warning - add return values

    • Update to compatibility with PMIx master ranch

    • Protect against earlier PMIx versions

    • Add an IOF stress test

    • Accept local group op upcalls

    • Check for and track scheduler attachment

    • Check for map/bind conflicting directive, improve unbound report

    • Don’t error out on binding if not required

  • PR #1600: Multiple commits
    • Minor change to check_os_flavors

    • Don’t emit an error output if not needed

  • PR #1597: Fix oac_check_package.m4

  • PR #1595: build: fix bashisms in configure

16.1.2. 3.0.0 – 13 Nov 2022

Important

This release represents a major update to the PRRTE code base from prior releases. Many of the changes will be transparent to the user. However, there are a few key areas that may (likely) impact users. These include:

  • changes to the cmd line interpreter. PRRTE has provided an abstraction layer that allows programming library developers and environment vendors to customize the cmd line of PRRTE binaries (e.g., prterun and prun). This is governed by the detection of a “personality” that is to be used for defining the cmd line options. The personality can be set either directly on the cmd line itself (via the “–personality foo” option) or is detected by PRRTE thru the environment.

  • a more extensive “–help” support system.

  • reduction in the “–rank-by” options.

  • addition of new “–output” options

  • removal of the “mindist” mapper due to lack of a maintainer

There are quite a few more changes than listed here, but they mostly pertain to code internals. However, these dictate that PRRTE v3.0.0 require use of PMIx v4.2 or above to compile. Note that this doesn’t impact the PMIx level associated with any applications executed by PRRTE - the only requirement is that applications link against a PMIx version that meets the PMIx cross-version requirements (i.e., a minimum of PMIx v2.5).

Detailed changes include:

  • PR #1412: Restrict PRRTE to PMIx v4.2+

  • PR #1410: Remove schizo/hydra component

  • PR #1409: Add new output options to help string

  • PR #1407: Support a broader range of IO output formats

  • PR #1404: Fix a hang in daemon callback

  • PR #1402: Cleanup race condition in daemon reports

  • PR #1399: Multiple commits
    • Allow mapping in overload scenario if bind not specified

    • Ensure rankfile and seq mappers computer local and app ranks

  • PR #1394: Add forwarding of stdin to indirect example

  • PR #1391: Change the default mapping for –bind-to none option to BYSLOT.

Branch from master@f3d4089