Commit
Increase MaxConnections and MaxStartups in sshd config
When more than 10 data movement requests come in at once for a particular rabbit, the default sshd configuration (MaxStartups defaults to 10:30:100) starts refusing new connections: with 10 unauthenticated connections pending, sshd drops 30% of new attempts, the drop rate rises linearly from there, and every attempt is dropped once 100 are pending. This causes data movement requests to fail, since any concurrency over 10 causes sshd to close connections opened by mpirun.

This change increases that value to be able to handle the max theoretical load for a particular rabbit. This image runs as 1 pod per rabbit node (i.e. nnf-dm-worker-*), and each rabbit node supports 16 compute nodes of 192 cores each. Each core on a compute node could be creating a data movement request:

16 * 192 = 3072

Bump it up to the next power of 2 for good measure -> 4096.

Signed-off-by: Blake Devcich <blake.devcich@hpe.com>
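The sizing arithmetic in the commit message, and the effect of the default 10:30:100 setting, can be sketched in a few lines of Python. This is an illustrative model only, not code from the commit: the function names and constants are hypothetical, the drop curve follows the linear ramp that the sshd_config man page describes for MaxStartups start:rate:full, and the last line assumes the raised limit behaves as a single threshold of 4096.

```python
def next_power_of_two(n: int) -> int:
    """Return the smallest power of two that is >= n (for n >= 1)."""
    return 1 << (n - 1).bit_length()


def drop_probability(pending: int, start: int = 10, rate: int = 30,
                     full: int = 100) -> float:
    """Model the chance that sshd refuses a new connection.

    `pending` is the number of unauthenticated connections already open.
    Defaults mirror the stock MaxStartups value 10:30:100.
    """
    if pending < start:
        return 0.0
    if pending >= full:
        return 1.0
    # Linear ramp from rate% at `start` up to 100% at `full`.
    return (rate + (100 - rate) * (pending - start) / (full - start)) / 100


# Figures taken from the commit message.
COMPUTE_NODES_PER_RABBIT = 16
CORES_PER_COMPUTE_NODE = 192

max_requests = COMPUTE_NODES_PER_RABBIT * CORES_PER_COMPUTE_NODE
limit = next_power_of_two(max_requests)

print(max_requests, limit)      # 3072 4096
print(drop_probability(9))      # 0.0 (below the default start threshold)
print(drop_probability(100))    # 1.0 (default config refuses everything)

# Assumption: the new setting acts as one threshold, start == full == limit.
print(drop_probability(max_requests, start=limit, full=limit))  # 0.0
```

Under this model, the worst-case 3072 simultaneous requests stay below the raised threshold, so none of them would be refused.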