Changes


Nefelibata

673 bytes added, 20:08, February 16, 2023
Job scheduling
Two nodes equipped with NVIDIA A30 GPUs have recently been installed.
To request an interactive session reserving one entire GPU node for six hours:

 salloc --time=6:0:0 --gres=gpu:a30:1 --cpus-per-task=64 --mem=0

=== Known issues ===

<b>Interactive Slurm jobs</b> (the two <code>salloc ...</code> examples above) fail with:

 srun: error: mpi/pmix_v3: init: (null) [0]: mpi_pmix.c:139: pmi/pmix: can not load PMIx library
 srun: error: Couldn't load specified plugin name for mpi/pmix_v3: Plugin init() callback failed
 srun: error: cannot create mpi context for mpi/pmix_v3
 srun: error: invalid MPI type 'pmix_v3', --mpi=list for acceptable types

We are investigating causes and solutions. (2023-02-16)

To run a <b>multi-host MPI</b> calculation, add

 export UCX_NET_DEVICES=ib0

before calling <code>mpirun</code>.
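As a rough illustration of where the <code>UCX_NET_DEVICES</code> setting goes, the sketch below exports the variable and only then calls <code>mpirun</code>. The program name <code>./my_mpi_program</code> and the process count are hypothetical placeholders, not part of this cluster's documentation. The guard around <code>mpirun</code> is only there so that the snippet also runs on a machine without MPI installed.

```shell
#!/bin/bash
# Minimal sketch, assuming the InfiniBand interface is named ib0 (as stated above).
# MPI_PROGRAM and NP are hypothetical placeholders; adjust them to your own job.

# Restrict UCX to the InfiniBand device before any MPI process is started.
export UCX_NET_DEVICES=ib0

MPI_PROGRAM="${MPI_PROGRAM:-./my_mpi_program}"  # hypothetical executable
NP="${NP:-4}"                                   # hypothetical process count

if command -v mpirun >/dev/null 2>&1 && [ -x "$MPI_PROGRAM" ]; then
    # The exported variable is inherited by mpirun and the processes it launches.
    mpirun -np "$NP" "$MPI_PROGRAM"
else
    echo "mpirun or $MPI_PROGRAM not available; UCX_NET_DEVICES=$UCX_NET_DEVICES is set" >&2
fi
```

The same <code>export</code> line can be placed in a batch script, provided it appears before the <code>mpirun</code> call.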
