Bascule de noeuds sous SGE vers SLURM

Dans notre cas nous devons basculer 3 noeuds actuellement sous gestion SGE vers SLURM, les noeuds à basculer sont mynode05, mynode06, mynode07 et mynode08.

Administration sous SGE

Affichage

On affiche la liste des noeuds sous gestion SGE :

# qstat -f
queuename qtype resv/used/tot. load_avg arch state
-----------------------------------------------------------------------------
[email protected] BIP 0/13/40 10.72 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/12/40 10.00 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/15/40 9.17 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/14/37 10.98 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/11/40 10.54 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/10/40 9.81 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/0/40 0.03 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/0/40 0.02 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/0/40 0.02 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/0/40 0.02 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/0/3 10.98 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/0/3 10.98 lx-amd64
-----------------------------------------------------------------------------

Désactivation

On désactive les noeuds 05 à 07 avec qmod et l’option -d (--disable) :

# qmod -d [email protected]
[email protected] changed state of "[email protected]" (disabled)
# qmod -d [email protected]
[email protected] changed state of "[email protected]" (disabled)
# qmod -d [email protected]
[email protected] changed state of "[email protected]" (disabled)
# qmod -d [email protected]
[email protected] changed state of "[email protected]" (disabled)

Vérification

Vérification du statut des noeuds désactivés, ils doivent comporter le states à la valeur d :

# qstat -f
queuename qtype resv/used/tot. load_avg arch state
-----------------------------------------------------------------------------
[email protected] BIP 0/13/40 10.72 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/12/40 10.00 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/15/40 9.17 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/14/37 10.98 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/11/40 10.54 lx-amd64 d
-----------------------------------------------------------------------------
[email protected] BIP 0/10/40 9.81 lx-amd64 d
-----------------------------------------------------------------------------
[email protected] BIP 0/0/40 0.03 lx-amd64 d
-----------------------------------------------------------------------------
[email protected] BIP 0/0/40 0.02 lx-amd64 d
-----------------------------------------------------------------------------
[email protected] BIP 0/0/40 0.02 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/0/40 0.02 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/0/3 10.98 lx-amd64
-----------------------------------------------------------------------------
[email protected] BIP 0/0/3 10.98 lx-amd64
-----------------------------------------------------------------------------

Administration sous SLURM

Vérification

Sous SLURM, les noeuds 5 à 8 sont maintenant en statut drain (avant ils étaient en down car ils étaient aussi sous gestion SGE) :

# sinfo
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
compute* up infinite 6 down* mynode[01-04,09-10]
compute* up infinite 4 drain mynode[05-08]
compute* up infinite 2 idle mynode[11-12]

Activation

On les passe maintenant sous SLURM, en changeant le statut drain en idle :

# scontrol update node=mynode[05-08] state=idle reason="Bascule SGE vers SLURM"

Vérification :

# sinfo
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
compute* up infinite 6 down* mynode[01-04,09-10]
compute* up infinite 5 idle mynode[05-08,11-12]

Documentation

RTFM : SGE
RTFM : SLURM

> Partager <