Both sides previous revision
Previous revision
Next revision
|
Previous revision
Next revision
Both sides next revision
|
slurm [2022/09/06 15:35] vodrazka [Running jobs] |
slurm [2022/09/08 13:46] vodrazka [Batch mode] |
#!/bin/bash | #!/bin/bash |
#SBATCH -J helloWorld # name of job | #SBATCH -J helloWorld # name of job |
#SBATCH -p cpu-troja # name of partition or queue | #SBATCH -p cpu-troja # name of partition or queue (if not specified default partition is used) |
#SBATCH -o helloWorld.out # name of output file for this submission script | #SBATCH -o helloWorld.out # name of output file for this submission script |
#SBATCH -e helloWorld.err # name of error file for this submission script | #SBATCH -e helloWorld.err # name of error file for this submission script |
#SBATCH --gres=gpu:1 # number of GPUs to request (default 0) | #SBATCH --gres=gpu:1 # number of GPUs to request (default 0) |
#SBATCH --mem=10G # request 10 gigabytes memory (per node, default depends on node) | #SBATCH --mem=10G # request 10 gigabytes memory (per node, default depends on node) |
| </code> |
| |
| If you need you can have slurm report to you: |
| |
| <code> |
| #SBATCH --mail-type=begin # send email when job begins |
| #SBATCH --mail-type=end # send email when job ends |
| #SBATCH --mail-type=fail # send email if job fails |
| #SBATCH --mail-user=<YourUFALEmailAccount> |
</code> | </code> |
| |
==== Cluster info ==== | ==== Cluster info ==== |
| |
The command ''sinfo'' can give you usefull information about nodes available in the cluster. Here is a short list of some usefull examples: | The command ''sinfo'' can give you useful information about nodes available in the cluster. Here is a short list of some examples: |
| |
List types of available GPUs: | List available partitions(queues). The default partition is marked with ''*'': |
<code> | <code> |
sinfo -o %G | sinfo |
| </code> |
| |
| List detailed info about nodes: |
| <code> |
| sinfo -l -N |
| </code> |
| |
| List nodes with some custom format info: |
| <code> |
| sinfo -N -o "%N %P %.11T %.15f" |
</code> | </code> |
| |
<code>srun -p cpu-troja --mem=64G --pty bash</code> | <code>srun -p cpu-troja --mem=64G --pty bash</code> |
| |
Where: | * ''-p cpu-troja'' explicitly requires partition ''cpu-troja''. If not specified slurm will use default partition. |
* ''-p cpu-troja'' explicitly requires partition ''cpu-troja'' | |
* ''--mem=64G'' requires 64G of memory for the job | * ''--mem=64G'' requires 64G of memory for the job |
| |
| <code>srun -p gpu-troja,gpu-ms --nodelist=tdll-3gpu1 --mem=64G --gres=gpu:2 --pty bash</code> |
| * ''-p gpu-troja,gpu-ms'' require only nodes from these two partitions |
| * ''--nodelist=tdll-3gpu1'' explicitly requires one specific node |
| * ''--gres=gpu:2'' requires 2 GPUs |
| |
| <code>srun -p gpu-troja --constraint="gpuram44G|gpuram39G" --mem=64G --gres=gpu:2 --pty bash</code> |
| * ''--constraint="gpuram44G|gpuram39G"'' only consider nodes that have either ''gpuram44G'' or ''gpuram39G'' feature defined |
| |
To see all the available options type: | To see all the available options type: |