Differences
This shows you the differences between two versions of the page.
slurm [2022/08/31 11:08] vodrazka [Basic usage] |
slurm [2022/09/06 15:50] vodrazka [Cluster info] |
||
---|---|---|---|
Line 4: | Line 4: | ||
===== Basic usage ===== | ===== Basic usage ===== | ||
+ | |||
+ | ==== Batch mode ==== | ||
+ | |||
+ | The core idea is that you write a batch script containing the commands you wish to run as well as a list of '' | ||
+ | Then the script is submitted to the cluster with: | ||
+ | |||
+ | < | ||
+ | |||
+ | Here is a simple working example: | ||
+ | |||
+ | < | ||
+ | #!/bin/bash | ||
+ | #SBATCH -J helloWorld | ||
+ | #SBATCH -p cpu-troja | ||
+ | #SBATCH -o helloWorld.out | ||
+ | #SBATCH -e helloWorld.err | ||
+ | |||
+ | # run my job (some executable) | ||
+ | sleep 5 | ||
+ | echo "Hello I am running on cluster!" | ||
+ | </ | ||
+ | |||
+ | After submitting this simple code you should end up with the two files ('' | ||
+ | |||
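The write-then-submit workflow described above can be sketched as follows. This is only an illustration: the file name `helloWorld.sh` is an assumption, and the `sbatch` call is guarded so the snippet also runs on a machine without Slurm.

```shell
#!/bin/bash
# Write the example batch script to a file.
# The file name helloWorld.sh is an assumption made for this sketch.
cat > helloWorld.sh <<'EOF'
#!/bin/bash
#SBATCH -J helloWorld
#SBATCH -p cpu-troja
#SBATCH -o helloWorld.out
#SBATCH -e helloWorld.err

# run my job (some executable)
sleep 5
echo "Hello I am running on cluster!"
EOF

# Submit only when Slurm is available; otherwise just report where the script is.
if command -v sbatch >/dev/null 2>&1; then
    sbatch helloWorld.sh
else
    echo "sbatch not found; script written to helloWorld.sh"
fi
```

Keeping the `#SBATCH` lines at the top of the script, before any command, matters because `sbatch` stops reading directives at the first executable line.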
+ | Here is the list of other useful '' | ||
+ | < | ||
+ | #SBATCH -D / | ||
+ | #SBATCH -N 2 # number of nodes (default 1) | ||
+ | #SBATCH --nodelist=node1, | ||
+ | #SBATCH -c 4 # number of cores/ | ||
+ | #SBATCH --gres=gpu: | ||
+ | #SBATCH --mem=10G | ||
+ | </ | ||
+ | |||
+ | As usual, the complete set of options can be found by typing: | ||
+ | |||
+ | < | ||
+ | man sbatch | ||
+ | </ | ||
+ | |||
+ | ==== Running jobs ==== | ||
+ | |||
+ | To inspect all running jobs on the cluster, use: | ||
+ | |||
+ | < | ||
+ | squeue | ||
+ | </ | ||
+ | |||
+ | filter only jobs of user '' | ||
+ | |||
+ | < | ||
+ | squeue -u linguist | ||
+ | </ | ||
+ | |||
+ | filter only jobs on partition '' | ||
+ | |||
+ | < | ||
+ | squeue -p gpu-ms | ||
+ | </ | ||
+ | |||
+ | filter jobs in specific state (see '' | ||
+ | < | ||
+ | squeue -t RUNNING | ||
+ | </ | ||
+ | |||
+ | filter jobs running on a specific node: | ||
+ | < | ||
+ | squeue -w dll-3gpu1 | ||
+ | </ | ||
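The filters above can be combined with `squeue`'s `-h` (no header) and `-o` (output format) options to build small summaries. The sketch below counts jobs per state; the helper name `summarize_states` is hypothetical, and the `squeue` call is guarded so the snippet also runs where Slurm is not installed.

```shell
#!/bin/bash
# Hypothetical helper: read one job state per line on stdin
# and print "STATE=count" for each distinct state.
summarize_states() {
    sort | uniq -c | awk '{ print $2 "=" $1 }'
}

# On the cluster: %T prints the job state in its long form (e.g. RUNNING).
if command -v squeue >/dev/null 2>&1; then
    squeue -h -u "${USER:-$(id -un)}" -o "%T" | summarize_states
fi
```

The same helper works with any of the other filters, for example by replacing `-u` with `-p` and a partition name.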
+ | |||
+ | ==== Cluster info ==== | ||
+ | |||
+ | The command '' | ||
+ | |||
+ | List available partitions (queues). The default partition is marked with '' | ||
+ | < | ||
+ | sinfo | ||
+ | </ | ||
+ | |||
+ | List types of available GPUs: | ||
+ | < | ||
+ | sinfo -o %G | ||
+ | </ | ||
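The output of `sinfo -o %G` is a list of generic resource strings, typically of the form `gpu:<type>:<count>`. A rough sketch for reducing that to a list of distinct GPU types follows; the helper name `gpu_types` is hypothetical, and it assumes typed GRES entries in that three-field form.

```shell
#!/bin/bash
# Hypothetical helper: read GRES strings on stdin (one or more per line,
# comma-separated) and print each distinct GPU type once.
# Entries without a type (e.g. "gpu:4") and "(null)" are skipped.
gpu_types() {
    tr ',' '\n' | awk -F: '$1 == "gpu" && NF >= 3 { print $2 }' | sort -u
}

# On the cluster: -h suppresses the header line.
if command -v sinfo >/dev/null 2>&1; then
    sinfo -h -o "%G" | gpu_types
fi
```

A type printed here can then be requested in a batch script through the `--gres` option.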
+ | |||
==== Interactive mode ==== | ==== Interactive mode ==== | ||
+ | This mode can be useful for testing. You should use batch mode for any serious computation. | ||
You can use **'' | You can use **'' | ||
Line 19: | Line 108: | ||
* '' | * '' | ||
+ | To see all the available options type: | ||
+ | < | ||