Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
slurm [2022/09/06 15:35] vodrazka [Running jobs] |
slurm [2022/10/25 14:01] vodrazka [ÚFAL Grid Engine (LRC)] |
||
---|---|---|---|
Line 1: | Line 1: | ||
====== ÚFAL Grid Engine (LRC) ====== | ====== ÚFAL Grid Engine (LRC) ====== | ||
- | LRC (Linguistic Research Cluster) is a name of ÚFAL' | + | LRC (Linguistic Research Cluster) is the name of ÚFAL' |
+ | Currently there are following partitions (queues) available for computing: | ||
+ | |||
+ | ===== Node list by partitions ===== | ||
+ | |||
+ | ==== cpu-troja ==== | ||
+ | |||
+ | ==== cpu-ms ==== | ||
+ | |||
+ | ==== gpu-troja ==== | ||
+ | |||
+ | ==== gpu-ms ==== | ||
+ | |||
+ | In order to submit a job you need to login to one of the head nodes: | ||
+ | |||
+ | | ||
+ | | ||
===== Basic usage ===== | ===== Basic usage ===== | ||
Line 17: | Line 33: | ||
#!/bin/bash | #!/bin/bash | ||
#SBATCH -J helloWorld | #SBATCH -J helloWorld | ||
- | #SBATCH -p cpu-troja | + | #SBATCH -p cpu-troja |
#SBATCH -o helloWorld.out | #SBATCH -o helloWorld.out | ||
#SBATCH -e helloWorld.err | #SBATCH -e helloWorld.err | ||
Line 33: | Line 49: | ||
#SBATCH -N 2 # number of nodes (default 1) | #SBATCH -N 2 # number of nodes (default 1) | ||
#SBATCH --nodelist=node1, | #SBATCH --nodelist=node1, | ||
- | #SBATCH -c 4 # number of cores/ | + | #SBATCH --cpus-per-task=4 |
#SBATCH --gres=gpu: | #SBATCH --gres=gpu: | ||
#SBATCH --mem=10G | #SBATCH --mem=10G | ||
+ | </ | ||
+ | |||
+ | If you need you can have slurm report to you: | ||
+ | |||
+ | < | ||
+ | #SBATCH --mail-type=begin | ||
+ | #SBATCH --mail-type=end | ||
+ | #SBATCH --mail-type=fail | ||
+ | #SBATCH --mail-user=< | ||
</ | </ | ||
Line 76: | Line 101: | ||
==== Cluster info ==== | ==== Cluster info ==== | ||
- | The command '' | + | The command '' |
- | List types of available | + | List available |
< | < | ||
- | sinfo -o %G | + | sinfo |
</ | </ | ||
+ | List detailed info about nodes: | ||
+ | < | ||
+ | sinfo -l -N | ||
+ | </ | ||
+ | |||
+ | List nodes with some custom format info: | ||
+ | < | ||
+ | sinfo -N -o "%N %P %.11T %.15f" | ||
+ | </ | ||
+ | |||
+ | === CPU core allocation === | ||
+ | |||
+ | The minimal computing resource in SLURM is one CPU core. However, CPU count advertised by SLURM corresponds to the number of CPU threads. | ||
+ | If you ask for 1 CPU core with < | ||
+ | |||
+ | For example '' | ||
+ | |||
+ | < | ||
+ | $> scontrol show node dll-8gpu1 | ||
+ | $ scontrol show node dll-8gpu1 | ||
+ | NodeName=dll-8gpu1 Arch=x86_64 CoresPerSocket=16 | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | </ | ||
+ | In the example above you can see comments at all lines relevant to CPU allocation. | ||
Line 97: | Line 162: | ||
There are many more parameters available to use. For example: | There are many more parameters available to use. For example: | ||
- | < | + | **To get an interactive CPU job with 64GB of reserved memory:** |
+ | < | ||
- | Where: | + | |
- | | + | |
* '' | * '' | ||
+ | |||
+ | **To get interactive job with a single GPU of any kind:** | ||
+ | < | ||
+ | * '' | ||
+ | * '' | ||
+ | |||
+ | < | ||
+ | * '' | ||
+ | * '' | ||
+ | * '' | ||
+ | |||
+ | < | ||
+ | * '' | ||
To see all the available options type: | To see all the available options type: | ||
< | < | ||
+ | |||
+ | ===== See also ===== | ||
+ | |||
+ | https:// | ||
+ | |||