Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
gpu [2019/10/18 09:57] vodrazka [Servers with GPU units] |
gpu [2021/07/02 16:49] ptacek |
||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== GPU at ÚFAL ====== | + | ====== GPU at ÚFAL. ====== |
This page summarizes which UFAL servers have some GPU card, and suggests basic diagnostic commands, paths to installed tools, etc., simply everything necessary at the very beginning of using GPUs for experiments. | This page summarizes which UFAL servers have some GPU card, and suggests basic diagnostic commands, paths to installed tools, etc., simply everything necessary at the very beginning of using GPUs for experiments. | ||
Line 7: | Line 7: | ||
| machine | GPU type | GPU driver version | [[https:// | | machine | GPU type | GPU driver version | [[https:// | ||
- | | dll3 | GeForce GTX 1080 Ti | 396.24 | 6.1 | 10 | 11.0 | 248.0 | | + | | dll3 | GeForce GTX 1080 Ti | 455.23.05 | 6.1 | 10 | 11 | 248.8 | |
- | | dll4 | GeForce GTX 1080 Ti | 396.24 | 6.1 | 10 | 11.0 | 248.0 | | + | | dll4 | GeForce GTX 1080 Ti | 455.23.05 | 6.1 | 10 | 11 | 248.8 | |
- | | dll5 | GeForce GTX 1080 Ti | 396.24 | 6.1 | 10 | 11.0 | 248.0 | | + | | dll5 | GeForce GTX 1080 Ti | 455.23.05 | 6.1 | 10 | 11 | 248.8 | |
- | | dll6 | GeForce GTX 1080 Ti | 396.24 | 6.1 | | + | | dll6 | GeForce GTX 1080 Ti | 455.23.05 | 6.1 | |
- | | dll7 | GeForce RTX 2080 Ti | 418.39 | 7.5 | 7 | 11.0 | | + | | dll7 | GeForce RTX 2080 Ti | 455.23.05 | 7.5 | |
- | | kronos | + | | dll9 | |
- | | + | | dll10 | GeForce RTX 3090 | |
GPU cluster '' | GPU cluster '' | ||
| machine | GPU type | GPU driver version | [[https:// | | machine | GPU type | GPU driver version | [[https:// | ||
- | | tdll1 | Quadro P5000 | 410.48 | 6.1 | 8 | 17 | 245 | | + | | tdll1 | Quadro P5000 | 455.23.05 | 6.1 | 8 | 17 | 245.0 | |
- | | tdll2 | Quadro P5000 | 410.48 | 6.1 | 8 | 17 | 245 | | + | | tdll2 | Quadro P5000 | 455.23.05 | 6.1 | 8 | 17 | 245.0 | |
- | | tdll3 | Quadro P5000 | 410.48 | 6.1 | 8 | 17 | 245 | | + | | tdll3 | Quadro P5000 | 455.23.05 | 6.1 | 8 | 17 | 245.0 | |
- | | tdll4 | Quadro P5000 | 410.48 | 6.1 | 8 | 17 | 245 | | + | | tdll4 | Quadro P5000 | 455.23.05 | 6.1 | 8 | 17 | 245.0 | |
- | | tdll5 | Quadro P5000 | 410.48 | 6.1 | 8 | 17 | 245 | | + | | tdll5 | Quadro P5000 | 455.23.05 | 6.1 | 8 | 17 | 245.0 | |
Desktop machines: | Desktop machines: | ||
Line 38: | Line 38: | ||
* All the rules from [[:Grid]] apply, even more strictly than for CPU because there are too many GPU users and not as many GPUs available. So as a reminder: always use GPUs via '' | * All the rules from [[:Grid]] apply, even more strictly than for CPU because there are too many GPU users and not as many GPUs available. So as a reminder: always use GPUs via '' | ||
* **Note that you need to use '' | * **Note that you need to use '' | ||
- | * Always specify the number of GPU cards (e.g. '' | + | * Always specify the number of GPU cards (e.g. '' |
- | * If you need more than one GPU card (on a single machine), always require as many CPU cores ('' | + | * If you need more than one GPU card (on a single machine), always require |
* For interactive jobs, you can use '' | * For interactive jobs, you can use '' | ||
* Note that the dll machines have typically 10 cards, but " | * Note that the dll machines have typically 10 cards, but " | ||
Line 53: | Line 53: | ||
You need to set library path from your '' | You need to set library path from your '' | ||
- | | + | |
- | | + | |
CUDA_DIR_OPT=/ | CUDA_DIR_OPT=/ | ||
if [ -d " | if [ -d " | ||
Line 65: | Line 65: | ||
fi | fi | ||
- | * When not using Theano, just Tensorflow this can be simplified to '' | + | * When not using Theano, just Tensorflow this can be simplified to '' |
* Note that the '' | * Note that the '' | ||
Line 111: | Line 111: | ||
watch nvidia-smi | watch nvidia-smi | ||
# For monitoring GPU activity in a separate terminal (thanks to Jindrich Libovicky for this!) | # For monitoring GPU activity in a separate terminal (thanks to Jindrich Libovicky for this!) | ||
+ | # You can also use nvidia-smi -l TIME | ||
nvcc --version | nvcc --version | ||
# this should tell CUDA version | # this should tell CUDA version |