Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
gpu [2017/07/19 16:15] kocmanek [Performance tests] |
gpu [2017/10/11 11:46] popel [Basic commands] |
||
---|---|---|---|
Line 4: | Line 4: | ||
===== Servers with GPU units ===== | ===== Servers with GPU units ===== | ||
+ | GPU cluster '' | ||
- | | machine | + | | machine |
- | | titan | + | | iridium |
- | | titan-gpu | + | | titan-gpu |
- | | twister1; twister2; kronos | Tesla K40c; cc3.5 | 1 | 12 GB | + | | twister1; twister2; kronos | Tesla K40c |
- | | iridium | + | | dll1; dll2 | GeForce GTX 1080 |
- | | victoria; arc | GeForce GT 630; cc3.0 | 1 | 2 GB | desktop machine | + | | titan |
- | | athena | + | | dll3; dll4; dll5 | GeForce GTX 1080 Ti | |
- | | dll1; dll2 | GeForce GTX 1080; cc6.1 | 8 | + | | dll6 | GeForce GTX 1080 Ti | |
- | | dll3; dll4; dll5 | GeForce GTX 1080 Ti; cc6.1 | 10 | 11 GB each core | | | + | |
- | not used at the moment: GeForce GTX 570 (from twister2) | + | Desktop machines: |
+ | | machine | ||
+ | | victoria; arc | GeForce GT 630 | cc3.0 | 1 | 2 GB | desktop machine | | ||
+ | | athena | ||
+ | |||
+ | Not used at the moment: GeForce GTX 570 (from twister2) | ||
All machines have CUDA8.0 and should support both Theano and TensorFlow. | All machines have CUDA8.0 and should support both Theano and TensorFlow. | ||
- | Summary of future plans: | + | === Disk space === |
- | * Current | + | All the GPU machines are at Malá Strana (not at Troja), so you should not use ''/ |
- | * The old Quadro K2000 we have is a much more low end piece, so we can't test is in Troja. | + | - '' |
- | * There is MetaCentrum which also has GPUs, so testing can be done there. | + | - '' |
- | * It is impossible (wasteful in terms of space and forbidden by a dean regulation) to put non-rack machines to our servers rooms. So we won't be buying GeForce GTX 1080 (~20000CZK, out of stock now), for a non-rack machine since we most likely don't have any available. | + | - ''/ |
- | * Yes, there are grant applications under review which include rack machines with GPUs, e.g. 5x2 or something like that; more will be known in 2017. | + | - ''/ |
=== Individual acquisitions: | === Individual acquisitions: | ||
Line 43: | Line 47: | ||
In this section will be explained how to use cluster properly. | In this section will be explained how to use cluster properly. | ||
+ | |||
+ | ==== Set-up CUDA and CUDNN ==== | ||
+ | |||
+ | You can add following command into your ~/.bashrc | ||
+ | |||
+ | CUDNN_version=6.0 | ||
+ | CUDA_version=8.0 | ||
+ | CUDA_DIR_OPT=/ | ||
+ | if [ -d " | ||
+ | CUDA_DIR=$CUDA_DIR_OPT | ||
+ | export CUDA_HOME=$CUDA_DIR | ||
+ | export THEANO_FLAGS=" | ||
+ | export PATH=$PATH: | ||
+ | export LD_LIBRARY_PATH=$LD_LIBRARY_PATH: | ||
+ | export CPATH=$CUDA_DIR/ | ||
+ | fi | ||
+ | |||
==== TensorFlow Environment ==== | ==== TensorFlow Environment ==== | ||
Line 95: | Line 116: | ||
/ | / | ||
# shows CUDA capability etc. | # shows CUDA capability etc. | ||
+ | ssh dll1; ~popel/ | ||
+ | # who occupies which card on a given machine | ||
| | ||
=== Select GPU device === | === Select GPU device === | ||
Line 108: | Line 131: | ||
* [[http:// | * [[http:// | ||
- | In the following table is the experiment conducted by Tom Kocmi. You can replicate experiment: / | + | In the following table is the experiment conducted by Tom Kocmi. You can replicate experiment: / |
| machine | Setup; CPU/GPU; [[https:// | | machine | Setup; CPU/GPU; [[https:// | ||
| athena | | athena | ||
- | | dll2 | + | | dll2 | GeForce GTX 1080; cc6.1 | |
| titan | GeForce GTX 1080 Ti | | | titan | GeForce GTX 1080 Ti | | ||
| dll1 | (2 GPU) GeForce GTX 1080; cc6.1 | | | dll1 | (2 GPU) GeForce GTX 1080; cc6.1 | | ||
- | | dll2 | + | | dll2 | GeForce GTX 1080; cc6.1 | |
| titan-gpu | | titan-gpu | ||
| kronos-dev | Tesla K40c; cc3.5 | | | kronos-dev | Tesla K40c; cc3.5 | |