Differences
This shows you the differences between two versions of the page.
gpu [2017/06/26 16:51] kocmanek [Using cluster] |
gpu [2017/07/15 18:03] kocmanek [Performance tests] |
||
---|---|---|---|
Line 6: | Line 6: | ||
| machine | | machine | ||
- | | titan | GeForce GTX 1080 Ti; cc6.1 | 1 | 12 GB | | | + | | titan | GeForce GTX 1080 Ti; cc6.1 | 1 | 11 GB |
- | | titan-gpu | + | | titan-gpu |
- | | twister1; twister2; kronos | Tesla K40c; cc3.5 | 1 | 12 GB | | | + | | twister1; twister2; kronos | Tesla K40c; cc3.5 | 1 | 12 GB |
- | | iridium | + | | iridium |
- | | victoria; arc | GeForce GT 630; cc3.0 | 1 | 2 GB | + | | victoria; arc | GeForce GT 630; cc3.0 | 1 | 2 GB | desktop machine | |
- | | athena | + | | athena |
- | | dll1; dll2 | GeForce GTX 1080; cc6.1 | 8 | 8 GB each core | | | + | | dll1; dll2 | GeForce GTX 1080; cc6.1 | 8 | 8 GB each core | | |
+ | | dll3; dll4; dll5 | GeForce GTX 1080 Ti; cc6.1 | 10 | 11 GB each core | | | ||
not used at the moment: GeForce GTX 570 (from twister2) | not used at the moment: GeForce GTX 570 (from twister2) | ||
Line 35: | Line 36: | ||
* Ondřej Plátek - granted (2015) | * Ondřej Plátek - granted (2015) | ||
* Jan Hajič jr. - granted (early 2016) | * Jan Hajič jr. - granted (early 2016) | ||
- | * Jindra Helcl - planning to apply (fall 2016) | ||
Line 113: | Line 113: | ||
| athena | | athena | ||
| dll2 | (2 GPU) GeForce GTX 1080; cc6.1 | | | dll2 | (2 GPU) GeForce GTX 1080; cc6.1 | | ||
- | | titan | GeForce GTX 1080 Ti | 11:41:08 | | | + | | titan | GeForce GTX 1080 Ti | 10:45:11 | (new result with correct CUDA version) |
| dll1 | (2 GPU) GeForce GTX 1080; cc6.1 | | | dll1 | (2 GPU) GeForce GTX 1080; cc6.1 | | ||
| dll2 | (2 GPU) GeForce GTX 1080; cc6.1 | | | dll2 | (2 GPU) GeForce GTX 1080; cc6.1 | | ||
Line 129: | Line 129: | ||
+ | === Better Benchmark === | ||
+ | |||
+ | The previous benchmark compares only the speed of the processing units within the GPUs and does not take memory size into account. Therefore I have conducted another benchmark in which, for each graphics card, batch_size was increased as far as possible while the model still fit into GPU memory. This way the results should be more representative of the power of each GPU. | ||
+ | |||
+ | GPU | GPU RAM | Walltime | Batch size | Machine | | ||
+ | Tesla K40c; cc3.5 | 12 GB | ||
+ | GeForce GTX 1080 Ti; cc6.1 | 11 GB | ||
+ | GeForce GTX 1080; cc6.1 | 8 GB | | | | | ||
+ | GeForce GTX 1080; cc6.1 | 8 GB | | | Athena (without virtualization) | | ||
+ | GeForce GTX Titan Z; cc3.5 | 6 GB | | | | | ||
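
The page does not say how the largest usable batch_size was found for each card. One possible way to automate that search is sketched below, as an illustration only and not necessarily the procedure used for the table above. The function `find_max_batch_size` and the callable `fits` are hypothetical names: `fits(n)` is assumed to run one training step with batch size `n` and return False when the GPU runs out of memory. The sketch also assumes that if one batch size fits, every smaller one fits as well.

```python
from typing import Callable


def find_max_batch_size(fits: Callable[[int], bool], upper_limit: int = 1 << 20) -> int:
    """Return the largest batch size in [1, upper_limit] for which fits() is True.

    `fits` is a hypothetical probe supplied by the caller, e.g. a function that
    runs one training step and catches the framework's out-of-memory error.
    Returns 0 if even a batch size of 1 does not fit.
    """
    if not fits(1):
        return 0

    low = 1   # largest batch size known to fit
    high = 2  # next candidate to try

    # Phase 1: keep doubling until a size fails or the limit is exceeded.
    while high <= upper_limit and fits(high):
        low = high
        high *= 2

    # If the limit was exceeded, treat limit + 1 as the first disallowed size.
    if high > upper_limit:
        high = upper_limit + 1

    # Phase 2: binary search between the last success and the first failure.
    while high - low > 1:
        mid = (low + high) // 2
        if fits(mid):
            low = mid
        else:
            high = mid
    return low


if __name__ == "__main__":
    # Simulated probe for demonstration only: pretend anything above 37 fails.
    print(find_max_batch_size(lambda n: n <= 37))
```

The doubling phase keeps the number of probes logarithmic in the final batch size, which matters when each probe is a real training step on the GPU.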
===== Links ===== | ===== Links ===== |