Task07 Denis Sokolov ITMO#1070
Closed
DenChika wants to merge 1 commit into
Collaborator
✅ Test results for PR #1070
Test logs
=== STATUS: Programs executed successfully: main_sparse_matrix_multiply ===
=== main_sparse_matrix_multiply stdout (exit code: -11 (segfault after execution)) ===
Found 1 GPUs in 8.57898 sec (CUDA: 0.115557 sec, OpenCL: 0.706491 sec, Vulkan: 7.75687 sec)
Available devices:
Device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb.
Using device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb.
Using OpenCL API...
Evaluating CSR matrix nrows x ncols=1000000x1000000 with values in range [0; 1000]
____________________________________________________________________________________________
Evaluating with NNZ per row in range [32; 32], median NNZ per row=32, total NNZ=32000000...
CPU (multi-threaded via OpenMP) finished in 0.0432619 sec
CPU effective bandwidth: 2.92585 GB/s (739.182 uint millions/s)
Kernels compilation done in 3.49236 seconds
GPU SpMV (sparse matrix-vector multiplication) times (in seconds) - 10 values (min=0.0130587 10%=0.0130666 median=0.0292415 90%=3.52165 max=3.52165)
GPU SpMV median effective VRAM bandwidth: 4.33151 GB/s (1094.34 uint millions/s)
____________________________________________________________________________________________
Evaluating with NNZ per row in range [128; 128], median NNZ per row=128, total NNZ=128000000...
CPU (multi-threaded via OpenMP) finished in 0.167461 sec
CPU effective bandwidth: 2.89139 GB/s (764.205 uint millions/s)
GPU SpMV (sparse matrix-vector multiplication) times (in seconds) - 10 values (min=0.0189664 10%=0.0189897 median=0.0192241 90%=0.0325111 max=0.0325111)
GPU SpMV median effective VRAM bandwidth: 25.1916 GB/s (6658.3 uint millions/s)
____________________________________________________________________________________________
Evaluating with NNZ per row in range [1; 32], median NNZ per row=17, total NNZ=16499998...
CPU (multi-threaded via OpenMP) finished in 0.0237571 sec
CPU effective bandwidth: 2.89746 GB/s (672.649 uint millions/s)
GPU SpMV (sparse matrix-vector multiplication) times (in seconds) - 10 values (min=0.01152 10%=0.0115225 median=0.0115239 90%=0.0115559 max=0.0115559)
GPU SpMV median effective VRAM bandwidth: 5.98045 GB/s (1388.42 uint millions/s)
____________________________________________________________________________________________
Evaluating with NNZ per row in range [1; 128], median NNZ per row=64, total NNZ=64499934...
CPU (multi-threaded via OpenMP) finished in 0.0830694 sec
CPU effective bandwidth: 2.98108 GB/s (770.134 uint millions/s)
GPU SpMV (sparse matrix-vector multiplication) times (in seconds) - 10 values (min=0.0144483 10%=0.0145255 median=0.030362 90%=0.030388 max=0.030388)
GPU SpMV median effective VRAM bandwidth: 8.15927 GB/s (2107.9 uint millions/s)
____________________________________________________________________________________________
Evaluating with NNZ per row in range [32; 128], median NNZ per row=80, total NNZ=80011495...
CPU (multi-threaded via OpenMP) finished in 0.104959 sec
CPU effective bandwidth: 2.90984 GB/s (761.936 uint millions/s)
GPU SpMV (sparse matrix-vector multiplication) times (in seconds) - 10 values (min=0.0154801 10%=0.0156685 median=0.0308816 90%=0.0309165 max=0.0309165)
GPU SpMV median effective VRAM bandwidth: 9.89317 GB/s (2590.54 uint millions/s)
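For context on what the benchmark above measures: the log times a CSR sparse matrix-vector product (SpMV) over unsigned integer values. The sketch below is a minimal sequential CPU reference of that operation, not the code from this PR. The `CsrMatrix` struct, its field names, and `spmv_csr_reference` are hypothetical names chosen for illustration, and the use of 32-bit unsigned values is an assumption based on the "uint" unit in the log.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical CSR container (illustrative names, not taken from the PR).
struct CsrMatrix {
    std::size_t nrows = 0;
    std::vector<std::uint32_t> row_offsets; // nrows + 1 entries; row r owns [row_offsets[r], row_offsets[r + 1])
    std::vector<std::uint32_t> columns;     // column index of each stored non-zero
    std::vector<std::uint32_t> values;      // value of each stored non-zero
};

// Sequential reference: y[row] = sum over the row's non-zeros of value * x[column].
// Unsigned arithmetic wraps on overflow, which is well-defined in C++.
std::vector<std::uint32_t> spmv_csr_reference(const CsrMatrix& m,
                                              const std::vector<std::uint32_t>& x) {
    std::vector<std::uint32_t> y(m.nrows, 0);
    for (std::size_t row = 0; row < m.nrows; ++row) {
        std::uint32_t acc = 0;
        for (std::uint32_t i = m.row_offsets[row]; i < m.row_offsets[row + 1]; ++i) {
            acc += m.values[i] * x[m.columns[i]];
        }
        y[row] = acc;
    }
    return y;
}
```

A reference like this is typically used to check GPU output. Rows with very different NNZ counts, such as the [1; 128] case in the log, are where a one-work-item-per-row GPU mapping tends to become unbalanced.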
Member
4/5 points 👍
Local output
GitHub CI output