Wiki
Clone wikiConjugateGradient / Result
Result
revision | CPU | GPU | Memory | Compiler | Solver | PreProcess | Solve | PostProcess | note |
---|---|---|---|---|---|---|---|---|---|
CPU | |||||||||
df3bb61 | Intel Core i7 920 (Nehalem) | - | DDR3-1333 | gcc 4.9.2 | SequentialNative | 10.697 | 2503.610 | 1.803 | |
df3bb61 | Intel Core i7 920 (Nehalem) | - | DDR3-1333 | gcc 4.9.2 | OpenMPNative(16) | 13.112 | 2013.830 | 1.820 | |
27d26c6 | Intel Core i7 4790 (Haswell) | - | DDR3-800 | Visual Studio 2015 Update 2 | SequentialNative | 8.063 | 1907.520 | 1.490 | |
27d26c6 | Intel Core i7 4790 (Haswell) | - | DDR3-800 | Visual Studio 2015 Update 2 | OpenMPNative(8) | 8.899 | 1381.230 | 1.519 | |
27d26c6 | Intel Core i7 4790 (Haswell) | - | DDR3-800 | Visual Studio 2015 Update 2 | SequentialAvx2 | 208.016 | 2089.530 | 1.088 | |
27d26c6 | Intel Core i7 4790 (Haswell) | - | DDR3-800 | Visual Studio 2015 Update 2 | OpenMPAvx2(8) | 227.840 | 1477.360 | 0.975 | |
037b553 | Intel Core i7 5820K (Haswell) | - | DDR4-2133 | gcc 4.9.0 | SequentialNative | 5.799 | 1813.08 | 0.903 | |
037b553 | Intel Core i7 5820K (Haswell) | - | DDR4-2133 | gcc 4.9.0 | OpenMPNative(12) | 11.169 | 888.972 | 0.959 | |
df3bb61 | Intel Xeon E5-2623 v3 (Haswell) | - | DDR4-2133 | gcc 5.4.0 | SequentialNative | 9.007 | 1665.050 | 1.114 | |
df3bb61 | Intel Xeon E5-2623 v3 (Haswell) | - | DDR4-2133 | gcc 5.4.0 | OpenMPNative(16) | 14.558 | 3812.690 | 1.914 | |
99e9942 | Intel Xeon E5-2630L v3 (Haswell) | - | DDR4-2133 | clang 3.8.0 | SequentialNative | 9.237 | 1825.770 | 1.235 | |
99e9942 | Intel Xeon E5-2630L v3 (Haswell) | - | DDR4-2133 | clang 3.8.0 | OpenMPNative(32) | 13.415 | 1765.790 | 1.475 | |
99e9942 | Intel Xeon E5-2630L v3 (Haswell) | - | DDR4-2133 | clang 3.8.0 | SequentialAvx2 | 811.419 | 2225.010 | 1.146 | |
99e9942 | Intel Xeon E5-2630L v3 (Haswell) | - | DDR4-2133 | clang 3.8.0 | OpenMPAvx2(32) | 759.796 | 2108.790 | 1.218 | |
99e9942 | Intel Xeon E5-2630L v3 (Haswell) | - | DDR4-2133 | gcc 6.2.0 | SequentialNative | 9.217 | 2740.540 | 1.029 | |
99e9942 | Intel Xeon E5-2630L v3 (Haswell) | - | DDR4-2133 | gcc 6.2.0 | OpenMPNative(32) | 13.666 | 2191.960 | 1.609 | |
99e9942 | Intel Xeon E5-2630L v3 (Haswell) | - | DDR4-2133 | gcc 6.2.0 | SequentialAvx2 | 967.925 | 2225.010 | 1.128 | |
99e9942 | Intel Xeon E5-2630L v3 (Haswell) | - | DDR4-2133 | gcc 6.2.0 | OpenMPAvx2(32) | 703.479 | 2706.910 | 1.284 | |
99e9942 | AMD FX-8350 (Piledriver) | - | DDR3-1600 | Visual Studio 2015 Update 2 | SequentialNative | 21.658 | 3901.640 | 3.390 | |
99e9942 | AMD FX-8350 (Piledriver) | - | DDR3-1600 | Visual Studio 2015 Update 2 | OpenMPNative(8) | 27.210 | 3380.72 | 4.502 | |
e628194 | AMD Ryzen Threadripper 1950X (SummitRidge) | - | DDR4-3000 | Visual Studio Community 2017 | SequentialNative | 13.887 | 1695.260 | 1.445 | |
e628194 | AMD Ryzen Threadripper 1950X (SummitRidge) | - | DDR4-3000 | Visual Studio Community 2017 | OpenMPNative(32) | 14.476 | 575.883 | 1.471 | |
e628194 | AMD Ryzen Threadripper 1950X (SummitRidge) | - | DDR4-3000 | Visual Studio Community 2017 | SequentialAvx2 | 247.851 | 1871.100 | 0.710 | |
e628194 | AMD Ryzen Threadripper 1950X (SummitRidge) | - | DDR4-3000 | Visual Studio Community 2017 | OpenMPAvx2(32) | 255.062 | 848.715 | 0.942 | |
e628194 | Intel Core i9 7900X (Skylake-X) | - | DDR4-2666 | clang 6.0.0 | SequentialNative | 9.184 | 2028.720 | 1.043 | |
e628194 | Intel Core i9 7900X (Skylake-X) | - | DDR4-2666 | clang 6.0.0 | OpenMPNative(20) | 24.646 | 410.553 | 1.049 | |
e628194 | Intel Core i9 7900X (Skylake-X) | - | DDR4-2666 | clang 6.0.0 | SequentialAvx2 | 229.210 | 2120.010 | 1.715 | |
e628194 | Intel Core i9 7900X (Skylake-X) | - | DDR4-2666 | clang 6.0.0 | OpenMPAvx2(20) | 225.222 | 447.602 | 1.932 | |
e628194 | Intel Core i9 7900X (Skylake-X) | - | DDR4-2666 | gcc 5.4.0 | SequentialNative | 8.924 | 2040.490 | 1.005 | |
e628194 | Intel Core i9 7900X (Skylake-X) | - | DDR4-2666 | gcc 5.4.0 | OpenMPNative(20) | 10.254 | 410.062 | 0.996 | |
e628194 | Intel Core i9 7900X (Skylake-X) | - | DDR4-2666 | gcc 5.4.0 | SequentialAvx2 | 202.017 | 2066.810 | 1.638 | |
e628194 | Intel Core i9 7900X (Skylake-X) | - | DDR4-2666 | gcc 5.4.0 | OpenMPAvx2(20) | 187.950 | 446.356 | 1.955 | |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | - | DDR4-2666 | clang 7.0.1 | SequentialNative | 6.497 | 1061.300 | 0.907 | |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | - | DDR4-2666 | clang 7.0.1 | OpenMPNative(32) | 9.900 | 831.579 | 1.294 | ハイパースレッディング無効 |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | - | DDR4-2666 | clang 7.0.1 | OpenMPNative(64) | 10.536 | 892.760 | 1.345 | |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | - | DDR4-2666 | clang 7.0.1 | SequentialAvx2 | 201.219 | 1993.750 | 0.855 | |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | - | DDR4-2666 | clang 7.0.1 | OpenMPAvx2(32) | 263.348 | 1143.590 | 0.784 | ハイパースレッディング無効 |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | - | DDR4-2666 | clang 7.0.1 | OpenMPAvx2(64) | 348.107 | 1173.250 | 0.757 | |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | - | DDR4-2666 | gcc 7.3.0 | SequentialNative | 6.492 | 1137.170 | 0.940 | |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | - | DDR4-2666 | gcc 7.3.0 | OpenMPNative(32) | 6.863 | 1064.000 | 1.288 | ハイパースレッディング無効 |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | - | DDR4-2666 | gcc 7.3.0 | OpenMPNative(64) | 7.577 | 1154.85 | 1.225 | |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | - | DDR4-2666 | gcc 7.3.0 | SequentialAvx2 | 178.391 | 1700.92 | 0.523 | |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | - | DDR4-2666 | gcc 7.3.0 | OpenMPAvx2(32) | 182.598 | 1116.75 | 0.811 | ハイパースレッディング無効 |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | - | DDR4-2666 | gcc 7.3.0 | OpenMPAvx2(64) | 184.162 | 1248.42 | 0.792 | |
1b39478 | Intel Core i7 9700 (Coffe Lake) | - | DDR4-2666 | Visual Studio Community 2019 | SequentialNative | 6.938 | 1277.050 | 0.662 | |
1b39478 | Intel Core i7 9700 (Coffe Lake) | - | DDR4-2666 | Visual Studio Community 2019 | OpenMPNative(8) | 7.175 | 868.563 | 0.783 | |
1b39478 | Intel Core i7 9700 (Coffe Lake) | - | DDR4-2666 | Visual Studio Community 2019 | SequentialAvx2 | 178.672 | 1173.510 | 0.593 | |
1b39478 | Intel Core i7 9700 (Coffe Lake) | - | DDR4-2666 | Visual Studio Community 2019 | OpenMPAvx2(8) | 163.732 | 939.186 | 0.631 | |
1b39478 | Intel Xeon Gold 6126 (Skylake SP) | - | DDR4-2666 | clang 10.0.1 | SequentialNative | 9.517 | 2737.990 | 0.898 | |
1b39478 | Intel Xeon Gold 6126 (Skylake SP) | - | DDR4-2666 | clang 10.0.1 | OpenMPNative(48) | 16.394 | 338.657 | 1.082 | |
1b39478 | Intel Xeon Gold 6126 (Skylake SP) | - | DDR4-2666 | clang 10.0.1 | SequentialAvx2 | 217.850 | 2169.820 | 1.304 | |
1b39478 | Intel Xeon Gold 6126 (Skylake SP) | - | DDR4-2666 | clang 10.0.1 | OpenMPAvx2(48) | 221.020 | 481.484 | 1.454 | |
1b39478 | Intel Xeon Gold 6126 (Skylake SP) | - | DDR4-2666 | clang 10.0.1 | SequentialAvx512 | 209.959 | 2170.150 | 1.282 | |
1b39478 | Intel Xeon Gold 6126 (Skylake SP) | - | DDR4-2666 | clang 10.0.1 | OpenMPAvx512(48) | 225.018 | 347.813 | 1.523 | |
1b39478 | Intel Xeon Gold 6126 (Skylake SP) | - | DDR4-2666 | gcc 7.3.1 | SequentialNative | 9.771 | 2035.430 | 0.930 | |
1b39478 | Intel Xeon Gold 6126 (Skylake SP) | - | DDR4-2666 | gcc 7.3.1 | OpenMPNative(48) | 9.764 | 326.958 | 1.042 | |
1b39478 | Intel Xeon Gold 6126 (Skylake SP) | - | DDR4-2666 | gcc 7.3.1 | SequentialAvx2 | 214.001 | 2127.910 | 1.287 | |
1b39478 | Intel Xeon Gold 6126 (Skylake SP) | - | DDR4-2666 | gcc 7.3.1 | OpenMPAvx2(48) | 201.544 | 384.380 | 1.453 | |
1b39478 | Intel Xeon Gold 6126 (Skylake SP) | - | DDR4-2666 | gcc 7.3.1 | SequentialAvx512 | 203.910 | 2132.680 | 1.244 | |
1b39478 | Intel Xeon Gold 6126 (Skylake SP) | - | DDR4-2666 | gcc 7.3.1 | OpenMPAvx512(48) | 217.464 | 350.191 | 1.499 | |
CUDA | |||||||||
df3bb61 | Intel Core i7 4790 (Haswell) | GeForce GT 705 (Fermi) | DDR3-800 | Visual Studio 2015 Update 2 | CUDA | 201.146 | 18634.900 | 2.054 | |
ec667d8 | Intel Core i7 4790 (Haswell) | GeForce GT 705 (Fermi) | DDR3-800 | Visual Studio 2015 Update 2 | cuSPARSE | 288.109 | 3745.180 | 2.194 | |
d24d7fd | Intel Core i7 4790 (Haswell) | GeForce GT 705 (Fermi) | DDR3-800 | Visual Studio 2015 Update 2 | cuSPARSE&cuBLAS | 281.384 | 3048.540 | 1.968 | CUSPARSE_ELLマクロをコメントアウト |
d24d7fd | Intel Core i7 4790 (Haswell) | GeForce GT 705 (Fermi) | DDR3-800 | Visual Studio 2015 Update 2 | cuSPARSE-ELL | 864.774 | 2987.220 | 2.133 | |
d24d7fd | Intel Core i7 4790 (Haswell) | GeForce GT 705 (Fermi) | DDR3-800 | Visual Studio 2015 Update 2 | cuSPARSE-ELL&cuBLAS | 752.477 | 2236.960 | 2.032 | |
df3bb61 | Intel Core i7 920 (Nehalem) | Tesla K20c (Kepler) | DDR3-1333 | gcc 4.9.2 | CUDA | 210.878 | 1009.120 | 2.020 | sm35, compute_35 |
41b109d | Intel Core i7 920 (Nehalem) | Tesla K20c (Kepler) | DDR3-1333 | gcc 4.9.2 | CUDA | 207.652 | 782.976 | 1.887 | |
3f8570d | Intel Core i7 920 (Nehalem) | Tesla K20c (Kepler) | DDR3-1333 | gcc 4.9.2 | cuSPARSE | 237.946 | 222.014 | 1.955 | |
037b553 | Intel Core i7 920 (Nehalem) | Tesla K20c (Kepler) | DDR3-1333 | gcc 4.9.2 | cuSPARSE-ELL | 270.817 | 179.017 | 2.213 | |
f3c68b2 | Intel Core i7 920 (Nehalem) | Tesla K20c (Kepler) | DDR3-1333 | gcc 4.9.2 | cuSPARSE&cuBLAS | 238.447 | 177.110 | 1.974 | |
037b553 | Intel Core i7 920 (Nehalem) | Tesla K20c (Kepler) | DDR3-1333 | gcc 4.9.2 | cuSPARSE-ELL&cuBLAS | 270.254 | 134.303 | 2.111 | |
df3bb61 | Intel Xeon E5-2623 v3 (Haswell) | GeForce GTX TITAN X (Maxwell) | DDR4-2133 | gcc 5.4.0 | CUDA | 461.334 | 247.741 | 2.799 | sm_52, compute_52 |
037b553 | Intel Xeon E5-2623 v3 (Haswell) | GeForce GTX TITAN X (Maxwell) | DDR4-2133 | gcc 5.4.0 | CUDA | 483.294 | 194.594 | 2.703 | sm_52, compute_52 |
037b553 | Intel Xeon E5-2623 v3 (Haswell) | GeForce GTX TITAN X (Maxwell) | DDR4-2133 | gcc 5.4.0 | cuSPARSE | 252.967 | 109.931 | 2.893 | sm_52, compute_52。CUSPARSE_ELLマクロをコメントアウト |
037b553 | Intel Xeon E5-2623 v3 (Haswell) | GeForce GTX TITAN X (Maxwell) | DDR4-2133 | gcc 5.4.0 | cuSPARSE-ELL | 296.747 | 89.986 | 2.654 | sm_52, compute_52 |
037b553 | Intel Xeon E5-2623 v3 (Haswell) | GeForce GTX TITAN X (Maxwell) | DDR4-2133 | gcc 5.4.0 | cuSPARSE&cuBLAS | 213.013 | 90.902 | 2.292 | sm_52, compute_52。CUSPARSE_ELLマクロをコメントアウト |
037b553 | Intel Xeon E5-2623 v3 (Haswell) | GeForce GTX TITAN X (Maxwell) | DDR4-2133 | gcc 5.4.0 | cuSPARSE-ELL&cuBLAS | 252.779 | 70.582 | 2.331 | sm_52, compute_52 |
df3bb61 | Intel Xeon E5-2623 v3 (Haswell) | GeForce GTX 1080 (Pascal) | DDR4-2133 | gcc 5.4.0 | CUDA | 366.786 | 206.448 | 2.275 | |
41b109d | Intel Xeon E5-2623 v3 (Haswell) | GeForce GTX 1080 (Pascal) | DDR4-2133 | gcc 5.4.0 | CUDA | 485.465 | 182.589 | 2.616 | sm_61, compute_61 |
3f8570d | Intel Xeon E5-2623 v3 (Haswell) | GeForce GTX 1080 (Pascal) | DDR4-2133 | gcc 5.4.0 | cuSPARSE | 250.856 | 111.227 | 2.824 | sm_61, compute_61 |
037b553 | Intel Xeon E5-2623 v3 (Haswell) | GeForce GTX 1080 (Pascal) | DDR4-2133 | gcc 5.4.0 | cuSPARSE-ELL | 262.811 | 96.279 | 2.550 | sm_61, compute_61 |
f3c68b2 | Intel Xeon E5-2623 v3 (Haswell) | GeForce GTX 1080 (Pascal) | DDR4-2133 | gcc 5.4.0 | cuSPARSE&cuBLAS | 342.045 | 96.840 | 3.745 | sm_61, compute_61 |
037b553 | Intel Xeon E5-2623 v3 (Haswell) | GeForce GTX 1080 (Pascal) | DDR4-2133 | gcc 5.4.0 | cuSPARSE-ELL&cuBLAS | 244.823 | 79.910 | 2.060 | sm_61, compute_61 |
037b553 | Intel Core i7 5820K (Haswell) | TITAN X (Pascal) | DDR4-2133 | gcc 4.9.0 | CUDA | 232.67 | 145.854 | 0.900 | |
037b553 | Intel Core i7 5820K (Haswell) | TITAN X (Pascal) | DDR4-2133 | gcc 4.9.0 | cuSPARSE | 123.409 | 84.464 | 0.903 | CUSPARSE_ELLマクロをコメントアウト |
037b553 | Intel Core i7 5820K (Haswell) | TITAN X (Pascal) | DDR4-2133 | gcc 4.9.0 | cuSPARSE-ELL | 134.510 | 70.148 | 0.876 | |
037b553 | Intel Core i7 5820K (Haswell) | TITAN X (Pascal) | DDR4-2133 | gcc 5.4.0 | cuSPARSE&cuBLAS | 123.963 | 69.535 | 1.456 | CUSPARSE_ELLマクロをコメントアウト |
037b553 | Intel Core i7 5820K (Haswell) | TITAN X (Pascal) | DDR4-2133 | gcc 5.4.0 | cuSPARSE-ELL&cuBLAS | 138.951 | 55.150 | 0.872 | |
99e9942 | Intel Xeon E5-2630L v3 (Haswell) | Tesla P100 SXM2 16GB (Pascal) | DDR4-2133 | gcc 6.2.0 | CUDA | 362.139 | 113.25 | 1.924 | sm_60,compute_60 |
99e9942 | Intel Xeon E5-2630L v3 (Haswell) | Tesla P100 SXM2 16GB (Pascal) | DDR4-2133 | gcc 6.2.0 | cuSPARSE | 492.803 | 60.408 | 1.427 | sm_60,compute_60 |
99e9942 | Intel Xeon E5-2630L v3 (Haswell) | Tesla P100 SXM2 16GB (Pascal) | DDR4-2133 | gcc 6.2.0 | cuSPARSE-ELL | 670.095 | 57.553 | 1.737 | sm_60,compute_60 |
99e9942 | Intel Xeon E5-2630L v3 (Haswell) | Tesla P100 SXM2 16GB (Pascal) | DDR4-2133 | gcc 6.2.0 | cuSPARSE&cuBLAS | 518.066 | 44.357 | 1.416 | sm_60,compute_60 |
99e9942 | Intel Xeon E5-2630L v3 (Haswell) | Tesla P100 SXM2 16GB (Pascal) | DDR4-2133 | gcc 6.2.0 | cuSPARSE-ELL&cuBLAS | 526.738 | 41.084 | 1.482 | sm_60,compute_60 |
e628194 | Intel Core i9 7900X (Skylake-X) | Titan V (Volta) | DDR4-2666 | gcc 5.4.0 | CUDA | 284.426 | 127.931 | 1.182 | sm_70, compute_70 |
e628194 | Intel Core i9 7900X (Skylake-X) | Titan V (Volta) | DDR4-2666 | gcc 5.4.0 | cuSPARSE-ELL | 193.549 | 51.433 | 1.268 | sm_70, compute_70 |
e628194 | Intel Core i9 7900X (Skylake-X) | Titan V (Volta) | DDR4-2666 | gcc 5.4.0 | cuSPARSE-ELL&cuBLAS | 187.831 | 33.704 | 1.241 | sm_70, compute_70 |
e628194 | Intel Core i9 7900X (Skylake-X) | Titan V (Volta) | DDR4-2666 | gcc 5.4.0 | cuSPARSE | 174.251 | 51.34 | 1.173 | sm_70, compute_70。CUSPARSE_ELLマクロをコメントアウト |
e628194 | Intel Core i9 7900X (Skylake-X) | Titan V (Volta) | DDR4-2666 | gcc 5.4.0 | cuSPARSE&cuBLAS | 172.509 | 33.727 | 1.159 | sm_70, compute_70。CUSPARSE_ELLマクロをコメントアウト |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | GeForce GTX 1080 (Pascal) | DDR4-2666 | gcc 7.3.0 | CUDA | 98.909 | 173.534 | 0.763 | sm_61, compute_61 |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | GeForce GTX 1080 (Pascal) | DDR4-2666 | gcc 7.3.0 | cuSPARSE-ELL | 132.686 | 94.603 | 0.782 | sm_61, compute_61 |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | GeForce GTX 1080 (Pascal) | DDR4-2666 | gcc 7.3.0 | cuSPARSE-ELL&cuBLAS | 132.645 | 79.872 | 0.787 | sm_61, compute_61 |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | GeForce GTX 1080 (Pascal) | DDR4-2666 | gcc 7.3.0 | cuSPARSE | 151.611 | 111.998 | 1.100 | sm_61, compute_61。CUSPARSE_ELLマクロをコメントアウト |
e628194 | AMD Ryzen Threadripper 2990WX (Pinnacle Ridge) | GeForce GTX 1080 (Pascal) | DDR4-2666 | gcc 7.3.0 | cuSPARSE&cuBLAS | 150.932 | 97.382 | 1.104 | sm_61, compute_61。CUSPARSE_ELLマクロをコメントアウト |
1b39478 | Intel Xeon Gold 6152 (Skylake SP) | Tesla V100 SXM2 32GB (Volta) | DDR4-2666 | gcc 7.4.0 | CUDA | 298.301 | 115.998 | 1.581 | sm_70, compute_70 |
1b39478 | Intel Xeon Gold 6152 (Skylake SP) | Tesla V100 SXM2 32GB (Volta) | DDR4-2666 | gcc 7.4.0 | cuSPARSE-ELL | 360.980 | 45.459 | 1.548 | sm_70, compute_70 |
1b39478 | Intel Xeon Gold 6152 (Skylake SP) | Tesla V100 SXM2 32GB (Volta) | DDR4-2666 | gcc 7.4.0 | cuSPARSE-ELL&cuBLAS | 346.652 | 27.857 | 1.548 | sm_70, compute_70 |
1b39478 | Intel Xeon Gold 6152 (Skylake SP) | Tesla V100 SXM2 32GB (Volta) | DDR4-2666 | gcc 7.4.0 | cuSPARSE | 336.482 | 46.841 | 2.233 | sm_70, compute_70。CUSPARSE_ELLマクロをコメントアウト |
1b39478 | Intel Xeon Gold 6152 (Skylake SP) | Tesla V100 SXM2 32GB (Volta) | DDR4-2666 | gcc 7.4.0 | cuSPARSE&cuBLAS | 394.93 | 29.279 | 1.682 | sm_70, compute_70。CUSPARSE_ELLマクロをコメントアウト |
Xeon Phi | |||||||||
a165054 | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | clang 3.8.1 | SequentialNative | 16.012 | 6984.320 | 0.002 | numactl -p0 ./ConjugateGradient |
a165054 | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | clang 3.8.1 | OpenMPNative(256) | 13.695 | 381.483 | 0.003 | numactl -p0 ./ConjugateGradient |
bc58b6d | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | clang 3.8.1 | SequentialAvx2 | 427.623 | 5869.350 | 2.007 | numactl -p0 ./ConjugateGradient |
bc58b6d | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | clang 3.8.1 | OpenMPAvx2(256) | 475.428 | 465.915 | 3.921 | numactl -p0 ./ConjugateGradient |
a165054 | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | clang 3.8.1 | SequentialNative | 15.642 | 6886.110 | 0.002 | numactl -p1 ./ConjugateGradient |
a165054 | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | clang 3.8.1 | OpenMPNative(256) | 16.801 | 97.478 | 0.003 | numactl -p1 ./ConjugateGradient |
bc58b6d | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | clang 3.8.1 | SequentialAvx2 | 461.833 | 5849.790 | 1.830 | numactl -p1 ./ConjugateGradient |
bc58b6d | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | clang 3.8.1 | OpenMPAvx2(256) | 395.451 | 162.667 | 3.973 | numactl -p1 ./ConjugateGradient |
9d4e1a9 | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | gcc 6.1.0 | SequentialNative | 17.656 | 7572.050 | 3.044 | numactl -p0 ./ConjugateGradient |
9d4e1a9 | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | gcc 6.1.0 | OpenMPNative(256) | 22.510 | 535.463 | 5.146 | numactl -p0 ./ConjugateGradient |
9d4e1a9 | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | gcc 6.1.0 | SequentialAvx2 | 370.924 | 13236.600 | 2.110 | numactl -p0 ./ConjugateGradient |
9d4e1a9 | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | gcc 6.1.0 | OpenMPAvx2(256) | 376.235 | 479.499 | 3.374 | numactl -p0 ./ConjugateGradient |
3b4565d | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | gcc 6.1.0 | SequentialAvx512 | 285.671 | 13186.600 | 1.868 | numactl -p0 ./ConjugateGradient |
3b4565d | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | gcc 6.1.0 | OpenMPAvx512(256) | 351.503 | 501.11 | 3.415 | numactl -p0 ./ConjugateGradient |
9d4e1a9 | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | gcc 6.1.0 | SequentialNative | 15.479 | 7495.210 | 2.969 | numactl -p1 ./ConjugateGradient |
9d4e1a9 | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | gcc 6.1.0 | OpenMPNative(256) | 20.512 | 202.463 | 5.086 | numactl -p1 ./ConjugateGradient |
9d4e1a9 | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | gcc 6.1.0 | SequentialAvx2 | 356.729 | 13238.900 | 1.749 | numactl -p1 ./ConjugateGradient |
9d4e1a9 | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | gcc 6.1.0 | OpenMPAvx2(256) | 359.745 | 184.324 | 3.324 | numactl -p1 ./ConjugateGradient |
3b4565d | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | gcc 6.1.0 | SequentialAvx512 | 279.218 | 13272.900 | 1.541 | numactl -p1 ./ConjugateGradient |
3b4565d | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | gcc 6.1.0 | OpenMPAvx512(256) | 337.39 | 174.269 | 3.762 | numactl -p1 ./ConjugateGradient |
99e9942 | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | icc 16.0.3 | SequentialNative | 4.376 | 7787.140 | 4.477 | numactl -p0 ./ConjugateGradient |
99e9942 | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | icc 16.0.3 | OpenMPNative(256) | 6.794 | 423.155 | 11.242 | numactl -p0 ./ConjugateGradient |
99e9942 | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | icc 16.0.3 | SequentialAvx2 | 344.749 | 8935.240 | 4.838 | numactl -p0 ./ConjugateGradient |
99e9942 | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | icc 16.0.3 | OpenMPAvx2(256) | 395.898 | 445.284 | 9.407 | numactl -p0 ./ConjugateGradient |
99e9942 | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | icc 16.0.3 | SequentialAvx512 | 361.778 | 8938.510 | 4.855 | numactl -p0 ./ConjugateGradient |
99e9942 | Intel Xeon Phi 7210 (Kights Landing) | - | DDR4-2133 | icc 16.0.3 | OpenMPAvx512(256) | 357.743 | 488.334 | 9.475 | numactl -p0 ./ConjugateGradient |
99e9942 | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | icc 16.0.3 | SequentialNative | 3.867 | 7914.800 | 3.912 | numactl -p1 ./ConjugateGradient |
99e9942 | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | icc 16.0.3 | OpenMPNative(256) | 6.421 | 106.360 | 7.295 | numactl -p1 ./ConjugateGradient |
99e9942 | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | icc 16.0.3 | SequentialAvx2 | 387.308 | 9441.260 | 4.338 | numactl -p1 ./ConjugateGradient |
99e9942 | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | icc 16.0.3 | OpenMPAvx2(256) | 368.937 | 105.556 | 9.083 | numactl -p1 ./ConjugateGradient |
99e9942 | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | icc 16.0.3 | SequentialAvx512 | 318.770 | 9443.310 | 4.302 | numactl -p1 ./ConjugateGradient |
99e9942 | Intel Xeon Phi 7210 (Kights Landing) | - | MCDRAM | icc 16.0.3 | OpenMPAvx512(256) | 378.751 | 115.929 | 9.166 | numactl -p1 ./ConjugateGradient |
e628194 | Intel Xeon Phi 7250 (Kights Landing) | - | DDR4-2133 | icc 17.0.1 | SequentialNative | 3.883 | 4052.020 | 6.547 | numactl -p0 ./ConjugateGradient , Executed on Oakforest-PACS |
e628194 | Intel Xeon Phi 7250 (Kights Landing) | - | DDR4-2133 | icc 17.0.1 | OpenMPNative(272) | 6.599 | 362.637 | 10.406 | numactl -p0 ./ConjugateGradient , Executed on Oakforest-PACS |
e628194 | Intel Xeon Phi 7250 (Kights Landing) | - | DDR4-2133 | icc 17.0.1 | SequentialAvx2 | 280.491 | 8022.430 | 4.501 | numactl -p0 ./ConjugateGradient , Executed on Oakforest-PACS |
e628194 | Intel Xeon Phi 7250 (Kights Landing) | - | DDR4-2133 | icc 17.0.1 | OpenMPAvx2(272) | 276.864 | 385.442 | 8.368 | numactl -p0 ./ConjugateGradient , Executed on Oakforest-PACS |
e628194 | Intel Xeon Phi 7250 (Kights Landing) | - | DDR4-2133 | icc 17.0.1 | SequentialAvx512 | 266.080 | 8085.770 | 4.611 | numactl -p0 ./ConjugateGradient , Executed on Oakforest-PACS |
e628194 | Intel Xeon Phi 7250 (Kights Landing) | - | DDR4-2133 | icc 17.0.1 | OpenMPAvx512(272) | 318.311 | 420.487 | 8.149 | numactl -p0 ./ConjugateGradient , Executed on Oakforest-PACS |
e628194 | Intel Xeon Phi 7250 (Kights Landing) | - | MCDRAM | icc 17.0.1 | SequentialNative | 5.578 | 3898.490 | 3.850 | numactl -p1 ./ConjugateGradient , Executed on Oakforest-PACS |
e628194 | Intel Xeon Phi 7250 (Kights Landing) | - | MCDRAM | icc 17.0.1 | OpenMPNative(272) | 9.643 | 89.475 | 6.244 | numactl -p1 ./ConjugateGradient , Executed on Oakforest-PACS |
e628194 | Intel Xeon Phi 7250 (Kights Landing) | - | MCDRAM | icc 17.0.1 | SequentialAvx2 | 276.552 | 8165.090 | 4.115 | numactl -p1 ./ConjugateGradient , Executed on Oakforest-PACS |
e628194 | Intel Xeon Phi 7250 (Kights Landing) | - | MCDRAM | icc 17.0.1 | OpenMPAvx2(272) | 274.735 | 91.487 | 8.054 | numactl -p1 ./ConjugateGradient , Executed on Oakforest-PACS |
e628194 | Intel Xeon Phi 7250 (Kights Landing) | - | MCDRAM | icc 17.0.1 | SequentialAvx512 | 262.743 | 8176.420 | 4.292 | numactl -p1 ./ConjugateGradient , Executed on Oakforest-PACS |
e628194 | Intel Xeon Phi 7250 (Kights Landing) | - | MCDRAM | icc 17.0.1 | OpenMPAvx512(272) | 311.328 | 98.040 | 7.830 | numactl -p1 ./ConjugateGradient , Executed on Oakforest-PACS |
Updated