# New results * Runs on Dardel test node at PDC. The CPU is [AMD EPYC 7742 64-Core Processor](https://www.amd.com/en/products/cpu/amd-epyc-7742) * 4 implementations: local, global, OpenMP allocator, memory pool. * 3 workloads: 15000 (small), 150000 (medium), 1500000 (large) double precision numbers in buffer. * Outer loop of size 10000 * I used [hyperfine](https://github.com/sharkdp/hyperfine) to run the benchmarks. ## Large workload | Command | Mean [s] | Min [s] | Max [s] | Relative | |:---|---:|---:|---:|---:| | `env OMP_NUM_THREADS=1 ./local.x 1500000 10000` | 252.532 ± 0.480 | 252.051 | 253.725 | 3606.69 ± 103.63 | | `env OMP_NUM_THREADS=2 ./local.x 1500000 10000` | 130.159 ± 0.810 | 128.971 | 131.324 | 1858.94 ± 54.54 | | `env OMP_NUM_THREADS=4 ./local.x 1500000 10000` | 65.964 ± 0.131 | 65.699 | 66.172 | 942.11 ± 27.08 | | `env OMP_NUM_THREADS=8 ./local.x 1500000 10000` | 33.372 ± 0.069 | 33.251 | 33.469 | 476.62 ± 13.70 | | `env OMP_NUM_THREADS=16 ./local.x 1500000 10000` | 17.317 ± 0.064 | 17.163 | 17.397 | 247.33 ± 7.15 | | `env OMP_NUM_THREADS=32 ./local.x 1500000 10000` | 10.120 ± 0.099 | 9.983 | 10.272 | 144.53 ± 4.38 | | `env OMP_NUM_THREADS=64 ./local.x 1500000 10000` | 8.806 ± 0.171 | 8.513 | 9.023 | 125.77 ± 4.36 | | `env OMP_NUM_THREADS=128 ./local.x 1500000 10000` | 7.980 ± 0.162 | 7.787 | 8.273 | 113.97 ± 4.00 | | `env OMP_NUM_THREADS=1 ./global.x 1500000 10000` | 232.908 ± 0.364 | 232.154 | 233.326 | 3326.41 ± 95.51 | | `env OMP_NUM_THREADS=2 ./global.x 1500000 10000` | 117.080 ± 0.054 | 117.035 | 117.213 | 1672.15 ± 47.95 | | `env OMP_NUM_THREADS=4 ./global.x 1500000 10000` | 58.261 ± 0.031 | 58.203 | 58.300 | 832.08 ± 23.86 | | `env OMP_NUM_THREADS=8 ./global.x 1500000 10000` | 29.151 ± 0.014 | 29.132 | 29.178 | 416.34 ± 11.94 | | `env OMP_NUM_THREADS=16 ./global.x 1500000 10000` | 14.645 ± 0.014 | 14.636 | 14.684 | 209.15 ± 6.00 | | `env OMP_NUM_THREADS=32 ./global.x 1500000 10000` | 7.657 ± 0.392 | 7.457 | 8.403 | 109.36 ± 6.41 | | `env OMP_NUM_THREADS=64 ./global.x 1500000 10000` | 3.946 ± 0.147 | 3.891 | 4.363 | 56.36 ± 2.65 | | `env OMP_NUM_THREADS=128 ./global.x 1500000 10000` | 2.684 ± 0.096 | 2.559 | 2.767 | 38.33 ± 1.76 | | `env OMP_NUM_THREADS=1 ./omp_alloc.x 1500000 10000` | 252.785 ± 0.225 | 252.362 | 253.034 | 3610.29 ± 103.56 | | `env OMP_NUM_THREADS=2 ./omp_alloc.x 1500000 10000` | 131.554 ± 1.092 | 129.127 | 132.999 | 1878.86 ± 56.08 | | `env OMP_NUM_THREADS=4 ./omp_alloc.x 1500000 10000` | 66.081 ± 0.153 | 65.850 | 66.288 | 943.78 ± 27.15 | | `env OMP_NUM_THREADS=8 ./omp_alloc.x 1500000 10000` | 33.401 ± 0.037 | 33.357 | 33.471 | 477.04 ± 13.69 | | `env OMP_NUM_THREADS=16 ./omp_alloc.x 1500000 10000` | 17.273 ± 0.057 | 17.196 | 17.356 | 246.70 ± 7.12 | | `env OMP_NUM_THREADS=32 ./omp_alloc.x 1500000 10000` | 10.153 ± 0.059 | 10.061 | 10.260 | 145.00 ± 4.24 | | `env OMP_NUM_THREADS=64 ./omp_alloc.x 1500000 10000` | 8.748 ± 0.080 | 8.628 | 8.937 | 124.93 ± 3.76 | | `env OMP_NUM_THREADS=128 ./omp_alloc.x 1500000 10000` | 7.973 ± 0.199 | 7.592 | 8.235 | 113.87 ± 4.32 | | `env OMP_NUM_THREADS=1 ./pool.x 1500000 10000` | 252.053 ± 0.363 | 251.614 | 252.718 | 3230.76 ± 250.86 | | `env OMP_NUM_THREADS=2 ./pool.x 1500000 10000` | 126.822 ± 0.223 | 126.404 | 127.076 | 1625.58 ± 126.23 | | `env OMP_NUM_THREADS=4 ./pool.x 1500000 10000` | 63.302 ± 0.064 | 63.252 | 63.426 | 811.39 ± 63.00 | | `env OMP_NUM_THREADS=8 ./pool.x 1500000 10000` | 32.000 ± 0.096 | 31.884 | 32.190 | 410.16 ± 31.87 | | `env OMP_NUM_THREADS=16 ./pool.x 1500000 10000` | 16.283 ± 0.141 | 16.171 | 16.668 | 208.71 ± 16.30 | | `env OMP_NUM_THREADS=32 ./pool.x 1500000 10000` | 8.785 ± 1.570 | 8.200 | 13.251 | 112.60 ± 21.94 | | `env OMP_NUM_THREADS=64 ./pool.x 1500000 10000` | 4.997 ± 1.045 | 4.483 | 6.980 | 64.05 ± 14.29 | | `env OMP_NUM_THREADS=128 ./pool.x 1500000 10000` | 4.722 ± 0.432 | 4.224 | 5.187 | 60.53 ± 7.26 | | `env OMP_NUM_THREADS=1 ./store.x 1500000 10000` | 252.150 ± 0.548 | 251.489 | 253.292 | 3539.33 ± 316.82 | | `env OMP_NUM_THREADS=2 ./store.x 1500000 10000` | 126.658 ± 0.222 | 126.392 | 127.025 | 1777.84 ± 159.13 | | `env OMP_NUM_THREADS=4 ./store.x 1500000 10000` | 63.351 ± 0.167 | 63.225 | 63.767 | 889.24 ± 79.61 | | `env OMP_NUM_THREADS=8 ./store.x 1500000 10000` | 32.089 ± 0.124 | 31.990 | 32.323 | 450.42 ± 40.34 | | `env OMP_NUM_THREADS=16 ./store.x 1500000 10000` | 16.178 ± 0.029 | 16.123 | 16.217 | 227.08 ± 20.32 | | `env OMP_NUM_THREADS=32 ./store.x 1500000 10000` | 8.716 ± 1.606 | 8.196 | 13.288 | 122.35 ± 25.07 | | `env OMP_NUM_THREADS=64 ./store.x 1500000 10000` | 5.742 ± 1.306 | 4.485 | 7.028 | 80.60 ± 19.71 | | `env OMP_NUM_THREADS=128 ./store.x 1500000 10000` | 4.655 ± 0.388 | 4.178 | 5.092 | 65.34 ± 7.99 | ## All new results | Command | Mean [s] | Min [s] | Max [s] | Relative | |:---|---:|---:|---:|---:| | `env OMP_NUM_THREADS=1 ./local.x 15000 10000` | 2.386 ± 0.014 | 2.371 | 2.417 | 34.08 ± 1.00 | | `env OMP_NUM_THREADS=2 ./local.x 15000 10000` | 1.207 ± 0.004 | 1.201 | 1.212 | 17.23 ± 0.50 | | `env OMP_NUM_THREADS=4 ./local.x 15000 10000` | 0.611 ± 0.002 | 0.608 | 0.615 | 8.72 ± 0.25 | | `env OMP_NUM_THREADS=8 ./local.x 15000 10000` | 0.317 ± 0.003 | 0.315 | 0.323 | 4.52 ± 0.13 | | `env OMP_NUM_THREADS=16 ./local.x 15000 10000` | 0.170 ± 0.002 | 0.168 | 0.174 | 2.43 ± 0.07 | | `env OMP_NUM_THREADS=32 ./local.x 15000 10000` | 0.100 ± 0.002 | 0.098 | 0.105 | 1.43 ± 0.05 | | `env OMP_NUM_THREADS=64 ./local.x 15000 10000` | 0.070 ± 0.002 | 0.068 | 0.078 | 1.00 | | `env OMP_NUM_THREADS=128 ./local.x 15000 10000` | 0.075 ± 0.003 | 0.066 | 0.080 | 1.07 ± 0.06 | | `env OMP_NUM_THREADS=1 ./local.x 150000 10000` | 23.891 ± 0.042 | 23.835 | 23.945 | 341.22 ± 9.80 | | `env OMP_NUM_THREADS=2 ./local.x 150000 10000` | 12.066 ± 0.020 | 12.034 | 12.105 | 172.33 ± 4.95 | | `env OMP_NUM_THREADS=4 ./local.x 150000 10000` | 6.030 ± 0.018 | 6.016 | 6.072 | 86.12 ± 2.48 | | `env OMP_NUM_THREADS=8 ./local.x 150000 10000` | 3.033 ± 0.011 | 3.019 | 3.047 | 43.31 ± 1.25 | | `env OMP_NUM_THREADS=16 ./local.x 150000 10000` | 1.544 ± 0.009 | 1.538 | 1.567 | 22.05 ± 0.65 | | `env OMP_NUM_THREADS=32 ./local.x 150000 10000` | 0.806 ± 0.001 | 0.805 | 0.807 | 11.51 ± 0.33 | | `env OMP_NUM_THREADS=64 ./local.x 150000 10000` | 0.472 ± 0.027 | 0.453 | 0.538 | 6.74 ± 0.43 | | `env OMP_NUM_THREADS=128 ./local.x 150000 10000` | 0.356 ± 0.012 | 0.342 | 0.379 | 5.09 ± 0.23 | | `env OMP_NUM_THREADS=1 ./local.x 1500000 10000` | 252.532 ± 0.480 | 252.051 | 253.725 | 3606.69 ± 103.63 | | `env OMP_NUM_THREADS=2 ./local.x 1500000 10000` | 130.159 ± 0.810 | 128.971 | 131.324 | 1858.94 ± 54.54 | | `env OMP_NUM_THREADS=4 ./local.x 1500000 10000` | 65.964 ± 0.131 | 65.699 | 66.172 | 942.11 ± 27.08 | | `env OMP_NUM_THREADS=8 ./local.x 1500000 10000` | 33.372 ± 0.069 | 33.251 | 33.469 | 476.62 ± 13.70 | | `env OMP_NUM_THREADS=16 ./local.x 1500000 10000` | 17.317 ± 0.064 | 17.163 | 17.397 | 247.33 ± 7.15 | | `env OMP_NUM_THREADS=32 ./local.x 1500000 10000` | 10.120 ± 0.099 | 9.983 | 10.272 | 144.53 ± 4.38 | | `env OMP_NUM_THREADS=64 ./local.x 1500000 10000` | 8.806 ± 0.171 | 8.513 | 9.023 | 125.77 ± 4.36 | | `env OMP_NUM_THREADS=128 ./local.x 1500000 10000` | 7.980 ± 0.162 | 7.787 | 8.273 | 113.97 ± 4.00 | | `env OMP_NUM_THREADS=1 ./global.x 15000 10000` | 2.339 ± 0.015 | 2.330 | 2.380 | 33.41 ± 0.98 | | `env OMP_NUM_THREADS=2 ./global.x 15000 10000` | 1.183 ± 0.003 | 1.179 | 1.186 | 16.90 ± 0.49 | | `env OMP_NUM_THREADS=4 ./global.x 15000 10000` | 0.599 ± 0.002 | 0.597 | 0.605 | 8.56 ± 0.25 | | `env OMP_NUM_THREADS=8 ./global.x 15000 10000` | 0.312 ± 0.003 | 0.309 | 0.317 | 4.45 ± 0.14 | | `env OMP_NUM_THREADS=16 ./global.x 15000 10000` | 0.166 ± 0.000 | 0.165 | 0.167 | 2.37 ± 0.07 | | `env OMP_NUM_THREADS=32 ./global.x 15000 10000` | 0.100 ± 0.003 | 0.097 | 0.108 | 1.43 ± 0.06 | | `env OMP_NUM_THREADS=64 ./global.x 15000 10000` | 0.071 ± 0.003 | 0.067 | 0.078 | 1.02 ± 0.05 | | `env OMP_NUM_THREADS=128 ./global.x 15000 10000` | 0.073 ± 0.003 | 0.065 | 0.082 | 1.04 ± 0.06 | | `env OMP_NUM_THREADS=1 ./global.x 150000 10000` | 23.248 ± 0.012 | 23.231 | 23.273 | 332.03 ± 9.52 | | `env OMP_NUM_THREADS=2 ./global.x 150000 10000` | 11.734 ± 0.019 | 11.703 | 11.759 | 167.59 ± 4.81 | | `env OMP_NUM_THREADS=4 ./global.x 150000 10000` | 5.856 ± 0.011 | 5.838 | 5.871 | 83.63 ± 2.40 | | `env OMP_NUM_THREADS=8 ./global.x 150000 10000` | 2.945 ± 0.004 | 2.940 | 2.953 | 42.06 ± 1.21 | | `env OMP_NUM_THREADS=16 ./global.x 150000 10000` | 1.499 ± 0.001 | 1.498 | 1.500 | 21.41 ± 0.61 | | `env OMP_NUM_THREADS=32 ./global.x 150000 10000` | 0.790 ± 0.004 | 0.786 | 0.800 | 11.28 ± 0.33 | | `env OMP_NUM_THREADS=64 ./global.x 150000 10000` | 0.452 ± 0.020 | 0.436 | 0.491 | 6.46 ± 0.33 | | `env OMP_NUM_THREADS=128 ./global.x 150000 10000` | 0.339 ± 0.010 | 0.330 | 0.356 | 4.84 ± 0.20 | | `env OMP_NUM_THREADS=1 ./global.x 1500000 10000` | 232.908 ± 0.364 | 232.154 | 233.326 | 3326.41 ± 95.51 | | `env OMP_NUM_THREADS=2 ./global.x 1500000 10000` | 117.080 ± 0.054 | 117.035 | 117.213 | 1672.15 ± 47.95 | | `env OMP_NUM_THREADS=4 ./global.x 1500000 10000` | 58.261 ± 0.031 | 58.203 | 58.300 | 832.08 ± 23.86 | | `env OMP_NUM_THREADS=8 ./global.x 1500000 10000` | 29.151 ± 0.014 | 29.132 | 29.178 | 416.34 ± 11.94 | | `env OMP_NUM_THREADS=16 ./global.x 1500000 10000` | 14.645 ± 0.014 | 14.636 | 14.684 | 209.15 ± 6.00 | | `env OMP_NUM_THREADS=32 ./global.x 1500000 10000` | 7.657 ± 0.392 | 7.457 | 8.403 | 109.36 ± 6.41 | | `env OMP_NUM_THREADS=64 ./global.x 1500000 10000` | 3.946 ± 0.147 | 3.891 | 4.363 | 56.36 ± 2.65 | | `env OMP_NUM_THREADS=128 ./global.x 1500000 10000` | 2.684 ± 0.096 | 2.559 | 2.767 | 38.33 ± 1.76 | | `env OMP_NUM_THREADS=1 ./omp_alloc.x 15000 10000` | 2.381 ± 0.008 | 2.373 | 2.393 | 34.00 ± 0.98 | | `env OMP_NUM_THREADS=2 ./omp_alloc.x 15000 10000` | 1.206 ± 0.003 | 1.202 | 1.210 | 17.22 ± 0.50 | | `env OMP_NUM_THREADS=4 ./omp_alloc.x 15000 10000` | 0.613 ± 0.003 | 0.610 | 0.619 | 8.75 ± 0.25 | | `env OMP_NUM_THREADS=8 ./omp_alloc.x 15000 10000` | 0.315 ± 0.001 | 0.314 | 0.317 | 4.50 ± 0.13 | | `env OMP_NUM_THREADS=16 ./omp_alloc.x 15000 10000` | 0.169 ± 0.001 | 0.168 | 0.172 | 2.42 ± 0.07 | | `env OMP_NUM_THREADS=32 ./omp_alloc.x 15000 10000` | 0.100 ± 0.001 | 0.099 | 0.103 | 1.43 ± 0.04 | | `env OMP_NUM_THREADS=64 ./omp_alloc.x 15000 10000` | 0.071 ± 0.003 | 0.067 | 0.079 | 1.02 ± 0.05 | | `env OMP_NUM_THREADS=128 ./omp_alloc.x 15000 10000` | 0.075 ± 0.004 | 0.069 | 0.081 | 1.07 ± 0.06 | | `env OMP_NUM_THREADS=1 ./omp_alloc.x 150000 10000` | 23.931 ± 0.060 | 23.827 | 23.996 | 341.79 ± 9.84 | | `env OMP_NUM_THREADS=2 ./omp_alloc.x 150000 10000` | 12.058 ± 0.027 | 12.017 | 12.100 | 172.21 ± 4.95 | | `env OMP_NUM_THREADS=4 ./omp_alloc.x 150000 10000` | 6.034 ± 0.015 | 6.004 | 6.053 | 86.18 ± 2.48 | | `env OMP_NUM_THREADS=8 ./omp_alloc.x 150000 10000` | 3.060 ± 0.021 | 3.022 | 3.092 | 43.70 ± 1.29 | | `env OMP_NUM_THREADS=16 ./omp_alloc.x 150000 10000` | 1.545 ± 0.009 | 1.539 | 1.565 | 22.07 ± 0.65 | | `env OMP_NUM_THREADS=32 ./omp_alloc.x 150000 10000` | 0.822 ± 0.043 | 0.805 | 0.943 | 11.73 ± 0.70 | | `env OMP_NUM_THREADS=64 ./omp_alloc.x 150000 10000` | 0.470 ± 0.018 | 0.452 | 0.507 | 6.71 ± 0.32 | | `env OMP_NUM_THREADS=128 ./omp_alloc.x 150000 10000` | 0.354 ± 0.021 | 0.338 | 0.405 | 5.06 ± 0.33 | | `env OMP_NUM_THREADS=1 ./omp_alloc.x 1500000 10000` | 252.785 ± 0.225 | 252.362 | 253.034 | 3610.29 ± 103.56 | | `env OMP_NUM_THREADS=2 ./omp_alloc.x 1500000 10000` | 131.554 ± 1.092 | 129.127 | 132.999 | 1878.86 ± 56.08 | | `env OMP_NUM_THREADS=4 ./omp_alloc.x 1500000 10000` | 66.081 ± 0.153 | 65.850 | 66.288 | 943.78 ± 27.15 | | `env OMP_NUM_THREADS=8 ./omp_alloc.x 1500000 10000` | 33.401 ± 0.037 | 33.357 | 33.471 | 477.04 ± 13.69 | | `env OMP_NUM_THREADS=16 ./omp_alloc.x 1500000 10000` | 17.273 ± 0.057 | 17.196 | 17.356 | 246.70 ± 7.12 | | `env OMP_NUM_THREADS=32 ./omp_alloc.x 1500000 10000` | 10.153 ± 0.059 | 10.061 | 10.260 | 145.00 ± 4.24 | | `env OMP_NUM_THREADS=64 ./omp_alloc.x 1500000 10000` | 8.748 ± 0.080 | 8.628 | 8.937 | 124.93 ± 3.76 | | `env OMP_NUM_THREADS=128 ./omp_alloc.x 1500000 10000` | 7.973 ± 0.199 | 7.592 | 8.235 | 113.87 ± 4.32 | | `env OMP_NUM_THREADS=1 ./pool.x 15000 10000` | 2.382 ± 0.005 | 2.373 | 2.390 | 30.53 ± 2.37 | | `env OMP_NUM_THREADS=2 ./pool.x 15000 10000` | 1.211 ± 0.006 | 1.203 | 1.226 | 15.53 ± 1.21 | | `env OMP_NUM_THREADS=4 ./pool.x 15000 10000` | 0.618 ± 0.005 | 0.615 | 0.629 | 7.93 ± 0.62 | | `env OMP_NUM_THREADS=8 ./pool.x 15000 10000` | 0.318 ± 0.003 | 0.315 | 0.324 | 4.08 ± 0.32 | | `env OMP_NUM_THREADS=16 ./pool.x 15000 10000` | 0.170 ± 0.002 | 0.168 | 0.176 | 2.18 ± 0.17 | | `env OMP_NUM_THREADS=32 ./pool.x 15000 10000` | 0.107 ± 0.018 | 0.099 | 0.151 | 1.37 ± 0.25 | | `env OMP_NUM_THREADS=64 ./pool.x 15000 10000` | 0.079 ± 0.012 | 0.069 | 0.100 | 1.01 ± 0.17 | | `env OMP_NUM_THREADS=128 ./pool.x 15000 10000` | 0.078 ± 0.006 | 0.069 | 0.092 | 1.00 | | `env OMP_NUM_THREADS=1 ./pool.x 150000 10000` | 24.007 ± 0.049 | 23.915 | 24.110 | 307.71 ± 23.90 | | `env OMP_NUM_THREADS=2 ./pool.x 150000 10000` | 12.075 ± 0.025 | 12.040 | 12.108 | 154.78 ± 12.02 | | `env OMP_NUM_THREADS=4 ./pool.x 150000 10000` | 6.020 ± 0.016 | 6.007 | 6.063 | 77.16 ± 5.99 | | `env OMP_NUM_THREADS=8 ./pool.x 150000 10000` | 3.039 ± 0.008 | 3.029 | 3.053 | 38.96 ± 3.03 | | `env OMP_NUM_THREADS=16 ./pool.x 150000 10000` | 1.546 ± 0.007 | 1.540 | 1.563 | 19.82 ± 1.54 | | `env OMP_NUM_THREADS=32 ./pool.x 150000 10000` | 0.862 ± 0.162 | 0.808 | 1.322 | 11.05 ± 2.24 | | `env OMP_NUM_THREADS=64 ./pool.x 150000 10000` | 0.536 ± 0.122 | 0.454 | 0.715 | 6.88 ± 1.66 | | `env OMP_NUM_THREADS=128 ./pool.x 150000 10000` | 0.445 ± 0.064 | 0.352 | 0.488 | 5.70 ± 0.93 | | `env OMP_NUM_THREADS=1 ./pool.x 1500000 10000` | 252.053 ± 0.363 | 251.614 | 252.718 | 3230.76 ± 250.86 | | `env OMP_NUM_THREADS=2 ./pool.x 1500000 10000` | 126.822 ± 0.223 | 126.404 | 127.076 | 1625.58 ± 126.23 | | `env OMP_NUM_THREADS=4 ./pool.x 1500000 10000` | 63.302 ± 0.064 | 63.252 | 63.426 | 811.39 ± 63.00 | | `env OMP_NUM_THREADS=8 ./pool.x 1500000 10000` | 32.000 ± 0.096 | 31.884 | 32.190 | 410.16 ± 31.87 | | `env OMP_NUM_THREADS=16 ./pool.x 1500000 10000` | 16.283 ± 0.141 | 16.171 | 16.668 | 208.71 ± 16.30 | | `env OMP_NUM_THREADS=32 ./pool.x 1500000 10000` | 8.785 ± 1.570 | 8.200 | 13.251 | 112.60 ± 21.94 | | `env OMP_NUM_THREADS=64 ./pool.x 1500000 10000` | 4.997 ± 1.045 | 4.483 | 6.980 | 64.05 ± 14.29 | | `env OMP_NUM_THREADS=128 ./pool.x 1500000 10000` | 4.722 ± 0.432 | 4.224 | 5.187 | 60.53 ± 7.26 | | `env OMP_NUM_THREADS=1 ./store.x 15000 10000` | 2.382 ± 0.005 | 2.372 | 2.387 | 33.43 ± 2.99 | | `env OMP_NUM_THREADS=2 ./store.x 15000 10000` | 1.210 ± 0.003 | 1.205 | 1.215 | 16.98 ± 1.52 | | `env OMP_NUM_THREADS=4 ./store.x 15000 10000` | 0.612 ± 0.003 | 0.609 | 0.620 | 8.59 ± 0.77 | | `env OMP_NUM_THREADS=8 ./store.x 15000 10000` | 0.318 ± 0.003 | 0.315 | 0.323 | 4.47 ± 0.40 | | `env OMP_NUM_THREADS=16 ./store.x 15000 10000` | 0.169 ± 0.002 | 0.168 | 0.176 | 2.37 ± 0.21 | | `env OMP_NUM_THREADS=32 ./store.x 15000 10000` | 0.100 ± 0.009 | 0.097 | 0.148 | 1.41 ± 0.18 | | `env OMP_NUM_THREADS=64 ./store.x 15000 10000` | 0.077 ± 0.012 | 0.066 | 0.095 | 1.08 ± 0.19 | | `env OMP_NUM_THREADS=128 ./store.x 15000 10000` | 0.071 ± 0.006 | 0.062 | 0.082 | 1.00 | | `env OMP_NUM_THREADS=1 ./store.x 150000 10000` | 23.973 ± 0.036 | 23.903 | 24.025 | 336.49 ± 30.12 | | `env OMP_NUM_THREADS=2 ./store.x 150000 10000` | 12.072 ± 0.027 | 12.041 | 12.112 | 169.44 ± 15.17 | | `env OMP_NUM_THREADS=4 ./store.x 150000 10000` | 6.036 ± 0.010 | 6.017 | 6.045 | 84.72 ± 7.58 | | `env OMP_NUM_THREADS=8 ./store.x 150000 10000` | 3.036 ± 0.006 | 3.027 | 3.044 | 42.62 ± 3.81 | | `env OMP_NUM_THREADS=16 ./store.x 150000 10000` | 1.548 ± 0.006 | 1.540 | 1.563 | 21.72 ± 1.95 | | `env OMP_NUM_THREADS=32 ./store.x 150000 10000` | 0.860 ± 0.162 | 0.804 | 1.320 | 12.07 ± 2.51 | | `env OMP_NUM_THREADS=64 ./store.x 150000 10000` | 0.451 ± 0.002 | 0.446 | 0.454 | 6.33 ± 0.57 | | `env OMP_NUM_THREADS=128 ./store.x 150000 10000` | 0.377 ± 0.070 | 0.321 | 0.460 | 5.29 ± 1.09 | | `env OMP_NUM_THREADS=1 ./store.x 1500000 10000` | 252.150 ± 0.548 | 251.489 | 253.292 | 3539.33 ± 316.82 | | `env OMP_NUM_THREADS=2 ./store.x 1500000 10000` | 126.658 ± 0.222 | 126.392 | 127.025 | 1777.84 ± 159.13 | | `env OMP_NUM_THREADS=4 ./store.x 1500000 10000` | 63.351 ± 0.167 | 63.225 | 63.767 | 889.24 ± 79.61 | | `env OMP_NUM_THREADS=8 ./store.x 1500000 10000` | 32.089 ± 0.124 | 31.990 | 32.323 | 450.42 ± 40.34 | | `env OMP_NUM_THREADS=16 ./store.x 1500000 10000` | 16.178 ± 0.029 | 16.123 | 16.217 | 227.08 ± 20.32 | | `env OMP_NUM_THREADS=32 ./store.x 1500000 10000` | 8.716 ± 1.606 | 8.196 | 13.288 | 122.35 ± 25.07 | | `env OMP_NUM_THREADS=64 ./store.x 1500000 10000` | 5.742 ± 1.306 | 4.485 | 7.028 | 80.60 ± 19.71 | | `env OMP_NUM_THREADS=128 ./store.x 1500000 10000` | 4.655 ± 0.388 | 4.178 | 5.092 | 65.34 ± 7.99 | # Earlier results | Command | Mean [s] | Min [s] | Max [s] | Relative | |:---|---:|---:|---:|---:| | `env OMP_NUM_THREADS=1 ./local.x` | 252.690 ± 0.537 | 251.900 | 253.580 | 95.97 ± 4.06 | | `env OMP_NUM_THREADS=2 ./local.x` | 131.344 ± 0.792 | 130.341 | 132.464 | 49.89 ± 2.13 | | `env OMP_NUM_THREADS=4 ./local.x` | 66.160 ± 0.358 | 65.800 | 66.763 | 25.13 ± 1.07 | | `env OMP_NUM_THREADS=8 ./local.x` | 33.806 ± 1.214 | 33.336 | 37.258 | 12.84 ± 0.71 | | `env OMP_NUM_THREADS=16 ./local.x` | 17.527 ± 0.492 | 17.257 | 18.911 | 6.66 ± 0.34 | | `env OMP_NUM_THREADS=32 ./local.x` | 10.132 ± 0.066 | 9.984 | 10.221 | 3.85 ± 0.16 | | `env OMP_NUM_THREADS=64 ./local.x` | 8.964 ± 0.155 | 8.650 | 9.160 | 3.40 ± 0.16 | | `env OMP_NUM_THREADS=128 ./local.x` | 8.159 ± 0.082 | 8.049 | 8.264 | 3.10 ± 0.13 | | `env OMP_NUM_THREADS=1 ./global.x` | 232.101 ± 0.433 | 231.685 | 232.875 | 88.15 ± 3.73 | | `env OMP_NUM_THREADS=2 ./global.x` | 116.540 ± 0.198 | 116.321 | 116.881 | 44.26 ± 1.87 | | `env OMP_NUM_THREADS=4 ./global.x` | 58.225 ± 0.037 | 58.165 | 58.273 | 22.11 ± 0.94 | | `env OMP_NUM_THREADS=8 ./global.x` | 29.164 ± 0.022 | 29.139 | 29.212 | 11.08 ± 0.47 | | `env OMP_NUM_THREADS=16 ./global.x` | 14.649 ± 0.018 | 14.631 | 14.692 | 5.56 ± 0.24 | | `env OMP_NUM_THREADS=32 ./global.x` | 7.523 ± 0.043 | 7.457 | 7.568 | 2.86 ± 0.12 | | `env OMP_NUM_THREADS=64 ./global.x` | 4.065 ± 0.224 | 3.887 | 4.397 | 1.54 ± 0.11 | | `env OMP_NUM_THREADS=128 ./global.x` | 2.633 ± 0.111 | 2.526 | 2.772 | 1.00 | | `env OMP_NUM_THREADS=1 ./pool.x` | 252.478 ± 0.363 | 251.910 | 252.960 | 95.89 ± 4.06 | | `env OMP_NUM_THREADS=2 ./pool.x` | 126.735 ± 0.221 | 126.351 | 127.222 | 48.14 ± 2.04 | | `env OMP_NUM_THREADS=4 ./pool.x` | 63.365 ± 0.104 | 63.259 | 63.601 | 24.07 ± 1.02 | | `env OMP_NUM_THREADS=8 ./pool.x` | 31.992 ± 0.086 | 31.892 | 32.164 | 12.15 ± 0.51 | | `env OMP_NUM_THREADS=16 ./pool.x` | 16.220 ± 0.040 | 16.155 | 16.278 | 6.16 ± 0.26 | | `env OMP_NUM_THREADS=32 ./pool.x` | 8.253 ± 0.043 | 8.209 | 8.335 | 3.13 ± 0.13 | | `env OMP_NUM_THREADS=64 ./pool.x` | 4.787 ± 0.776 | 4.468 | 6.954 | 1.82 ± 0.30 | | `env OMP_NUM_THREADS=128 ./pool.x` | 4.566 ± 0.338 | 4.168 | 5.013 | 1.73 ± 0.15 |
×
Sign in
Email
Password
Forgot password
or
By clicking below, you agree to our
terms of service
.
Sign in via Facebook
Sign in via Twitter
Sign in via GitHub
Sign in via Dropbox
Sign in with Wallet
Wallet (
)
Connect another wallet
New to HackMD?
Sign up