Memory bandwidth and latency measurements KU Leuven Tier-2
This page reports memory bandwidth and latencies for main memory, as well as
latencies for L2 and LLC cache. The measurements were performed using Intel's
Memory Latency Checker (mlc and mlc_avx512).
cascadelake nodes Tier-2
    Intel(R) Memory Latency Checker - v3.5
    Measuring idle latencies (in ns)...
                    Numa node
    Numa node            0         1
           0          67.5     131.4
           1         131.1      66.5

    Measuring Peak Injection Memory Bandwidths for the system
    Bandwidths are in MB/sec (1 MB/sec = 1,000,000 Bytes/sec)
    Using all the threads from each core if Hyper-threading is enabled
    Using traffic with the following read-write ratios
    ALL Reads        :      224913.3
    3:1 Reads-Writes :      206783.1
    2:1 Reads-Writes :      205568.1
    1:1 Reads-Writes :      199691.7
    Stream-triad like:      185847.0

    Measuring Memory Bandwidths between nodes within system
    Bandwidths are in MB/sec (1 MB/sec = 1,000,000 Bytes/sec)
    Using all the threads from each core if Hyper-threading is enabled
    Using Read-only traffic type
                    Numa node
    Numa node            0         1
           0      110086.1   34345.8
           1       34339.7  113429.0

    Measuring Loaded Latencies for the system
    Using all the threads from each core if Hyper-threading is enabled
    Using Read-only traffic type
    Inject  Latency  Bandwidth
    Delay   (ns)     MB/sec
    ==========================
     00000  149.31   223590.8
     00002  150.53   221597.4
     00008  150.32   220448.9
     00015  150.29   221072.1
     00050  141.78   217282.2
     00100  118.47   184049.5
     00200   92.41   121656.4
     00300   84.10    87825.2
     00400   80.31    68195.4
     00500   78.99    55429.8
     00700   80.58    40627.3
     01000   74.68    29264.2
     01300   73.08    22907.2
     01700   72.76    17823.6
     02500   80.48    12349.6
     03500   70.86     9214.6
     05000   70.28     6741.1
     09000   69.50     4170.8
     20000   69.57     2384.7

    Measuring cache-to-cache transfer latency (in ns)...
    Local Socket L2->L2 HIT  latency     48.2
    Local Socket L2->L2 HITM latency     48.3
    Remote Socket L2->L2 HITM latency (data address homed in writer socket)
                         Reader Numa Node
    Writer Numa Node         0         1
                0            -     111.8
                1        111.8         -
    Remote Socket L2->L2 HITM latency (data address homed in reader socket)
                         Reader Numa Node
    Writer Numa Node         0         1
                0            -     165.3
                1        169.2         -
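To put the NUMA figures for the cascadelake nodes in perspective, the following minimal sketch computes the remote-to-local ratio for idle latency and read-only bandwidth. The numbers are copied from the mlc output above; the variable names and the helper `local_remote_means` are illustrative assumptions and not part of mlc.

```python
# Values copied from the mlc output for the cascadelake nodes.
# Rows and columns correspond to NUMA nodes 0 and 1.
idle_latency_ns = [[67.5, 131.4],
                   [131.1, 66.5]]
read_bandwidth_mb_s = [[110086.1, 34345.8],
                       [34339.7, 113429.0]]


def local_remote_means(matrix):
    """Return (mean of diagonal entries, mean of off-diagonal entries).

    Hypothetical helper: diagonal entries are local accesses,
    off-diagonal entries are accesses to the other NUMA node.
    """
    n = len(matrix)
    local = [matrix[i][i] for i in range(n)]
    remote = [matrix[i][j] for i in range(n) for j in range(n) if i != j]
    return sum(local) / len(local), sum(remote) / len(remote)


lat_local, lat_remote = local_remote_means(idle_latency_ns)
bw_local, bw_remote = local_remote_means(read_bandwidth_mb_s)

print(f"idle latency:   local {lat_local:.1f} ns, remote {lat_remote:.1f} ns, "
      f"remote/local = {lat_remote / lat_local:.2f}")
print(f"read bandwidth: local {bw_local:.0f} MB/s, remote {bw_remote:.0f} MB/s, "
      f"remote/local = {bw_remote / bw_local:.2f}")
```

With these figures, a remote access has roughly twice the idle latency of a local one, and remote read bandwidth is about 31% of local read bandwidth.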
skylake nodes Tier-2
    Intel(R) Memory Latency Checker - v3.5
    Measuring idle latencies (in ns)...
                    Numa node
    Numa node            0         1
           0          81.7     138.7
           1         135.1      80.5

    Measuring Peak Injection Memory Bandwidths for the system
    Bandwidths are in MB/sec (1 MB/sec = 1,000,000 Bytes/sec)
    Using all the threads from each core if Hyper-threading is enabled
    Using traffic with the following read-write ratios
    ALL Reads        :      214070.8
    3:1 Reads-Writes :      193785.9
    2:1 Reads-Writes :      192831.8
    1:1 Reads-Writes :      185078.5
    Stream-triad like:      174210.1

    Measuring Memory Bandwidths between nodes within system
    Bandwidths are in MB/sec (1 MB/sec = 1,000,000 Bytes/sec)
    Using all the threads from each core if Hyper-threading is enabled
    Using Read-only traffic type
                    Numa node
    Numa node            0         1
           0      107714.3   34296.0
           1       34335.8  106946.9

    Measuring Loaded Latencies for the system
    Using all the threads from each core if Hyper-threading is enabled
    Using Read-only traffic type
    Inject  Latency  Bandwidth
    Delay   (ns)     MB/sec
    ==========================
     00000  158.98   213853.9
     00002  159.23   213136.8
     00008  158.80   212548.8
     00015  158.82   213148.0
     00050  148.20   207473.3
     00100  116.89   169119.3
     00200  101.80   111879.4
     00300   95.37    80124.9
     00400   92.73    61870.6
     00500   90.39    50648.1
     00700   88.17    37060.4
     01000   86.13    26585.2
     01300   85.57    20769.8
     01700   83.69    16167.2
     02500   83.13    11305.0
     03500   82.30     8327.4
     05000   82.45     6071.2
     09000   82.42     3720.2
     20000   82.30     2106.4

    Measuring cache-to-cache transfer latency (in ns)...
    Local Socket L2->L2 HIT  latency     51.1
    Local Socket L2->L2 HITM latency     51.1
    Remote Socket L2->L2 HITM latency (data address homed in writer socket)
                         Reader Numa Node
    Writer Numa Node         0         1
                0            -     115.3
                1        116.3         -
    Remote Socket L2->L2 HITM latency (data address homed in reader socket)
                         Reader Numa Node
    Writer Numa Node         0         1
                0            -     178.8
                1        180.5         -
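As a rough way to compare the two node types, the sketch below computes the relative difference between the cascadelake and skylake results for three headline figures. The numbers are copied from the two mlc runs on this page; the dictionary layout and the `percent_change` helper are illustrative assumptions, and the local idle latency is taken as the mean of the two diagonal entries of each latency matrix.

```python
# Headline figures copied from the two mlc runs on this page.
results = {
    "cascadelake": {
        "local idle latency (ns)": (67.5 + 66.5) / 2,
        "ALL Reads (MB/s)": 224913.3,
        "Stream-triad like (MB/s)": 185847.0,
    },
    "skylake": {
        "local idle latency (ns)": (81.7 + 80.5) / 2,
        "ALL Reads (MB/s)": 214070.8,
        "Stream-triad like (MB/s)": 174210.1,
    },
}


def percent_change(new, old):
    """Relative change of `new` with respect to `old`, in percent."""
    return 100.0 * (new - old) / old


for metric, sky in results["skylake"].items():
    cas = results["cascadelake"][metric]
    print(f"{metric}: skylake {sky:.1f}, cascadelake {cas:.1f}, "
          f"change {percent_change(cas, sky):+.1f}%")
```

For latency a negative change means the cascadelake nodes are faster; for the bandwidth rows a positive change means they sustain more traffic.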