Memory bandwith and latency measurements KU Leuven Tier-2

Memory bandwidth and latencies for main memory, as well as latencies for L2 and LLC cache. Measurements have been performed using Intel’s Memory Latency Checker (mlc and mlc_avx512).

cascadelake nodes Tier-2

Intel(R) Memory Latency Checker - v3.5
Measuring idle latencies (in ns)...
                Numa node
Numa node            0       1  
       0          67.5   131.4  
       1         131.1    66.5  

Measuring Peak Injection Memory Bandwidths for the system
Bandwidths are in MB/sec (1 MB/sec = 1,000,000 Bytes/sec)
Using all the threads from each core if Hyper-threading is enabled
Using traffic with the following read-write ratios
ALL Reads        :      224913.3        
3:1 Reads-Writes :      206783.1        
2:1 Reads-Writes :      205568.1        
1:1 Reads-Writes :      199691.7        
Stream-triad like:      185847.0        

Measuring Memory Bandwidths between nodes within system 
Bandwidths are in MB/sec (1 MB/sec = 1,000,000 Bytes/sec)
Using all the threads from each core if Hyper-threading is enabled
Using Read-only traffic type
                Numa node
Numa node            0       1  
       0        110086.1        34345.8 
       1        34339.7 113429.0        

Measuring Loaded Latencies for the system
Using all the threads from each core if Hyper-threading is enabled
Using Read-only traffic type
Inject  Latency Bandwidth
Delay   (ns)    MB/sec
==========================
 00000  149.31   223590.8
 00002  150.53   221597.4
 00008  150.32   220448.9
 00015  150.29   221072.1
 00050  141.78   217282.2
 00100  118.47   184049.5
 00200   92.41   121656.4
 00300   84.10    87825.2
 00400   80.31    68195.4
 00500   78.99    55429.8
 00700   80.58    40627.3
 01000   74.68    29264.2
 01300   73.08    22907.2
 01700   72.76    17823.6
 02500   80.48    12349.6
 03500   70.86     9214.6
 05000   70.28     6741.1
 09000   69.50     4170.8
 20000   69.57     2384.7

Measuring cache-to-cache transfer latency (in ns)...
Local Socket L2->L2 HIT  latency        48.2
Local Socket L2->L2 HITM latency        48.3
Remote Socket L2->L2 HITM latency (data address homed in writer socket)
                        Reader Numa Node
Writer Numa Node     0       1  
            0        -   111.8  
            1    111.8       -  
Remote Socket L2->L2 HITM latency (data address homed in reader socket)
                        Reader Numa Node
Writer Numa Node     0       1  
            0        -   165.3  
            1    169.2       -  

skylake nodes Tier-2

Intel(R) Memory Latency Checker - v3.5
Measuring idle latencies (in ns)...
                Numa node
Numa node            0       1  
       0          81.7   138.7  
       1         135.1    80.5  

Measuring Peak Injection Memory Bandwidths for the system
Bandwidths are in MB/sec (1 MB/sec = 1,000,000 Bytes/sec)
Using all the threads from each core if Hyper-threading is enabled
Using traffic with the following read-write ratios
ALL Reads        :      214070.8        
3:1 Reads-Writes :      193785.9        
2:1 Reads-Writes :      192831.8        
1:1 Reads-Writes :      185078.5        
Stream-triad like:      174210.1        

Measuring Memory Bandwidths between nodes within system 
Bandwidths are in MB/sec (1 MB/sec = 1,000,000 Bytes/sec)
Using all the threads from each core if Hyper-threading is enabled
Using Read-only traffic type
                Numa node
Numa node            0       1  
       0        107714.3        34296.0 
       1        34335.8 106946.9        

Measuring Loaded Latencies for the system
Using all the threads from each core if Hyper-threading is enabled
Using Read-only traffic type
Inject  Latency Bandwidth
Delay   (ns)    MB/sec
==========================
 00000  158.98   213853.9
 00002  159.23   213136.8
 00008  158.80   212548.8
 00015  158.82   213148.0
 00050  148.20   207473.3
 00100  116.89   169119.3
 00200  101.80   111879.4
 00300   95.37    80124.9
 00400   92.73    61870.6
 00500   90.39    50648.1
 00700   88.17    37060.4
 01000   86.13    26585.2
 01300   85.57    20769.8
 01700   83.69    16167.2
 02500   83.13    11305.0
 03500   82.30     8327.4
 05000   82.45     6071.2
 09000   82.42     3720.2
 20000   82.30     2106.4

Measuring cache-to-cache transfer latency (in ns)...
Local Socket L2->L2 HIT  latency        51.1
Local Socket L2->L2 HITM latency        51.1
Remote Socket L2->L2 HITM latency (data address homed in writer socket)
                        Reader Numa Node
Writer Numa Node     0       1  
            0        -   115.3  
            1    116.3       -  
Remote Socket L2->L2 HITM latency (data address homed in reader socket)
                        Reader Numa Node
Writer Numa Node     0       1  
            0        -   178.8  
            1    180.5       -