Memory bandwith and latency measurements (wICE)#

Memory bandwidth and latencies for main memory, as well as latencies for L2 and LLC cache. Measurements have been performed using Intel’s Memory Latency Checker (mlc).

Icelake CPUs#

Intel(R) Memory Latency Checker - v3.10
*** Unable to modify prefetchers (try executing 'modprobe msr')
*** So, enabling random access for latency measurements
Measuring idle latencies for random access (in ns)...
                Numa node
Numa node            0       1  
       0          89.2   142.7  
       1         142.9    89.0  

Measuring Peak Injection Memory Bandwidths for the system
Bandwidths are in MB/sec (1 MB/sec = 1,000,000 Bytes/sec)
Using all the threads from each core if Hyper-threading is enabled
Using traffic with the following read-write ratios
ALL Reads        :      351762.1        
3:1 Reads-Writes :      313084.7        
2:1 Reads-Writes :      297154.4        
1:1 Reads-Writes :      291841.4        
Stream-triad like:      320198.4        

Measuring Memory Bandwidths between nodes within system 
Bandwidths are in MB/sec (1 MB/sec = 1,000,000 Bytes/sec)
Using all the threads from each core if Hyper-threading is enabled
Using Read-only traffic type
                Numa node
Numa node            0       1  
       0        175690.9        55676.8 
       1        55692.3 175912.6        

Measuring Loaded Latencies for the system
Using all the threads from each core if Hyper-threading is enabled
Using Read-only traffic type
Inject  Latency Bandwidth
Delay   (ns)    MB/sec
==========================
 00000  264.04   351871.6
 00002  265.59   351747.1
 00008  259.87   351724.8
 00015  243.81   350234.2
 00050  184.82   327723.9
 00100  153.80   297201.4
 00200  109.56   173692.5
 00300  102.11   123303.1
 00400   99.73    95482.2
 00500   98.46    77827.2
 00700  100.06    56746.4
 01000   95.63    41144.3
 01300   95.68    32297.0
 01700   94.76    25179.4
 02500   94.09    17571.8
 03500   94.28    12837.9
 05000   93.89     9247.0
 09000   92.56     5483.5
 20000   91.23     2867.0

Measuring cache-to-cache transfer latency (in ns)...
Local Socket L2->L2 HIT  latency        60.1
Local Socket L2->L2 HITM latency        64.2
Remote Socket L2->L2 HITM latency (data address homed in writer socket)
                        Reader Numa Node
Writer Numa Node     0       1  
            0        -   127.1  
            1    127.2       -  
Remote Socket L2->L2 HITM latency (data address homed in reader socket)
                        Reader Numa Node
Writer Numa Node     0       1  
            0        -   127.0  
            1    127.4       -