60 lines
		
	
	
		
			2.4 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
			
		
		
	
	
			60 lines
		
	
	
		
			2.4 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
| ARM Cache Coherent Network
 | |
| ==========================
 | |
| 
 | |
| CCN-504 is a ring-bus interconnect consisting of 11 crosspoints
 | |
| (XPs), with each crosspoint supporting up to two device ports,
 | |
| so nodes (devices) 0 and 1 are connected to crosspoint 0,
 | |
| nodes 2 and 3 to crosspoint 1 etc.
 | |
| 
 | |
| PMU (perf) driver
 | |
| -----------------
 | |
| 
 | |
| The CCN driver registers a perf PMU driver, which provides
 | |
| description of available events and configuration options
 | |
| in sysfs, see /sys/bus/event_source/devices/ccn*.
 | |
| 
 | |
| The "format" directory describes format of the config, config1
 | |
| and config2 fields of the perf_event_attr structure. The "events"
 | |
| directory provides configuration templates for all documented
 | |
| events, that can be used with perf tool. For example "xp_valid_flit"
 | |
| is an equivalent of "type=0x8,event=0x4". Other parameters must be
 | |
| explicitly specified.
 | |
| 
 | |
| For events originating from device, "node" defines its index.
 | |
| 
 | |
| Crosspoint PMU events require "xp" (index), "bus" (bus number)
 | |
| and "vc" (virtual channel ID).
 | |
| 
 | |
| Crosspoint watchpoint-based events (special "event" value 0xfe)
 | |
| require "xp" and "vc" as as above plus "port" (device port index),
 | |
| "dir" (transmit/receive direction), comparator values ("cmp_l"
 | |
| and "cmp_h") and "mask", being index of the comparator mask.
 | |
| Masks are defined separately from the event description
 | |
| (due to limited number of the config values) in the "cmp_mask"
 | |
| directory, with first 8 configurable by user and additional
 | |
| 4 hardcoded for the most frequent use cases.
 | |
| 
 | |
| Cycle counter is described by a "type" value 0xff and does
 | |
| not require any other settings.
 | |
| 
 | |
| The driver also provides a "cpumask" sysfs attribute, which contains
 | |
| a single CPU ID, of the processor which will be used to handle all
 | |
| the CCN PMU events. It is recommended that the user space tools
 | |
| request the events on this processor (if not, the perf_event->cpu value
 | |
| will be overwritten anyway). In case of this processor being offlined,
 | |
| the events are migrated to another one and the attribute is updated.
 | |
| 
 | |
| Example of perf tool use:
 | |
| 
 | |
| / # perf list | grep ccn
 | |
|   ccn/cycles/                                        [Kernel PMU event]
 | |
| <...>
 | |
|   ccn/xp_valid_flit,xp=?,port=?,vc=?,dir=?/          [Kernel PMU event]
 | |
| <...>
 | |
| 
 | |
| / # perf stat -a -e ccn/cycles/,ccn/xp_valid_flit,xp=1,port=0,vc=1,dir=1/ \
 | |
|                                                                        sleep 1
 | |
| 
 | |
| The driver does not support sampling, therefore "perf record" will
 | |
| not work. Per-task (without "-a") perf sessions are not supported.
 | 
