This program measures the latency of copy operations performed with specific instruction sets on arm64 platforms.
If your arm64 platform can be served by a PM server (a.k.a. fx serve), adding //zircon/kernel/arch/arm64/user-copy:benchmarks is enough. Otherwise, add //zircon/kernel/arch/arm64/user-copy:benchmarks-standalone to your base labels.
Once the build completes, the following binaries will be available on your Fuchsia instance:
The three binaries share the same set of command line arguments:
Sampling is performed as follows: for each (block size, source alignment, destination alignment) triplet, a warm-up is performed first, which consists of an arbitrary number of copies of arbitrary values into the range.
To reduce variance and noise, each measurement times kSampleCount copies in sequence, and the total is divided by kSampleCount to obtain the average latency of a single copy.
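
The sampling procedure described above can be sketched roughly as follows. This is a minimal illustration, not the benchmark's actual code: the value of kSampleCount, the kWarmUpRounds and kMaxAlignment constants, the CopyFn type, and the MeasureAverageNs and ReferenceCopy helpers are all assumptions made for the example, and std::memcpy stands in for the copy routine under test.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Assumed values for illustration; the real constants live in the benchmark sources.
constexpr size_t kSampleCount = 64;
constexpr size_t kWarmUpRounds = 8;    // hypothetical
constexpr size_t kMaxAlignment = 64;   // hypothetical upper bound on the offsets

// Hypothetical signature for a copy routine under test.
using CopyFn = void* (*)(void* dst, const void* src, size_t len);

// Stand-in for a copy routine; the real benchmark exercises arm64-specific copies.
void* ReferenceCopy(void* dst, const void* src, size_t len) {
  return std::memcpy(dst, src, len);
}

// Returns the average time in nanoseconds of one copy for a single
// (block size, source alignment, destination alignment) triplet.
// Alignments are modeled here as byte offsets from the start of each buffer.
double MeasureAverageNs(CopyFn copy, size_t block_size, size_t src_align, size_t dst_align) {
  assert(src_align < kMaxAlignment && dst_align < kMaxAlignment);

  std::vector<uint8_t> src(block_size + kMaxAlignment, 0xAB);
  std::vector<uint8_t> dst(block_size + kMaxAlignment, 0);
  uint8_t* s = src.data() + src_align;
  uint8_t* d = dst.data() + dst_align;

  // Warm-up: a few untimed copies into the destination range.
  for (size_t i = 0; i < kWarmUpRounds; ++i) {
    copy(d, s, block_size);
  }

  // Time kSampleCount copies back to back, then average.
  auto start = std::chrono::steady_clock::now();
  for (size_t i = 0; i < kSampleCount; ++i) {
    copy(d, s, block_size);
  }
  auto end = std::chrono::steady_clock::now();

  // Read the destination through a volatile so the copies are observable.
  volatile uint8_t sink = d[0];
  (void)sink;

  double total_ns = std::chrono::duration<double, std::nano>(end - start).count();
  return total_ns / static_cast<double>(kSampleCount);
}
```

A call such as MeasureAverageNs(ReferenceCopy, 256, 3, 5) would yield the average latency for 256-byte blocks with the source offset by 3 bytes and the destination by 5. Timing the whole batch rather than each copy keeps the clock-read overhead from dominating small block sizes.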