commit | dc392094fc77ecc00e12cb9f47ba7168fcac1dcd | [log] [tgz] |
---|---|---|
author | George Steed <george.steed@arm.com> | Wed May 15 21:37:36 2024 +0100 |
committer | Frank Barchard <fbarchard@chromium.org> | Fri Jul 19 19:52:01 2024 +0000 |
tree | 12e819b4b28f709f813660a0cdb369ed183bc01f | |
parent | 776a509891796d07006d2aedfe40e26b1075b026 [diff] |
[AArch64] Unroll ScaleRowDown34_0_Box_NEON The additional parallel instruction streams provide a good benefit to little cores with limited out-of-order capability. Reduction in runtimes observed compared to the existing Neon implementation: Cortex-A55: -19.1% Cortex-A510: -31.6% Cortex-A520: -35.2% Cortex-A76: -14.3% Cortex-A715: +0.1% Cortex-A720: =0.0% Cortex-X1: -6.6% Cortex-X2: -0.1% Cortex-X3: -0.2% Cortex-X4: -7.2% Bug: b/42280945 Change-Id: Idca21a5af1dc6f189e644a81537d41f50ef66498 Reviewed-on: https://chromium-review.googlesource.com/c/libyuv/libyuv/+/5725171 Reviewed-by: Frank Barchard <fbarchard@chromium.org>
libyuv is an open source project that includes YUV scaling and conversion functionality.
See Getting started for instructions on how to get started developing.
You can also browse the docs directory for more documentation.