commit | a4ccf9940e2389c3c791c71a0aec53099bc0fee0 | |
---|---|---|
author | George Steed <george.steed@arm.com> | Thu Apr 18 16:18:10 2024 +0100 |
committer | Frank Barchard <fbarchard@chromium.org> | Tue Jul 16 17:32:52 2024 +0000 |
tree | b2b9be4ecd4c259a6471983529def1dedd93e7ae | |
parent | 302d29d1a8415ebcb4ef5d3dda632df6c136cfa6 | |
[AArch64] Add I8MM implementation of ARGBToUV444Row

We cannot use the standard dot-product instructions since the coefficients multiplication results are both added and subtracted, but I8MM supports mixed-sign dot products which work well here.

We need to add an additional variant of the coefficient structs since we need negative constants for the elements that were previously subtracted.

Reduction in runtimes observed compared to the previous Neon implementation:

    Cortex-A510: -37.3%
    Cortex-A520: -31.1%
    Cortex-A715: -37.1%
    Cortex-A720: -37.0%
    Cortex-X2: -62.1%
    Cortex-X3: -62.2%
    Cortex-X4: -40.4%

Bug: libyuv:977
Change-Id: Idc3d9a6408c30e1bce3816a1ed926ecd76792236
Reviewed-on: https://chromium-review.googlesource.com/c/libyuv/libyuv/+/5712928
Reviewed-by: Frank Barchard <fbarchard@chromium.org>
Reviewed-by: Justin Green <greenjustin@google.com>
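The mixed-sign dot product described in the commit message can be modelled in scalar C. The sketch below is illustrative only and is not the code from this change: `usdot_lane`, `kCoeffU`, `kCoeffV`, and `argb_to_uv444_pixel` are hypothetical names, the coefficient values are assumed BT.601-style constants, and the byte order B, G, R, A is assumed for an ARGB pixel in memory. Each helper call models one 32-bit lane of an unsigned-by-signed dot product, where four unsigned pixel bytes are multiplied by four signed coefficients and summed into an accumulator. Storing the previously subtracted coefficients as negative values is what lets a single accumulate step replace the separate add and subtract steps.

```c
#include <stdint.h>

/* Scalar model of one 32-bit lane of a mixed-sign dot product:
 * four unsigned bytes times four signed bytes, added to an accumulator. */
static int32_t usdot_lane(int32_t acc, const uint8_t u[4], const int8_t s[4]) {
  for (int i = 0; i < 4; ++i) {
    acc += (int32_t)u[i] * (int32_t)s[i];
  }
  return acc;
}

/* Assumed coefficients, in B, G, R, A order. The terms that would be
 * subtracted in an unsigned formulation are stored as negative constants. */
static const int8_t kCoeffU[4] = {112, -74, -38, 0};
static const int8_t kCoeffV[4] = {-18, -94, 112, 0};

/* Convert one pixel (B, G, R, A bytes) to U and V. The 0x8080 bias adds
 * the 128 offset plus rounding before the shift by 8; with these
 * coefficients the biased sum is always positive. */
static void argb_to_uv444_pixel(const uint8_t bgra[4], uint8_t* u, uint8_t* v) {
  const int32_t bias = 0x8080;
  *u = (uint8_t)(usdot_lane(bias, bgra, kCoeffU) >> 8);
  *v = (uint8_t)(usdot_lane(bias, bgra, kCoeffV) >> 8);
}
```

Under these assumptions a grey or white pixel maps to U = V = 128, because each coefficient row sums to zero over B, G, and R.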
libyuv is an open source project that includes YUV scaling and conversion functionality.
See Getting started for instructions on how to get started developing.
You can also browse the docs directory for more documentation.