9f0d1898206db17cdd9478af5fcfd7522ac1554e - third_party/swift-clang

commit	9f0d1898206db17cdd9478af5fcfd7522ac1554e	[log] [tgz]
author	Ahmed Bougacha <ahmed.bougacha@gmail.com>	Tue Apr 19 19:44:45 2016 +0000
committer	Mishal Shah <mishal_shah@apple.com>	Wed May 04 16:34:11 2016 -0700
tree	a6828fad94a6a94383a8b1e7cffa104ebc7005f4
parent	1e6cba3ce39043326f4278271a970f63322ba724 [diff]

[ARM NEON] Define vfms_f32 on ARM, and all vfms using vfma.

r259537 added vfma/vfms to armv7, but the builtin was only lowered
on the AArch64 side. Instead of supporting it on ARM, get rid of it.

The vfms builtin lowered to:
  %nb = fsub float -0.0, %b
  %r = @llvm.fma.f32(%a, %nb, %c)

Instead, define the operation in terms of vfma, and swap the
multiplicands. It now lowers to:
  %na = fsub float -0.0, %a
  %r = @llvm.fma.f32(%na, %b, %c)

This matches the instruction more closely, and lets current LLVM
generate the "natural" operand ordering:
  fmls.2s v0, v1, v2
instead of the crooked (but equivalent):
  fmls.2s v0, v2, v1
Except for theses changes, assembly is identical.

LLVM accepts both commutations, and the LLVM tests in:
  test/CodeGen/AArch64/arm64-fmadd.ll
  test/CodeGen/AArch64/fp-dp3.ll
  test/CodeGen/AArch64/neon-fma.ll
  test/CodeGen/ARM/fusedMAC.ll
already check either the new one only, or both.

Also verified against the test-suite unittests.

git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@266807 91177308-0d34-0410-b5e6-96231b3b80d8
(cherry picked from commit fa8ab2562a582a60fb7dff9d873b65d84ab864f4)

Conflicts:
	test/CodeGen/aarch64-neon-2velem.c
	test/CodeGen/aarch64-neon-fma.c
	test/CodeGen/aarch64-neon-intrinsics.c
	test/CodeGen/aarch64-neon-scalar-x-indexed-elem.c

5 files changed

tree: a6828fad94a6a94383a8b1e7cffa104ebc7005f4