Removing extra allocation dramatically improved the speed--from 1.5ns to 1.14ns.
1 file changed