• Lynn Boger's avatar
    crypto/cipher: improve xorBytesVSX asm for ppc64x · d979ac33
    Lynn Boger authored
    This improves the performance of xorBytesVSX in crypto/cipher by
    unrolling the loop that does the stores. Improvement on power9:
    
    name                 old time/op    new time/op    delta
    XORBytes/8Bytes        17.9ns ± 0%    18.2ns ± 0%   +1.53%  (p=0.029 n=4+4)
    XORBytes/128Bytes      24.4ns ± 0%    22.5ns ± 0%   -7.79%  (p=0.029 n=4+4)
    XORBytes/2048Bytes      131ns ± 0%     109ns ± 0%  -16.79%  (p=0.029 n=4+4)
    XORBytes/32768Bytes    1.74µs ± 0%    1.43µs ± 8%  -18.04%  (p=0.029 n=4+4)
    
    Change-Id: I75bd625d3ae9daa7bda54c523028671ab036b13d
    Reviewed-on: https://go-review.googlesource.com/c/go/+/197058
    Run-TryBot: Lynn Boger <laboger@linux.vnet.ibm.com>
    TryBot-Result: Gobot Gobot <gobot@golang.org>
    Reviewed-by: default avatarCarlos Eduardo Seo <cseo@linux.vnet.ibm.com>
    d979ac33
xor_ppc64x.s 2.03 KB