    x86/entry/64: Use 'xorl' for faster register clearing · ced5d0bf
    Dominik Brodowski authored
    On some x86 CPU microarchitectures using 'xorq' to clear general-purpose
    registers is slower than 'xorl'. As 'xorl' is sufficient to clear all
    64 bits of these registers due to zero-extension [*], switch the x86
    64-bit entry code to use 'xorl'.
    
    No change in functionality and no change in code size.
    
    [*] According to the Intel 64 and IA-32 Architectures Software Developer's
        Manual, section 3.4.1.1, the result of an operation on 32-bit operands
        is "zero-extended to a 64-bit result in the destination general-purpose
        register." The AMD64 Architecture Programmer’s Manual Volume 3,
        Appendix B.1, describes the same behaviour.
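
The zero-extension behaviour cited above can be checked from user space. The following is a minimal sketch, not part of the patch: the helper name `clear_with_xorl` is hypothetical, and it assumes a GCC- or Clang-compatible compiler (the `%k0` operand modifier selects the 32-bit name of the register). On non-x86-64 targets it falls back to plain C that only models the result.

```c
#include <stdint.h>

/*
 * Hypothetical helper (illustration only, not kernel code):
 * clear a 64-bit general-purpose register with a 32-bit 'xorl'.
 */
static uint64_t clear_with_xorl(uint64_t value)
{
#if defined(__x86_64__)
	/*
	 * 'value' lives in a 64-bit register; %k0 names its low 32 bits
	 * (for example %eax for %rax). Writing the 32-bit register
	 * zero-extends the result into bits 63:32.
	 */
	__asm__ ("xorl %k0, %k0" : "+r" (value) : : "cc");
#else
	/* Assumption: on other architectures, just model the outcome. */
	value = (uint64_t)((uint32_t)value ^ (uint32_t)value);
#endif
	return value;
}
```

Calling `clear_with_xorl()` with a value that has bits set in the upper half, such as `0xdeadbeef00000000`, should return 0 if the upper 32 bits are cleared as the manuals describe.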
    Suggested-by: Denys Vlasenko <dvlasenk@redhat.com>
    Signed-off-by: Dominik Brodowski <linux@dominikbrodowski.net>
    Cc: Andy Lutomirski <luto@kernel.org>
    Cc: Arjan van de Ven <arjan@linux.intel.com>
    Cc: Borislav Petkov <bp@alien8.de>
    Cc: Dan Williams <dan.j.williams@intel.com>
    Cc: Dave Hansen <dave.hansen@linux.intel.com>
    Cc: David Woodhouse <dwmw2@infradead.org>
    Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Cc: Josh Poimboeuf <jpoimboe@redhat.com>
    Cc: Linus Torvalds <torvalds@linux-foundation.org>
    Cc: Peter Zijlstra <peterz@infradead.org>
    Cc: Thomas Gleixner <tglx@linutronix.de>
    Link: http://lkml.kernel.org/r/20180214175924.23065-3-linux@dominikbrodowski.net
    [ Improved on the changelog a bit. ]
    Signed-off-by: Ingo Molnar <mingo@kernel.org>
entry_64_compat.S 13.1 KB