openssl/crypto/aes
Andy Polyakov 64d92d7498 x86_64 assembly pack: "optimize" for Knights Landing, add AVX-512 results.
"Optimize" is in quotes because it's rather a "salvage operation"
for now. The idea is to identify processor capability flags that
drive Knights Landing to suboptimal code paths and mask them.
Two flags were identified, XSAVE and ADCX/ADOX. The former affects
the choice of AES-NI code path specific to Silvermont (Knights Landing
is of Silvermont "ancestry"), and the 64-bit ADCX/ADOX instructions are
effectively mishandled at decode time. In both cases we are looking
at ~2x improvement.

AVX-512 results cover even Skylake-X :-)

Hardware used for benchmarking courtesy of Atos, experiments run by
Romain Dolbeau <romain.dolbeau@atos.net>. Kudos!

Reviewed-by: Rich Salz <rsalz@openssl.org>
2017-07-21 14:07:32 +02:00
..
asm x86_64 assembly pack: "optimize" for Knights Landing, add AVX-512 results. 2017-07-21 14:07:32 +02:00
aes_cbc.c Copyright consolidation 05/10 2016-05-17 15:38:09 -04:00
aes_cfb.c Copyright consolidation 05/10 2016-05-17 15:38:09 -04:00
aes_core.c Fix a few if(, for(, while( inside code. 2016-07-20 07:21:53 -04:00
aes_ecb.c Copyright consolidation 05/10 2016-05-17 15:38:09 -04:00
aes_ige.c Don't call memcpy if len is zero. 2017-02-20 19:17:53 -05:00
aes_locl.h Copyright consolidation 05/10 2016-05-17 15:38:09 -04:00
aes_misc.c Copyright consolidation 05/10 2016-05-17 15:38:09 -04:00
aes_ofb.c Copyright consolidation 05/10 2016-05-17 15:38:09 -04:00
aes_wrap.c Copyright consolidation 05/10 2016-05-17 15:38:09 -04:00
aes_x86core.c aes/aes_x86core.c: clarify reference implementation status. 2017-03-02 16:26:01 +01:00
build.info MIPS assembly pack: adapt it for MIPS[32|64]R6. 2016-09-02 13:33:17 +02:00