openssl/crypto/bn/asm
Andy Polyakov 31ed9a2131 crypto/bn/rsaz*: fix licensing note.
rsaz_exp.c: harmonize line terminating;
asm/rsaz-*.pl: minor optimizations.
2013-12-03 22:08:29 +01:00
..
x86
.cvsignore
alpha-mont.pl alphacpuid.pl: fix alignment bug. 2011-08-12 12:28:52 +00:00
armv4-gf2m.pl armv4cpuid.S, armv4-gf2m.pl: make newest code compilable by older assembler. 2011-11-05 13:07:18 +00:00
armv4-mont.pl armv4-mont.pl: profiler-assisted optimization gives 8%-14% improvement 2011-08-13 12:38:41 +00:00
bn-586.pl
bn-c64xplus.asm C64x+ assembly pack: improve EABI support. 2012-11-28 13:19:10 +00:00
c64xplus-gf2m.pl C64x+ assembly pack: improve EABI support. 2012-11-28 13:19:10 +00:00
co-586.pl
ia64-mont.pl IA-64 assembler pack: fix typos and make it work on HP-UX. 2011-05-07 20:36:05 +00:00
ia64.S
mips-mont.pl MIPS assembly pack: get rid of deprecated instructions. 2013-10-13 13:14:52 +02:00
mips.pl MIPS assembly pack: get rid of deprecated instructions. 2013-10-13 13:14:52 +02:00
modexp512-x86_64.pl x86_64 assembly pack: make Windows build more robust. 2013-01-22 22:27:28 +01:00
pa-risc2.s
pa-risc2W.s
parisc-mont.pl PA-RISC assembler pack: switch to bve in 64-bit builds. 2013-06-18 10:37:00 +02:00
ppc64-mont.pl ppc64-mont.pl: eliminate dependency on GPRs' upper halves. 2013-11-27 22:50:00 +01:00
ppc-mont.pl PPC assembly pack: add .size directives. 2013-10-15 00:14:39 +02:00
ppc.pl PPC assembly pack: add .size directives. 2013-10-15 00:14:39 +02:00
README misspellings fixes by https://github.com/vlajos/misspell_fixer 2013-09-05 21:39:42 +01:00
rsaz-avx2.pl crypto/bn/rsaz*: fix licensing note. 2013-12-03 22:08:29 +01:00
rsaz-x86_64.pl crypto/bn/rsaz*: fix licensing note. 2013-12-03 22:08:29 +01:00
s390x-gf2m.pl s390x-gf2m.pl: commentary update (final performance numbers turned to be 2011-07-04 11:20:33 +00:00
s390x-mont.pl s390x assembler pack: tune-up and support for new z196 hardware. 2011-03-04 13:09:16 +00:00
s390x.S s390x.S: fix typo in bn_mul_words. 2010-11-22 21:55:07 +00:00
sparct4-mont.pl Optimize SPARC T4 MONTMUL support. 2013-06-18 10:39:38 +02:00
sparcv8.S
sparcv8plus.S misspellings fixes by https://github.com/vlajos/misspell_fixer 2013-09-05 21:39:42 +01:00
sparcv9-gf2m.pl SPARCv9 assembly pack: harmonize ABI handling (so that it's handled in one 2012-10-25 12:07:32 +00:00
sparcv9-mont.pl
sparcv9a-mont.pl misspellings fixes by https://github.com/vlajos/misspell_fixer 2013-09-05 21:39:42 +01:00
via-mont.pl
vis3-mont.pl misspellings fixes by https://github.com/vlajos/misspell_fixer 2013-09-05 21:39:42 +01:00
vms.mar
x86_64-gcc.c Add volatile qualifications to two blocks of inline asm to stop GCC from 2013-06-04 18:46:25 +01:00
x86_64-gf2m.pl x86_64-gf2m.pl: fix typo. 2013-03-01 22:36:36 +01:00
x86_64-mont5.pl bn/asm/*x86_64*.pl: correct assembler requirement for ad*x. 2013-10-14 22:41:00 +02:00
x86_64-mont.pl bn/asm/x86_64-mont.pl: minor optimization [for Decoded ICache]. 2013-10-25 10:14:20 +02:00
x86-gf2m.pl x86gas.pl: add palignr and move pclmulqdq. 2011-05-16 18:07:00 +00:00
x86-mont.pl x86-mont.pl: fix bug in integer-only squaring path. 2011-12-09 14:21:25 +00:00
x86.pl

<OBSOLETE>

All assember in this directory are just version of the file
crypto/bn/bn_asm.c.

Quite a few of these files are just the assember output from gcc since on 
quite a few machines they are 2 times faster than the system compiler.

For the x86, I have hand written assember because of the bad job all
compilers seem to do on it.  This normally gives a 2 time speed up in the RSA
routines.

For the DEC alpha, I also hand wrote the assember (except the division which
is just the output from the C compiler pasted on the end of the file).
On the 2 alpha C compilers I had access to, it was not possible to do
64b x 64b -> 128b calculations (both long and the long long data types
were 64 bits).  So the hand assember gives access to the 128 bit result and
a 2 times speedup :-).

There are 3 versions of assember for the HP PA-RISC.

pa-risc.s is the original one which works fine and generated using gcc :-)

pa-risc2W.s and pa-risc2.s are 64 and 32-bit PA-RISC 2.0 implementations
by Chris Ruemmler from HP (with some help from the HP C compiler).

</OBSOLETE>