openssl/crypto/bn/asm
Andy Polyakov f8cee9d081 bn/asm/armv4-gf2m.pl, modes/asm/ghash-armv4.pl: faster multiplication
algorithm suggested in following paper:

Câmara, D.; Gouvêa, C. P. L.; López, J. & Dahab, R.: Fast Software
Polynomial Multiplication on ARM Processors using the NEON Engine.

http://conradoplg.cryptoland.net/files/2010/12/mocrysen13.pdf
2014-04-24 10:24:53 +02:00
x86
.cvsignore
alpha-mont.pl
armv4-gf2m.pl bn/asm/armv4-gf2m.pl, modes/asm/ghash-armv4.pl: faster multiplication 2014-04-24 10:24:53 +02:00
armv4-mont.pl bn/asm/armv4-mont.pl: add NEON code path. 2013-12-04 22:37:49 +01:00
bn-586.pl
bn-c64xplus.asm
c64xplus-gf2m.pl
co-586.pl
ia64-mont.pl
ia64.S
mips-mont.pl MIPS assembly pack: get rid of deprecated instructions. 2013-10-13 13:14:52 +02:00
mips.pl MIPS assembly pack: get rid of deprecated instructions. 2013-10-13 13:14:52 +02:00
modexp512-x86_64.pl
pa-risc2.s
pa-risc2W.s
parisc-mont.pl
ppc64-mont.pl ppc64-mont.pl: eliminate dependency on GPRs' upper halves. 2013-11-27 22:50:00 +01:00
ppc-mont.pl PPC assembly pack: add .size directives. 2013-10-15 00:14:39 +02:00
ppc.pl PPC assembly pack: add .size directives. 2013-10-15 00:14:39 +02:00
README
rsaz-avx2.pl crypto/bn/rsaz*: fix licensing note. 2013-12-03 22:08:29 +01:00
rsaz-x86_64.pl crypto/bn/asm/rsaz-x86_64.pl: make it work on Win64. 2013-12-03 22:28:48 +01:00
s390x-gf2m.pl
s390x-mont.pl
s390x.S
sparct4-mont.pl
sparcv8.S
sparcv8plus.S
sparcv9-gf2m.pl
sparcv9-mont.pl
sparcv9a-mont.pl
via-mont.pl
vis3-mont.pl
vms.mar
x86_64-gcc.c
x86_64-gf2m.pl
x86_64-mont5.pl bn/asm/x86_64-mont5.pl: fix compilation error on Solaris. 2014-01-09 13:44:59 +01:00
x86_64-mont.pl bn/asm/x86_64-mont5.pl: add MULX/AD*X code path. 2013-12-09 21:02:24 +01:00
x86-gf2m.pl
x86-mont.pl
x86.pl

<OBSOLETE>

All assembler files in this directory are just versions of the file
crypto/bn/bn_asm.c.
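
To make concrete the kind of routine these files replace, here is a minimal C
sketch of a word-by-word multiply-accumulate, the style of primitive that
bn_asm.c provides (bn_mul_add_words, for example). The name
sketch_mul_add_words, the fixed 32-bit word size and the size_t length are
assumptions made for illustration; this is not the code from bn_asm.c.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Illustrative sketch only (hypothetical name, not the OpenSSL routine):
 * computes r[i] += a[i] * w for i in [0, n), propagating the carry from
 * word to word, and returns the carry out of the top word.
 */
static uint32_t sketch_mul_add_words(uint32_t *r, const uint32_t *a,
                                     size_t n, uint32_t w)
{
    uint64_t carry = 0;

    for (size_t i = 0; i < n; i++) {
        /* (2^32-1)^2 + 2*(2^32-1) = 2^64-1, so this sum cannot overflow. */
        uint64_t t = (uint64_t)a[i] * w + r[i] + carry;

        r[i] = (uint32_t)t;   /* low word stays in place */
        carry = t >> 32;      /* high word moves to the next position */
    }
    return (uint32_t)carry;
}
```

The sketch relies on a double-width intermediate (uint64_t for 32-bit words).
Hand-written assembler gets the same double-width product directly from the
multiply instruction.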

Quite a few of these files are just the assembler output from gcc, since on
many machines that output is 2 times faster than what the system compiler
produces.

For the x86, I have hand-written the assembler because of the bad job all
compilers seem to do on it.  This normally gives a 2 times speedup in the RSA
routines.

For the DEC Alpha, I also hand-wrote the assembler (except the division, which
is just the output from the C compiler pasted on the end of the file).
On the 2 Alpha C compilers I had access to, it was not possible to do
64b x 64b -> 128b calculations (both the long and the long long data types
were 64 bits).  So the hand-written assembler gives access to the 128-bit
result and a 2 times speedup :-).
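
For illustration, this is roughly what C code has to do when no type wider
than 64 bits is available: split each operand into 32-bit halves and combine
four partial products. The helper name mul64x64_128 is hypothetical and the
code is a sketch of the general technique, not anything taken from this
directory.

```c
#include <stdint.h>

/*
 * Hypothetical helper: full 64b x 64b -> 128b product using only 64-bit
 * arithmetic.  The result is returned as two 64-bit halves.
 */
static void mul64x64_128(uint64_t a, uint64_t b, uint64_t *hi, uint64_t *lo)
{
    uint64_t a_lo = a & 0xffffffffu, a_hi = a >> 32;
    uint64_t b_lo = b & 0xffffffffu, b_hi = b >> 32;

    uint64_t p0 = a_lo * b_lo;   /* contributes to bits   0..63  */
    uint64_t p1 = a_lo * b_hi;   /* contributes to bits  32..95  */
    uint64_t p2 = a_hi * b_lo;   /* contributes to bits  32..95  */
    uint64_t p3 = a_hi * b_hi;   /* contributes to bits  64..127 */

    /* Sum of three values below 2^32, so it cannot overflow 64 bits. */
    uint64_t mid = (p0 >> 32) + (p1 & 0xffffffffu) + (p2 & 0xffffffffu);

    *lo = (mid << 32) | (p0 & 0xffffffffu);
    *hi = p3 + (p1 >> 32) + (p2 >> 32) + (mid >> 32);
}
```

That is four multiplications plus a handful of shifts and additions per word
pair, whereas assembler can read the high half of the product from the
hardware directly, which is consistent with the speedup described above.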

There are 3 versions of the assembler for the HP PA-RISC.

pa-risc.s is the original one, which works fine and was generated using gcc :-)

pa-risc2W.s and pa-risc2.s are the 64-bit and 32-bit PA-RISC 2.0
implementations, respectively, by Chris Ruemmler from HP (with some help from
the HP C compiler).

</OBSOLETE>