mirror/nasm - nasm - Collaboration & Inovation

mirror of https://github.com/netwide-assembler/nasm.git synced 2025-02-23 17:29:23 +08:00

Author	SHA1	Message	Date
Jin Kyu Song	cc1dc9de53	AVX-512: Add EVEX encoding and new instructions EVEX encoding support includes 32 vector regs (XMM/YMM/ZMM), opmask, broadcasting, embedded rounding mode, suppress all exceptions, compressed displacement. Signed-off-by: Jin Kyu Song <jin.kyu.song@intel.com> Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2013-08-16 09:06:15 +04:00
Philipp Kloke	dae212d049	Fixed several resource and memory leaks Bug found by: CppCheck 1.59 (static source analysis tool) Signed-off-by: Philipp Kloke <philipp.kloke@web.de> Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2013-04-01 02:16:27 +04:00
Ben Rudiak-Gould	94ba02fa16	Make F2 and F3 SSE prefixes override 66 According to XED and experimentation, the 66 is ignored. Signed-off-by: Ben Rudiak-Gould <benrudiak@gmail.com> Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2013-03-10 21:46:12 +04:00
Ben Rudiak-Gould	6e87893f06	Drop SAME_AS flag from instruction matcher It was there to support the SSE5 DREX encoding, which as far as I know is dead forever. Signed-off-by: Ben Rudiak-Gould <benrudiak@gmail.com> Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2013-03-04 00:46:16 +04:00
Ben Rudiak-Gould	d1ac29a3cc	insns: Remove pushseg/popseg internal bytecodes This patch is getting rid of the following bytecodes 'pushseg','popseg','pushseg2','popseg2' and simplifies overall code. [gorcunov@: a few style fixes] Signed-off-by: Ben Rudiak-Gould <benrudiak@gmail.com> Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2013-03-03 20:50:46 +04:00
Cyrill Gorcunov	83e6924e1a	Move conditional opcodes close to enum ccode definition Thus if someone need to rework this code he won't need to jump between files trying to figure out where enum and opcodes lay. Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2013-03-03 14:34:31 +04:00
Cyrill Gorcunov	982387606b	assemble: Make emit_rex being a function Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2013-03-02 02:59:29 +04:00
Cyrill Gorcunov	59df421af3	assemble: Use case3/4 where appropriate This allows to shrink code a bit. Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2013-03-02 02:59:21 +04:00
Cyrill Gorcunov	62576a016d	assemble: Add case3 helper Signed-off-by: cyrill <cyrill@cyrills-MacBook-Pro.local>	2013-03-02 02:46:17 +04:00
Cyrill Gorcunov	c7ce6a4f22	process_ea: Drop redundant variable Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2013-03-02 02:45:53 +04:00
Ben Rudiak-Gould	4e8396b5cf	Remove +s It doesn't seem worth >200 lines of C and Perl to save ~50 lines in insns.dat. In order to make this work I had to rename sbyte16/sbyte32 so that they can take an ordinary size suffix (their size suffix was formerly treated specially). This fixes one disassembly bug: 48C7C000000080 disassembles to mov rax,0x80000000, which reassembles to B800000080, which loads a different value. Signed-off-by: Ben Rudiak-Gould <benrudiak@gmail.com> Acked-by: "H. Peter Anvin" <hpa@zytor.com> Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2013-03-01 10:28:32 +04:00
Ben Rudiak-Gould	d7ab1f9638	Add np and similar prefixes to instructions that should have them This adds "np" to a bunch of SSE-style instructions that should have it, "norep" (which was implemented but unused) on quasi-SSE instructions that use F2 and F3 as instruction extensions but 66 for operand size, "nof3" (newly implemented) on a few instructions, "norexw" on some instructions that have only 32-bit and 64-bit versions, and one NOLONG. It also removes some incorrect "np"s, changes some "f3"s to "f3i"s, and fixes the decoding of the XCHG/NOP/PAUSE mess: F390 is always PAUSE even when rex.b=1 (at least according to XED). Signed-off-by: Ben Rudiak-Gould <benrudiak@gmail.com> Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2013-02-20 23:25:54 +04:00
Cyrill Gorcunov	167917abe5	opflags: Extend opflags_t to 64 bits Soon we will need to encode 512 bits values thus there is no space left in our opflags_t which is 32 bitfield. Extend it to 64 bits width. Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2012-09-10 01:35:38 +04:00
H. Peter Anvin	e014f354d5	HLE: One more byte code conversion Add missing site for the \265..267 -> \271..273 byte code move. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2012-02-25 22:35:19 -08:00
H. Peter Anvin	574784d177	HLE: Move byte codes back to \271-\273 Since we are back to three bytecodes, move them back to the \271-\273 slot to free up the \264 complete quad. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2012-02-25 22:33:46 -08:00
H. Peter Anvin	fb3f4e6ddb	HLE: Change NOHLE to be an instruction flag The way our matching system works we have to make NOHLE an instruction flag rather than a byte code; by the time we run the byte code interpreter we have already picked an instruction pattern once and for all. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2012-02-25 22:22:07 -08:00
H. Peter Anvin	5a24fdd547	Make the LOCK and HLE warnings suppressable. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2012-02-25 15:11:11 -08:00
H. Peter Anvin	755f5214b7	Remove all remaining explicit bytecodes from insns.dat Get rid of the last vestiges of the explicit byte codes in insns.dat. The only files that now depend on actual byte code numbers are insns.pl, assemble.c and disasm.c. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2012-02-25 11:41:34 -08:00
H. Peter Anvin	8cc8a1d836	Add support for warning on invalid LOCK prefixes Add an LOCK flag to the instruction template, and make the presence of a LOCK prefix trigger a warning if it is not set. Simplify the LOCK and HLE logic by hard-coding the knowledge that operand 0 has to be memory. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2012-02-25 11:11:42 -08:00
H. Peter Anvin	8ea2200415	Move HLE byte codes to \264..\267 Move the HLE byte codes to \264..\267 so as not to break up an unused group of 8 (\240..\247). Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2012-02-25 10:24:24 -08:00
H. Peter Anvin	7849dd07b9	Add a "nohle" byte code to skip an instruction pattern The a2/a3 mem_offs MOV opcodes are invalid with XRELEASE; those instructions instead have to use a modrm form. Therefore give a way to annotate those instruction patterns so the pattern matcher will move on to the next pattern, rather than selecting them and then issuing a warning. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2012-02-25 10:19:56 -08:00
H. Peter Anvin	4ecd5d79fc	HLE: Implement the basic mechanism for XACQUIRE/XRELEASE This implements the mechanism for XACQUIRE/XRELEASE. It does not include the necessary annotations in insns.dat. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2012-02-24 21:57:17 -08:00
H. Peter Anvin	10da41e328	HLE: Split the LOCK and REP prefix slots With HLE, the sequence REP LOCK actually makes sense, so support it. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2012-02-24 20:57:04 -08:00
Cyrill Gorcunov	18914e6330	BR3392198: Fix compilation warning on prefixes insn->prefixes might contain not only values from 'enum prefixes' but from 'enum reg_enum' as well so make it generic 'int' instead. This calms down the compiler about enum's mess and eliminates a wrong assumption that we always have values by particular type in this field. Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2011-11-12 11:41:51 +04:00
Cyrill Gorcunov	d6851d4d26	assemble: Drop redundant variable Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2011-09-25 18:01:45 +04:00
Cyrill Gorcunov	10734c7e58	A couple of simplifications to assemble.c - GEN_SIB and GEN_MODRM helpers added - a number of tabs vs space fixes - more use of is_class() helper Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2011-08-29 00:07:17 +04:00
Cyrill Gorcunov	cdb8cd7b22	Drop empty line and bracket Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2011-08-28 16:33:39 +04:00
H. Peter Anvin	9f2043eaad	assemble.c: remove stray debugging code My bad for checking this in at all. Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>	2011-08-22 13:52:02 -07:00
Cyrill Gorcunov	c4d328c165	assemble.c: Comment out debug printing Probably we need some kind of pr_debug or something like that instead. Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2011-08-23 00:12:50 +04:00
Cyrill Gorcunov	397402016f	Drop unused 'type' from gencode Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2011-07-17 14:02:52 +04:00
H. Peter Anvin	cffe61e776	Use a normal quad-case for valueless /is4 When we don't have an immediate for the i-field in /is4, then use a normal quad-bytecode encoding for it to save some small amount of space and re-use existing machinery. Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>	2011-07-07 17:21:24 -07:00
H. Peter Anvin	fc561203fd	Remove support for DREX encoding The DREX encoding never hit production silicon, and has been replaced by VEX/XOP encoding, so remove support for it. Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>	2011-07-07 16:58:22 -07:00
H. Peter Anvin	3089f7ef8a	Add support for VSIB instructions Add support for VSIB instructions, which use vector registers as the index registers in an EA. Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>	2011-06-22 18:19:28 -07:00
Victor van den Elzen	6dfbddb6b0	Move implicit operand size override logic to calc_size It is more logical, it cleans up the code and it makes implicit operand size override prefixes come out in the same order as explicit ones instead of after all other prefixes. Suggested-by: H. Peter Anvin <hpa@zytor.com>	2010-12-29 18:13:38 +01:00
H. Peter Anvin	bcf9f2a08b	Merge branch 'nasm-2.09.xx'	2010-11-16 09:40:03 -08:00
H. Peter Anvin	3cb0e8c052	BR 3109604: Fix C4 vs C5 VEX form selection in calcsize() calcsize() had the wrong criterion for when C5 prefixes are permitted (REX.R is permitted, REX.X is forbidden.) assemble() had the right test already. This caused symbol value errors.	2010-11-16 09:39:32 -08:00
Victor van den Elzen	b3cee5a57a	BR3058845: mostly fix bogus warning with implicit operand size override The implicit operand size override code didn't set the operand size prefix, which confused the size calculation code for the range check. The BITS 64 operand size calculation is still off, but "fixing" it by making it 32-bit unless REX.W is set breaks PUSH and maybe others.	2010-11-07 23:27:48 +01:00
H. Peter Anvin	47fb7bc088	assemble: add an OPT instruction flags for optimizing assembly only Add an OPT flag to only use a pattern for optimizing assembly only. Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>	2010-08-24 13:53:22 -07:00
H. Peter Anvin	229fa6c465	assemble.c: fix VEX.W logic Fix the generation logic for VEX.W, which unfortunately got the wrong constants. Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>	2010-08-16 15:21:48 -07:00
H. Peter Anvin	421059c689	assemble: handle vex.lig AVX version 7 introduces the concept of .lig, meaning VEX.L is ignored. Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>	2010-08-16 14:56:33 -07:00
H. Peter Anvin	978c2170fc	vex: change .wx to .wig to match the latest AVX spec Change the .wx (ignore the W field) to .wig, to match the latest version of the AVX specification. This is not a functional change, but just makes instruction patterns a little easier to write. Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>	2010-08-16 13:48:43 -07:00
Cyrill Gorcunov	d6f31240c5	assemble.c: Style nitfix Various tabs/space mixture cleaned and some more. Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2010-07-26 23:16:45 +04:00
H. Peter Anvin	ab5bd05d82	Revert "Improve process_ea and introduce -OL" This reverts commit `ac732cb6a5`. Resolved Conflicts: doc/nasmdoc.src Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2010-07-25 12:43:30 -07:00
Cyrill Gorcunov	2124b7b7dc	Use is_register helper Save us some line of code Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2010-07-25 01:16:33 +04:00
Victor van den Elzen	ac732cb6a5	Improve process_ea and introduce -OL Two fixes: 1. Optimization of [bx+0xFFFF] etc 0xFFFF is an sbyte under 16-bit semantics, so make sure to check it right. 2. Don't optimize displacements in -O0 Displacements that fit into an sbyte or can be removed should not be optimized in -O0. Implicit zero displacements are still optimized, e.g.: [eax] -> 0 bit displacement, [ebp] -> 8 bit displacement. However explicit displacements are not optimized: [eax+0] -> 32 bit displacement, [ebp+0] -> 32 bit displacement. Because #2 breaks compatibility with 0.98, I introduced a new optimization level: -OL, legacy.	2010-07-24 22:00:12 +02:00
H. Peter Anvin	fea84d7fec	Permit short intersegment jumps Allow an intersegment jump to be short (OUT_REL1ADR) if explicitly specified so by the user. Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>	2010-05-06 15:33:24 -07:00
H. Peter Anvin	55ae12052c	Add support for one-byte relocations Add OUT_REL1ADR (one-byte relative address) and support for OUT_ADDRESs with size == 1. Add support for it in outbin and outdbg. It still needs to be added to other backends, both the OUT_REL*ADR and OUT_ADDRESS codepaths need to be handled. Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>	2010-05-06 15:33:24 -07:00
Victor van den Elzen	0d268fb78c	BR 2496848: Tighten ea checks Check if the offset and the representation are equivalent. Disallow REL on absolute addresses. I'm not sure what that would mean and the output formats don't support it. Warn about ignored displacement size modifiers.	2010-03-12 23:52:04 +01:00
Cyrill Gorcunov	6531d6d159	BR2907058: insn_size - close file handle before returning As example of such behaviour is when fseek fails for some reason. Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2009-12-05 14:10:41 +03:00
Cyrill Gorcunov	1de9500c89	Comment out matches() operand flags logic Also space fix Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2009-11-06 00:08:38 +03:00
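
Commit 4e8396b5cf above mentions that 48C7C000000080 and B800000080 load different values. The sketch below models only the immediate handling of those two encodings in 64-bit mode, to show why: the REX.W C7 /0 form sign-extends its 32-bit immediate to 64 bits, while the B8 form writes a 32-bit register and so zero-extends into RAX. The function names are hypothetical and are not taken from the NASM sources.

```python
def rax_after_mov_r64_imm32(imm_bytes: bytes) -> int:
    """Model REX.W + C7 /0 id (mov rax, imm32): the imm32 is sign-extended to 64 bits."""
    imm32 = int.from_bytes(imm_bytes, "little")
    if imm32 & 0x80000000:
        # Top bit set: the upper 32 bits of RAX become all ones.
        return imm32 | 0xFFFFFFFF00000000
    return imm32


def rax_after_mov_r32_imm32(imm_bytes: bytes) -> int:
    """Model B8 id (mov eax, imm32): a 32-bit write zero-extends into RAX."""
    return int.from_bytes(imm_bytes, "little") & 0xFFFFFFFF


# Immediate bytes shared by both encodings from the commit message: 00 00 00 80
imm = bytes.fromhex("00000080")
print(hex(rax_after_mov_r64_imm32(imm)))  # 0xffffffff80000000
print(hex(rax_after_mov_r32_imm32(imm)))  # 0x80000000
```

For immediates with the top bit clear, both forms produce the same RAX value, which is why the shorter encoding is only a safe substitute in that range.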


217 Commits
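
Commit ac732cb6a5 in the list above notes that 0xFFFF is an sbyte under 16-bit semantics. A minimal sketch of that check follows, assuming the displacement is first wrapped to the address width and then tested against the signed-byte range. The helper names are hypothetical, and this is not the logic of NASM's process_ea.

```python
def to_signed(value: int, bits: int) -> int:
    """Interpret the low `bits` bits of value as a two's-complement integer."""
    value &= (1 << bits) - 1
    if value >= 1 << (bits - 1):
        value -= 1 << bits
    return value


def fits_sbyte(displacement: int, addr_bits: int) -> bool:
    """True if the displacement, wrapped to addr_bits, fits in a signed byte."""
    return -128 <= to_signed(displacement, addr_bits) <= 127


print(fits_sbyte(0xFFFF, 16))  # True: 0xFFFF is -1 in 16 bits
print(fits_sbyte(0xFFFF, 32))  # False: 65535 in 32 bits
```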