nasm/output/legacy.c
H. Peter Anvin bff94fbd39 Major changes to a number of subsystems to improve matching
Work through a number of changes toward making matching a lot saner,
both to reduce the number of patterns to generate for APX but also to
make a number of code patterns simpler.

This replaces a fair number of byte codes.

Improve a number of error messages, especially related to overflows.

Move process_insn() from nasm.c to assemble.c, as it really is the
primary entry point to the assembler module.

Reorder some prefixes. In particular, F2/F3 override 66 when used as a
mandatory prefix, so it makes more sense for them to be closer to the
opcode.

Move a lot more information into struct insn. It is better to have it
in one place; memory consumption is not an issue because struct insn
is transient information.

Get rid of "optimization levels" and replace it with a mask of
flags. That was already halfway done; complete the job.

Replace seg:offset in struct out_data with a struct location. It would
be better to extend this to more places, too.

The ARx and SMx flags are now explicit bitmasks, instead of having a
couple of hard-coded ranges.

Add __func__ to assert or panic messages.

Because of prefix and message changes, a number of travis tests had to
be audited and updated.

Fix a number of instruction patterns which had .128 when they ought to
be .lig. This is no longer a minor issue with the disassembler: for
AVX10, the pattern vector length determines how SAE/RC are encoded,
and there is no valid 128-bit encoding. However, with .lig the 512-bit
encoding can be used.

Separate "o64nw" into two pieces: opsize 64 and "nw" = "REX.w not necessary". The
latter can be included in non-64-bit patterns. "o64" still set REX.W
since that is still the common thing.

New "osz" bytecode: emit an OSP *or* REX.W depending on the current
mode and operand size. Useful for special cases like "nop" where "o64
nop" probably wants to be encoded as "48 90".

Signed-off-by: H. Peter Anvin <hpa@zytor.com>
2024-08-07 17:13:44 -07:00

126 lines
4.1 KiB
C

/* ----------------------------------------------------------------------- *
*
* Copyright 2016-2024 The NASM Authors - All Rights Reserved
* See the file AUTHORS included with the NASM distribution for
* the specific copyright holders.
*
* Redistribution and use in source and binary forms, with or without
* modification, are permitted provided that the following
* conditions are met:
*
* * Redistributions of source code must retain the above copyright
* notice, this list of conditions and the following disclaimer.
* * Redistributions in binary form must reproduce the above
* copyright notice, this list of conditions and the following
* disclaimer in the documentation and/or other materials provided
* with the distribution.
*
* THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND
* CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES,
* INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
* MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
* DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR
* CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
* SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT
* NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES;
* LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
* HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN
* CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR
* OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE,
* EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
*
* ----------------------------------------------------------------------- */
/*
* output/legacy.c
*
* Mangle a struct out_data to match the rather bizarre legacy
* backend interface.
*
* The "data" parameter for the output function points to a "int64_t",
* containing the address of the target in question, unless the type is
* OUT_RAWDATA, in which case it points to an "uint8_t"
* array.
*
* Exceptions are OUT_RELxADR, which denote an x-byte relocation
* which will be a relative jump. For this we need to know the
* distance in bytes from the start of the relocated record until
* the end of the containing instruction. _This_ is what is stored
* in the size part of the parameter, in this case.
*
* Also OUT_RESERVE denotes reservation of N bytes of BSS space,
* and the contents of the "data" parameter is irrelevant.
*/
#include "nasm.h"
#include "outlib.h"
void nasm_do_legacy_output(const struct out_data *data)
{
const void *dptr = data->data;
enum out_type type = data->type;
int32_t tsegment = data->tsegment;
int32_t twrt = data->twrt;
uint64_t size = data->size;
switch (data->type) {
case OUT_RELADDR:
switch (data->size) {
case 1:
type = OUT_REL1ADR;
break;
case 2:
type = OUT_REL2ADR;
break;
case 4:
type = OUT_REL4ADR;
break;
case 8:
type = OUT_REL8ADR;
break;
default:
panic();
break;
}
dptr = &data->toffset;
size = data->relbase - data->loc.offset;
break;
case OUT_SEGMENT:
type = OUT_ADDRESS;
if (tsegment != NO_SEG && tsegment < SEG_ABS)
tsegment |= 1;
dptr = zero_buffer;
size = data->size;
break;
case OUT_ADDRESS:
dptr = &data->toffset;
size = (data->flags & OUT_SIGNED) ? -data->size : data->size;
break;
case OUT_RAWDATA:
case OUT_RESERVE:
tsegment = twrt = NO_SEG;
break;
case OUT_ZERODATA:
tsegment = twrt = NO_SEG;
type = OUT_RAWDATA;
dptr = zero_buffer;
while (size > ZERO_BUF_SIZE) {
ofmt->legacy_output(data->loc.segment, dptr, type,
ZERO_BUF_SIZE, tsegment, twrt);
size -= ZERO_BUF_SIZE;
}
break;
default:
panic();
break;
}
ofmt->legacy_output(data->loc.segment, dptr, type, size, tsegment, twrt);
}