I've looked up Wikipedia for x86 backward compatibility in x86-x64 and it says:
x86-64 is fully backwards compatible with 16-bit and 32-bit x86 code.Because the full x86 16-bit and 32-bit instruction sets remain implemented in hardware without any intervening emulation, existing x86 executables run with no compatibility or performance penalties,whereas existing applications that are recoded to take advantage of new features of the processor design may achieve performance improvements.
So I've tested some instructions to see that some are actually yield entirely different opcodes (rather than just applying prefix) such as: INC/DEC. Looking at (x86):
\x40 inc eax
\x48 dec eax
And while assembling the same in x86-x64 yields:
\xff \xc0 inc eax
\xff \xc8 dec eax
I'm trying to figure out the reason and more examples of other instructions that has the same symptoms that yield different opcodes. I am familiar with that push, pop, call, ret, enter and leave are not available 32 bit in x86-x64.
The statement is true, but somewhat misleading. The x86-64 architecture is backward compatible with x86, but the 32-bit instruction set is not compatible with the 64-bit instruction set.
You can run x86 code on a x86-64 CPU by using a compatibility mode. Actually, since the CPU should be transparently x86 to x86 code, it's the opposite : you enter 64-bit mode (long mode) when you want to run x86-64 code. This means you can't run both at the same time, though it is possible to switch from one mode to another.
Almost all instructions that are available in both modes have the same opcodes in both modes.
Removed instructions:
CS
/DS
/ES
/SS
were removed. push/pop FS and GS are still valid (those two segments can still have a non-zero base in long mode).mov Sreg, r32
andmov r32, Sreg
are still available for the "neutered" segment registers, so you can emulate push / pop using a scratch integer reg. CS still matters; a far jump to another code segment can switch to 32-bit mode, and the others still need valid segment descriptors.Removed (repurposed) encodings of some still-available instructions: In your case, 32bit can use the
inc r32
single-byte opcodes (0x40 + register-number). 64bit mode only has theinc r/m32
encoding, where the register to be incremented is specified with a 2nd byte. (In this case, the 0x4x bytes were repurposed as the REX prefix byte).Intel's insn reference (follow the link in the x86 tag wiki, or ) shows the following for
inc
:N.E. means not encodable. The Op/En column describes how operands are encoded.
Jan Hubicka's AMD64 ISA overview briefly describes the repurposing of single-byte inc/dec opcodes for REX prefixes, and the default operand sizes and how immediate data is still 32bit.
movabs
is available for loading 64bit immediate constants, or load/store from/to a 64bit absolute address.AMD's AMD64 manual, Section 2.5.11 Reassigned Opcodes has a table which is quite short. It only lists:
4x inc/dec r32
that turned into REX prefixes63 ARPL
that becameMOVSXD
(sign-extend dword to qword, when used with REX.W=1 (which means the W bit in the REX prefix = 1)).Early AMD64 and Intel EMT64 CPUs left out
SAHF/LAHF
in long mode, but later re-added that instruction with the same opcode as in 32bit. That table also doesn't list instructions that were removed entirely (the BCD instructions and maybe others) that were removed to make room for possible future extensions.They could have simplified things a lot, and made x86-64 a much better cleaner instruction set with more room for future extensions, but every difference from 32bit means more decoder transistors. There are no machine instructions that moved to a different opcode in 64bit.
Multiple machine instructions often share the same asm mnemonic,
mov
being the most overloaded one. There are loads, stores, mov with immediate-constants, move to/from segment registers, all in 8bit and 32bit. (16bit is the 32bit with an operand-size prefix, same for 64bit with a REX prefix.) There's a sepecial opcode for loading RAX from a 64bit absolute address. There's also a special opcode for loading a 64bit immediate-constant into a register. (AT&T syntax calls this movabs, but it's still just mov in Intel/NASM)