I'm pretty new to x64-assembly on the Mac, so I'm getting confused porting some 32-bit code in 64-bit.
The program should simply print out a message via the printf
function from the C standart library.
I've started with this code:
section .data
msg db 'This is a test', 10, 0 ; something stupid here
section .text
global _main
extern _printf
_main:
push rbp
mov rbp, rsp
push msg
call _printf
mov rsp, rbp
pop rbp
ret
Compiling it with nasm this way:
$ nasm -f macho64 main.s
Returned following error:
main.s:12: error: Mach-O 64-bit format does not support 32-bit absolute addresses
I've tried to fix that problem byte changing the code to this:
section .data
msg db 'This is a test', 10, 0 ; something stupid here
section .text
global _main
extern _printf
_main:
push rbp
mov rbp, rsp
mov rax, msg ; shouldn't rax now contain the address of msg?
push rax ; push the address
call _printf
mov rsp, rbp
pop rbp
ret
It compiled fine with the nasm
command above but now there is a warning while compiling the object file with gcc
to actual program:
$ gcc main.o
ld: warning: PIE disabled. Absolute addressing (perhaps -mdynamic-no-pic) not
allowed in code signed PIE, but used in _main from main.o. To fix this warning,
don't compile with -mdynamic-no-pic or link with -Wl,-no_pie
Since it's a warning not an error I've executed the a.out
file:
$ ./a.out
Segmentation fault: 11
Hope anyone knows what I'm doing wrong.
According to the documentation for the x86 64bit instruction set http://download.intel.com/products/processor/manual/325383.pdf
PUSH only accepts 8, 16 and 32bit immediate values (64bit registers and register addressed memory blocks are allowed though).
Where msg is a 64bit immediate address will not compile as you found out.
What calling convention is _printf defined as in your 64bit library?
Is it expecting the parameter on the stack or using a fast-call convention where the parameters on in registers? Because x86-64 makes more general purpose registers available the fast-call convention is used more often.
No answer yet has explained why NASM reports
The reason NASM won't do this is explained in Agner Fog's Optimizing Assembly manual in section 3.3 Addressing modes under the subsection titled 32-bit absolute addressing in 64 bit mode he writes
This is not a problem on Linux or Windows. In fact I already showed this works at static-linkage-with-glibc-without-calling-main. That hello world code uses 32-bit absolute addressing with elf64 and runs fine.
@HristoIliev suggested using rip relative addressing but did not explain that 32-bit absolute addressing in Linux would work as well. In fact if you change
lea rdi, [rel msg]
tolea rdi, [msg]
it assembles and runs fine withnasm -efl64
but fails withnasm -macho64
Like this:
You can check that this is an absolute 32-bit address and not rip relative with
objdump
. However, it's important to point out that the preferred method is still rip relative addressing. Agner in the same manual writes:So when would use use 32-bit absolute addresses in 64-bit mode? Static arrays is a good candidate. See the following subsection Addressing static arrays in 64 bit mode. The simple case would be e.g:
where A is the absolute 32-bit address of the static array. This works fine with Linux but once again you can't do this with Mac OS X because the image base is larger than 2^32 by default. To to this on Mac OS X see example 3.11c and 3.11d in Agner's manual. In example 3.11c you could do
Where you use the extern reference from Mach O
__mh_execute_header
to get the image base. In example 3.11c you use rip relative addressing and load the address like thisThe 64-bit OS X ABI complies at large to the System V ABI - AMD64 Architecture Processor Supplement. Its code model is very similar to the Small position independent code model (PIC) with the differences explained here. In that code model all local and small data is accessed directly using RIP-relative addressing. As noted in the comments by Z boson, the image base for 64-bit Mach-O executables is beyond the first 4 GiB of the virtual address space, therefore
push msg
is not only an invalid way to put the address ofmsg
on the stack, but it is also an impossible one sincePUSH
does not support 64-bit immediate values. The code should rather look similar to:But in that particular case one needs not push the value on the stack at all. The 64-bit calling convention mandates that the fist 6 integer/pointer arguments are passed in registers
RDI
,RSI
,RDX
,RCX
,R8
, andR9
, exactly in that order. The first 8 floating-point or vector arguments go intoXMM0
,XMM1
, ...,XMM7
. Only after all the available registers are used or there are arguments that cannot fit in any of those registers (e.g. a 80-bitlong double
value) the stack is used. 64-bit immediate pushes are performed usingMOV
(theQWORD
variant) and notPUSH
. Simple return values are passed back in theRAX
register. The caller must also provide stack space for the callee to save some of the registers.printf
is a special function because it takes variable number of arguments. When calling such functionsRAX
should be set to the number of floating-point arguments, passed in the vector registers. Also note thatRIP
-relative addressing is preferred for data that lies within 2 GiB of the code.Here is how
gcc
translatesprintf("This is a test\n");
into assembly on OS X:(this is AT&T style assembly, source is left, destination is right, register names are prefixed with
%
, data width is encoded as a suffix to the instruction name)At
(1)
zero is put intoRAX
since no floating-point arguments are being passed. At(2)
the address of the string is loaded inRDI
. Note how the value is actually an offset from the current value ofRIP
. Since the assembler doesn't know what this value would be, it puts a relocation request in the object file. The linker then sees the relocation and puts the correct value at link time.I am not a NASM guru, but I think the following code should do it: