I am trying to learn C and am very confused already.
In the OOP languages i have used there exists the ability to perform method overloading, where the same function could have different parameter types and call whichever was the most appropriate.
Now in C i know that this is not the case so i cant figure out the following problem, How printf() works.
For example:
char chVar = 'A';
int intVar = 123;
float flVar = 99.999;
printf("%c - %i - %f \n",chVar, intVar, flVar);
printf("%i - %f - %c \n",intVar, flVar, chVar);
printf("%f - %c - %i \n",flVar, chVar, intVar);
Now as C does'nt support function overloading, How does printf manage to take any number of arguments, of any type, and then work correctly with them?
I have tried to find the printf() working by downloading the glibc source package but can quite seem to find it, though i'll keep looking.
Could anyone here explain how C performs the above task?
C supports a type of function signature called "varargs" meaning "variable (number of) arguments". Such a function must have at least one required argument. In the case of
printf
, the format string is a required argument.Generally, on a stack-based machine, when you call any C function, the arguments are pushed onto the stack from right-to-left. In this way, the first argument to the function is that found on the "top" of the stack, just after the return address.
There are C macros defined which allow you to retrieve the variable arguments.
The key points are:
printf()
, if the format string is wrong, the code will read invalid results from memory, possibly crashing.va_start
, incremented withva_arg
, and released withva_end
.I have posted a ton of code you may find interesting on the related question:
Best Way to Store a va_list for Later Use in C/C++
Here's a skeleton of a
printf()
which only formats integers ("%d"):Internally,
printf
will (at least usually) use some macros from stdarg.h. The general idea is (a greatly expanded version of) something like this:Fleshing it out does involve quite a bit of work -- dealing with field width, precision, more conversions, etc. This is enough, however, to at least give a flavor of how you retrieve varying arguments of varying types inside your function.
(Don't forget that, if you're using gcc (and g++?), you can pass
-Wformat
in the compiler options to get the compiler to check that the types of the arguments match the formatting. I hope other compilers have similar options.)Blind faith. It assumes that you have ensured that the types of the arguments match perfectly with the corresponding letters in your format string. When
printf
is called, all the arguments are represented in binary, unceremoniously concatenated together, and passed effectively as a single big argument toprintf
. If they don't match, you'll have problems. Asprintf
iterates through the format string, every time it see%d
it will take 4 bytes from the arguments (assuming 32-bit, it would be 8 bytes for 64-bit ints of course) and it will interpret them as an integer.Now maybe you actually passed a
double
(typically taking up twice as much memory as anint
), in which caseprintf
will just take 32 of those bits and represented them as an integer. Then the next format field (maybe a%d
) will take the rest of the double.So basically, if the types don't match perfectly you'll get badly garbled data. And if you're unlucky you will have undefined behaviour.