We have a code base broken up into static libraries. Unfortunately, the libraries have circular dependencies; e.g., libfoo.a
depends on libbar.a
and vice-versa.
I know the "correct" way to handle this is to use the linker's --start-group
and --end-group
options, like so:
g++ -o myApp -Wl,--start-group -lfoo -lbar -Wl,--end-group
But in our existing Makefiles, the problem is typically handled like this:
g++ -o myApp -lfoo -lbar -lfoo
(Imagine this extended to ~20 libraries with complex interdependencies.)
I have been going through our Makefiles changing the second form to the first, but now my co-workers are asking me why... And other than "because it's cleaner" and a vague sense that the other form is risky, I do not have a good answer.
So, can linking the same library multiple times ever create a problem? For example, could the link fail with multiply-defined symbols if the same .o gets pulled in twice? Or is there any risk we could wind up with two copies of the same static object, creating subtle bugs?
Basically, I want to know if there is any possibility of link-time or run-time failures from linking the same library multiple times; and if so, how to trigger them. Thanks.
All I can offer is a lack of counter-example. I've actually never seen the first form before (even though it's clearly better) and always seen this solved with the second form, and haven't observed problems as a result.
Even so I would still suggest changing to the first form because it clearly shows the relationship between the libraries rather than relying on the linker behaving in a particular way.
That said, I would suggest at least considering if there's a possibility of refactoring the code to pull out the common pieces into additional libraries.
Since it is a legacy application, I bet the structure of the libraries is inherited from some arrangement which probably does not matter any more, such as being used to build another product which you no longer do.
Even if still structural reasons remain for the inherited library structure, almost certainly, it would still be acceptable to build one more library from the legacy arrangement. Just put all the modules from the 20 libraries into a new library,
liballofthem.a
. Then every single application is simplyg++ -o myApp -lallofthem ...
The problem with
is that there is no guarantee, that two passes over
libfoo
and one pass overlibbar
are enough.The approach with
Wl,--start-group ... -Wl,--end-group
is better, because more robust.Consider the following scenario (all symbols are in different object-files):
myApp
needs symbolfooA
defined inlibfoo
.fooA
needs symbolbarB
defined inlibbar
.barB
needs symbolfooC
defined inlibfoo
. This is the circular dependency, which can be handled by-lfoo -lbar -lfoo
.fooC
needs symbolbarD
defined inlibbar
.To be able to build in the case above, we would need to pass
-lfoo -lbar -lfoo -lbar
to the linker. Why?libfoo
for the first time and uses definitions of symbolfooA
but notfooC
, because so far it doesn't see a necessity to include alsofooC
into the binary. The linker however starts to look for definition ofbarB
, because its definition is needed forfooA
to function.-libbar
, includes the definition ofbarB
(but notbarD
) and starts to look for definition offooC
.fooC
is found inlibfoo
, when it processed for the second time. Now it becomes evident, that also the definition ofbarD
is needed - but too late there is nolibbar
on the command line anymore!The example above can be extended to an arbitrary dependency depth (but this happens seldom in real life).
Thus using
is a more robust approach, because linker passes as often over the library group as needed - only when a pass didn't change the symbol table will the linker move on to the the next library on the command line.
There is however a small performance penalty to pay: in the first example
-lbar
were scanned once more compared with the manual command line-lfoo -lbar -lfoo
. Not sure whether it is worth mentioning/thinking about through.