I know that COM provides reusability at the binary level across languages and applications. I read that all components built for COM must adhere to a standard memory layout in order to be language-independent.
I do not understand what "standard memory layout" means.
What makes COM language-independent?
As I remember COM it is language independent ind the way that it's structure is documented and open (?). The downside is that the structure is "binary" and is closely tied to C/C++, which makes it difficult to used from other languages. The good news is that a lot of languages (like Python) have a C/C++ interface, which makes it possible (Win32com Python module) to use from other languages.
JSON is language independent in a way that it does not map directly to any language, although Python, Perl and Ruby (?) are close, but it is almost impossible to use from C/C++
Hope this makes some kind of sense
Don Box wrote an excellent explanation in the first chapter of his book Essential COM. You can read it here for free. I recommend it.
First, some technical background: C++ compilers usually generate something called a "vtable" for any class with virtual functions. This is basically a table of function pointers. The vtable contains a function pointer to every virtual method implemented by a class.
In COM, interfaces are basically abstract base classes which a component implements, e.g.:
The vtable for
CSomeComponent
will include function pointers for all methods defined in these two interfaces.Any instantiated object has a reference to the vtable of its dynamic type. This is how the program knows how to call the proper method in cases where a base method is overridden in a derived class:
The last line should call
Derived::foo
. This works because objectX
has a reference to the vtable for classDerived
. As said, the vtable is like a list of function pointers. Now, vtables have a fixed layout: If classDerived
inherits from classBase
, the function pointer for methodfoo
will be at the same relative location inDerived
's vtable than inBase
's vtable:Now, if the compiler sees something like
X->foo()
, it knows that all for all classes derived fromBase
, methodfoo
corresponds to the first entry in the vtable. So it issues a call to the first function pointer, which inX
's case is a call toDerived::foo
.Answer to your question: Compilers can only generate COM components if they generate the same layout for vtables that the COM specification demands. vtables can be implemented in various different ways, especially when it comes to multiple inheritance (which is required with COM components). Adhering to a certain vtable format is necessary so that when you call a component's method
f
, you will actually call methodf
and not some other methodg
which happens to sit atf
's position in the component class's vtable. I suppose COM-compliant compilers essentially have to produce the same vtable layouts as Microsoft Visual C++, since the COM technology was defined by Microsoft.P.S.: Sorry for being so technical, I hope the above information is of some use to you.
Standard memory layout means a memory layout that is defined (standardised) in the COM specification, rather than whatever default one the calling or defining language uses. And it is exactly this sort of thing that makes it language independent! Code in a particular language calling a COM object doesn't have to care what language that COM object was written in, in terms of knowing its structure in memory.