Given a protocol buffers schema and some data, is the protocol buffers serialization deterministic across libraries and languages?
Basically, am I guaranteed that the same data will always serialize in the same way (down to the byte) regardless of the library used?
In general, the same data will serialize in exactly the same way.
However, this is not guaranteed by the protobuf specifications. For example, the following differences in encoding are allowable and must decode to the same result in all conforming libraries:
- Encoding fields in different order than the tag number order.
- Encoding packed fields as unpacked.
- Encoding integers as longer varint byte sequences than needed.
- Encoding same (non-repeated) field multiple times.
- Probably others.