Is protocol buffer serialization output fully dete

2019-06-24 04:07发布

问题:

Given a protocol buffers schema and some data, is the protocol buffers serialization deterministic across libraries and languages?

Basically, am I guaranteed that the same data will always serialize in the same way (down to the byte) regardless of the library used?

回答1:

In general, the same data will serialize in exactly the same way.

However, this is not guaranteed by the protobuf specifications. For example, the following differences in encoding are allowable and must decode to the same result in all conforming libraries:

  • Encoding fields in different order than the tag number order.
  • Encoding packed fields as unpacked.
  • Encoding integers as longer varint byte sequences than needed.
  • Encoding same (non-repeated) field multiple times.
  • Probably others.