I am trying to read a binary file with the BinaryReader class, and I need to read it in as blocks of UInt32, and then do some bit shifting etc. afterwords.
But, for some reason bit order is reversed when I use the ReadUInt32 method.
If I for example have a file where the first four bytes looks like this in hex, 0x12345678
, they end up like this after being read by ReadUInt32: 0x78563412
.
If I use the ReadBytes(4) method, I get the expected array:
[0x00000000] 0x12 byte
[0x00000001] 0x34 byte
[0x00000002] 0x56 byte
[0x00000003] 0x78 byte
Why is this? Is it just the way .net represents uints in memory? Is it the same across the different platforms (I am running 64bit Windows 7, .net 3.5 sp1)?
Look into Jon Skeet's MiscUtil library for the Endian* classes, like EndianBinaryReader and EndianBitConverter.
http://www.yoda.arachsys.com/csharp/miscutil/
This is an issue of platform Endianess. When you read data from a stream you must read it accordingly to the endianess it was written as. If you created the data in .Net, then .Net will read it correctly.
Read Generic BinaryReader and BinaryWriter Extensions, a great way to handle generic casting the unmanaged way.
For VB.NET (safe code only, can also be achieved in C#) use the following:
Imports System.IO Imports System.Runtime.CompilerServices Imports System.Runtime.InteropServices
You can now implement the same functionality for
BitConverter
, forBinaryWriter
etc.Jon Skeet's written a BitConverter with configurable endian-ness. You might find it useful.
http://www.yoda.arachsys.com/csharp/miscutil/
This seems to be an endianness issue. The docs say ReadUint32 reads in little-endian so the first byte is the least-significant so it goes to the lowest memory location. Your writer must be big-endian?
BinaryWriter.Write(UInt32)
says it writes little-endian too. Is your binary data source not BinaryWriter?Essentially what you need to do to fix it is this:
This shifts the least-significant byte up 24 bits, the 2nd LSB up 8 bits, the 3rd LSB down 8 bits, and the 4th LSB (the MSB) down 24 bits. Doing this is covered in several libraries.
Perhaps using
BitConverter
would be a bit more clear:Yes, this has to do with how your computer hardware stores uints in memory. It can be different across different platforms, although most desktop computers should be the same.
This is called endianness -- see wikipedia here:
http://en.wikipedia.org/wiki/Endian