I can't seem to figure out why this test doesn't pass
The test is:
given the following XML:
<?xml version="1.0" encoding="utf-8"?>
<foo>
<account>
1234567890
</account>
<deptCode>
ABCXYZ
</deptCode>
</foo>
and the following class:
class Foo {
[XmlElement(ElementName = "account", DataType = "normalizedString")]
string account;
[XmlElement(ElementName = "deptCode", DataType = "normalizedString"]
string deptCode;
}
when that XML is deserialized with:
XmlSerializer serializer = new XmlSerializer(typeof(Foo));
Foo myFoo = (Foo) serializer.Deserialize(xmlReader);
I get the following values:
Foo.account = "\r\n 1234567890 \r\n"
Foo.deptCode = "\r\n ABCXYZ \r\n"
instead of the expected
Foo.account = "1234567890"
Foo.deptCode = "ABCXYZ"
How can I make it so that the deserialization process gives me the expected results? I thought the DataType="normalizedString"
might do it, but it seems to have no effect, and when I use XmlReaderSettings.IgnoreWhitespace
, it just takes away the "\r" character, leaving me with "\n 1234567890"
It seems it is working as intended. From IgnoreWhitespace
documentation:
White space that is not considered to be significant includes spaces, tabs, and blank lines used to set apart the markup for greater readability.
Basically, what it does is preserves (when set to false
) whitespaces in between elements such as:
<Foo>
<bar>Text</bar>
</Foo>
The newline between <Foo>
and <bar>
will be returned by reader. Set IgnoreWhitespace
to true
, and it won't.
To achieve your goal you'll have to do programmatic trimming, as mentioned by Kirill. When you think about it, how is reader supposed to know whether whitespace of pure string content of element (as in your examples) is just for indenting purposes or actual content?
For more reading on ignoring whitespaces you may want to take a look here and here.
You can create custom XmlTextReader
class:
public class CustomXmlTextReader : XmlTextReader
{
public CustomXmlTextReader(Stream stream) : base(stream) { }
public override string ReadString()
{
return base.ReadString().Trim();
}
}
Try using XmlTextReader
for deserialization with the WhiteSpaceHandling
property set to WhiteSpaceHandling.None
and Normalization = true