Parsing multiple XML fragments with StAX

Posted 2019-05-15 12:45

I was hoping the following would be parseable with StAX:

<something a="b"/>
<something a="b"/>

But the parser chokes when it reaches the second element, because there is no common root element. (I'm not too sure why a pull parser cares about this particular issue, but anyway.)

I can fake a root element, e.g. with Guava:

    InputSupplier<Reader> join = CharStreams.join(
            newReaderSupplier("<root>"),
            newReaderSupplier(new File("...")),
            newReaderSupplier("</root>"));

    XMLInputFactory xif = XMLInputFactory.newInstance();
    XMLStreamReader xsr = xif.createXMLStreamReader(join.getInput());
    xsr.nextTag();  // Skip the fake root

So my question is just: is there any way to avoid this hack? Is there some 'fragment' mode that I can put the parser into?

Tags: java xml xml-parsing stax

3 Answers

你好瞎i

#2 · 2019-05-15 13:02

Nope. The StAX API does not support fragments. An XMLStreamReader is suitable for exactly one XML document. However, your "hack" isn't that bad at all.
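For what it's worth, the same wrapping trick can be done with the JDK alone, which may help if the Guava classes from the question are not available. The following is only a minimal sketch: the class and method names are hypothetical, and it assumes the fragments are UTF-8 and do not start with an XML declaration (a declaration after the fake `<root>` tag would not be legal).

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.io.SequenceInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

public class FakeRootExample {

    // Hypothetical helper: surrounds the fragment stream with <root> ... </root>.
    static InputStream wrapInRoot(InputStream fragments) {
        InputStream open = new ByteArrayInputStream("<root>".getBytes(StandardCharsets.UTF_8));
        InputStream close = new ByteArrayInputStream("</root>".getBytes(StandardCharsets.UTF_8));
        return new SequenceInputStream(
                Collections.enumeration(Arrays.asList(open, fragments, close)));
    }

    // Collects the "a" attribute of every top-level fragment element.
    static List<String> readTopLevelAttributes(InputStream fragments) throws XMLStreamException {
        XMLInputFactory xif = XMLInputFactory.newInstance();
        XMLStreamReader xsr = xif.createXMLStreamReader(wrapInRoot(fragments), "UTF-8");
        List<String> values = new ArrayList<>();
        try {
            int depth = 0; // 1 = fake root, 2 = the real top-level fragments
            while (xsr.hasNext()) {
                int event = xsr.next();
                if (event == XMLStreamConstants.START_ELEMENT) {
                    depth++;
                    if (depth == 2) {
                        values.add(xsr.getAttributeValue(null, "a"));
                    }
                } else if (event == XMLStreamConstants.END_ELEMENT) {
                    depth--;
                }
            }
        } finally {
            xsr.close();
        }
        return values;
    }

    public static void main(String[] args) throws XMLStreamException {
        String input = "<something a=\"b\"/>\n<something a=\"b\"/>";
        InputStream in = new ByteArrayInputStream(input.getBytes(StandardCharsets.UTF_8));
        System.out.println(readTopLevelAttributes(in)); // prints [b, b]
    }
}
```

Tracking the depth instead of calling `nextTag()` once keeps the fake root out of the results even when the fragments contain nested elements or text.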


Fickle 薄情

#3 · 2019-05-15 13:03

The Woodstox StAX implementation apparently does support this: http://woodstox.codehaus.org/3.2.9/javadoc/com/ctc/wstx/api/WstxInputProperties.html#P_INPUT_PARSING_MODE

As it happens, we are already using Woodstox in some places, but I didn't think to Google for Woodstox-specific options!
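A rough sketch of how that property might be used is below. It assumes Woodstox is on the classpath and that `WstxInputProperties.PARSING_MODE_FRAGMENT` is the value you want for `P_INPUT_PARSING_MODE`; the class and method names are made up for illustration, and I have not checked the behaviour against every Woodstox version.

```java
import com.ctc.wstx.api.WstxInputProperties;
import com.ctc.wstx.stax.WstxInputFactory;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

public class WoodstoxFragmentExample {

    // Hypothetical helper: reads the "a" attribute of each top-level element
    // without adding a fake root, relying on Woodstox's fragment parsing mode.
    static List<String> readTopLevelAttributes(String xml) throws XMLStreamException {
        // Instantiate the Woodstox factory directly so the property is understood;
        // XMLInputFactory.newInstance() might return a different implementation.
        XMLInputFactory xif = new WstxInputFactory();
        xif.setProperty(WstxInputProperties.P_INPUT_PARSING_MODE,
                WstxInputProperties.PARSING_MODE_FRAGMENT);

        XMLStreamReader xsr = xif.createXMLStreamReader(new StringReader(xml));
        List<String> values = new ArrayList<>();
        try {
            int depth = 0; // 1 = a top-level fragment element
            while (xsr.hasNext()) {
                int event = xsr.next();
                if (event == XMLStreamConstants.START_ELEMENT) {
                    depth++;
                    if (depth == 1) {
                        values.add(xsr.getAttributeValue(null, "a"));
                    }
                } else if (event == XMLStreamConstants.END_ELEMENT) {
                    depth--;
                }
            }
        } finally {
            xsr.close();
        }
        return values;
    }

    public static void main(String[] args) throws XMLStreamException {
        System.out.println(readTopLevelAttributes("<something a=\"b\"/>\n<something a=\"b\"/>"));
    }
}
```

Note that this ties the code to Woodstox, whereas the fake-root approach works with any StAX implementation.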


三岁会撩人

#4 · 2019-05-15 13:19

According to the XML spec, an XML document must have a single root element, or else it isn't well-formed. So your so-called hack isn't a hack at all; it is the best way to fix up the document.

