-->

Groovy XmlSlurper get value of the node without ch

2019-07-20 05:47发布

站内文章 / 前端开发

37 0

傲

女 | 书童

私信

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效，请关闭广告屏蔽插件后再试):

问题:

I'm parsing HTML and trying to value of a parent node itself, without values of the children nodes.

HTML example:

<html>
    <body>
        <div>
             <a href="http://intro.com">extra stuff</a>
             Text I would like to get.
             <a href="http://example.com">link to example</a>
        </div>
    </body>
</html>

Code:

def tagsoupParser = new org.ccil.cowan.tagsoup.Parser()
def slurper = new XmlSlurper(tagsoupParser)
def htmlParsed = slurper.parseText(stringToParse)

println htmlParsed.body.div[0]

However above code returns:

extra stuff Text I would like to get. link to example

How can I get only parent node value without children? Example:

Text I would like to get.

P.S: I tried removing extra elements by doing substring but it proves to be unreliable.

回答1:

If you switch to using XmlParser instead of XmlSlurper, you can do:

println htmlParsed.body.div[0].localText()[0]

Assuming you are on Groovy 2.3+

标签： groovy html-parsing nodes xmlslurper

傲

女 | 书童

私信

收藏的人(0)

Ta的文章更多文章

0条评论

还没有人评论过~

Groovy XmlSlurper get value of the node without ch

问题:

回答1:

收藏的人(0)

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮