How to parse XML in Bash? (answers, page 3)

Posted 2018-12-31 07:18

Ideally, what I would like to be able to do is:

cat xhtmlfile.xhtml |
getElementViaXPath --path='/html/head/title' |
sed -E -e 's%(^<title>|</title>$)%%g' > titleOfXHTMLPage.txt

Tags: xml bash xhtml shell xpath

15 answers

笑指拈花

#2 · 2018-12-31 07:43

You can do this using only Bash. Just define this function:

rdom () { local IFS=\> ; read -d \< E C ;}

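Now you can use rdom like read, but for HTML and XML documents. Each call reads up to the next < and splits that chunk at the first >: the tag (name plus any attributes) is assigned to variable E and the text that follows it to variable C.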

For example, to do what you wanted to do:

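while rdom; do
    if [[ $E = title ]]; then
        echo "$C"   # quoted so whitespace in the title is preserved
        break       # stop reading; exit would terminate the whole shell
    fi
done < xhtmlfile.xhtml > titleOfXHTMLPage.txt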

爱死公子算了

#3 · 2018-12-31 07:43

You can use the xpath utility. I believe it ships with Perl's XML::XPath module.

初与友歌

#4 · 2018-12-31 07:45

This works if you want the XML attributes:

$ cat alfa.xml
<video server="asdf.com" stream="H264_400.mp4" cdn="limelight"/>

$ sed 's.[^ ]*..;s./>..' alfa.xml > alfa.sh

$ . ./alfa.sh

$ echo "$stream"
H264_400.mp4
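
The sed command above is terse because it uses . as the s delimiter, so here is a sketch of the same idea spelled out step by step. It assumes a single self-closing element on one line, as in alfa.xml, and it uses eval on a variable instead of writing and sourcing alfa.sh; the variable names xml and assignments are made up for this example.

```shell
#!/usr/bin/env bash
# Hypothetical one-line input, matching the alfa.xml example above.
xml='<video server="asdf.com" stream="H264_400.mp4" cdn="limelight"/>'

# Step 1: drop the leading run of non-space characters ("<video").
# Step 2: drop the closing "/>".
assignments=$(printf '%s\n' "$xml" | sed -e 's/^[^ ]*//' -e 's|/>$||')
printf '%s\n' "$assignments"

# Step 3: evaluate the remaining name="value" pairs as shell assignments.
# Only do this with trusted input: any shell syntax inside the
# attribute values would be executed.
eval "$assignments"
printf '%s\n' "$stream"
```

What is left after the two substitutions is a line of name="value" pairs, which happens to be valid shell assignment syntax. That is why the trick works, and also why it breaks on attribute names that are not valid shell identifiers, such as names containing a hyphen or a colon.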
