How can I find the link URL by link text with XPat

2019-01-12 19:24发布

I have a well formed XHTML page. I want to find the destination URL of a link when I have the text that is linked.

Example

<a href="http://stackoverflow.com">programming questions site</a>
<a href="http://cnn.com">news</a>

I want an XPath expression such that if given programming questions site it will give http://stackoverflow.com and if I give it news it will give http://cnn.com.

标签: xml xhtml xpath
6条回答
ら.Afraid
2楼-- · 2019-01-12 19:25

Think of the phrase in the square brackets as a WHERE clause in SQL.

So this query says, "select the "href" attribute (@) of an "a" tag that appears anywhere (//), but only where (the bracketed phrase) the textual contents of the "a" tag is equal to 'programming questions site'".

查看更多
虎瘦雄心在
3楼-- · 2019-01-12 19:31

Should be something similar to:

//a[text()='text_i_want_to_find']/@href
查看更多
倾城 Initia
4楼-- · 2019-01-12 19:37

Too late for you, but for anyone else with the same question...

//a[contains(text(), 'programming')]/@href

Of course, 'programming' can be any text fragment.

查看更多
地球回转人心会变
5楼-- · 2019-01-12 19:37

if you are using html agility pack use getattributeValue:

$doc2.DocumentNode.SelectNodes("//div[@class='className']/div[@class='InternalClass']/a[@class='InternalClass']").GetAttributeValue("href","")
查看更多
相关推荐>>
6楼-- · 2019-01-12 19:42

For case insensitive contains, use the following:

//a[contains(translate(text(),'PROGRAMMING','programming'), 'programming')]/@href

translate converts capital letters in PROGRAMMING to lower case programming.

查看更多
够拽才男人
7楼-- · 2019-01-12 19:50
//a[text()='programming quesions site']/@href 

which basically identifies an anchor node <a> that has the text you want, and extracts the href attribute.

查看更多
登录 后发表回答