Right now I successfully grabbed the full element from an HTML page with this:
//img[@class='photo-large']
for example it would return this:
<img src="http://example.com/img.jpg" class='photo-large' />
But I only need the SRC url (http://example.com/img.jpg). Any help?
Using Hpricot this works:
In case you have more than one image, the following gives an array:
However, Nokogiri is many times faster and it “can be used as a drop in replacement” for Hpricot.
Here the version for Nokogiri, in which this XPath for selecting attributes works:
or for many images:
//img/@src
you can just go with this if you want a link of the image.
example:
You are so close to answering this yourself that I am somewhat reluctant to answer it for you. However, the following XPath should provide what you want (provided the source is XHTML, of course).
For further tips, check out W3 Schools. They have excellent tutorials on such things and a great reference too.