I just downloaded the HTMLAgilityPack and the documentation doesn't have any examples.
I'm looking for a way to download all the images from a website. The address strings, not the physical image.
<img src="blabalbalbal.jpeg" />
I need to pull the source of each img tag. I just want to get a feel for the library and what it can offer. Everyone said this was the best tool for the job.
Edit
public void GetAllImages()
{
WebClient x = new WebClient();
string source = x.DownloadString(@"http://www.google.com");
HtmlAgilityPack.HtmlDocument document = new HtmlAgilityPack.HtmlDocument();
document.Load(source);
//I can't use the Descendants method. It doesn't appear.
var ImageURLS = document.desc
.Select(e => e.GetAttributeValue("src", null))
.Where(s => !String.IsNullOrEmpty(s));
}
Based on their one example, but with modified XPath:
I don't know this extension, so I'm not sure how to write out the array to somewhere else, but that will at least get you your data. (Also, I don't define the array correctly, I'm sure. Sorry).
Edit
Using your example:
You can do this using LINQ, like this:
EDIT: This code now actually works; I had forgotten to write
document.DocumentNode
.