I have looked around and only found solutions that render a URL to HTML. However, I need a way to render a webpage that I already have (and that contains JavaScript) to proper HTML.
Want: Webpage (with JavaScript) ---> HTML
Not: URL --> Webpage (with JavaScript) ---> HTML
I couldn't figure out how to make the other code work the way I wanted.
This is the code I was using that renders URLs: http://webscraping.com/blog/Scraping-JavaScript-webpages-with-webkit/
For clarity, the code above takes the URL of a webpage that has some parts rendered by JavaScript. If I scrape that page normally, say with urllib2, I won't get the links etc. that only appear after the JavaScript has run.
However, I want to be able to scrape a page, say again with urllib2, then render that page and get the resulting HTML. (This differs from the code above, which takes a URL as its argument.)
Any help is appreciated, thanks guys :)
Try webdriver.Firefox().get('url')
You can
pip install selenium
from a command line, and then run something like:
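Here is a minimal sketch of what that could look like, assuming Firefox is installed and Selenium can find a driver for it. Since `get()` expects a URL, the HTML you already downloaded is first written to a temporary file and opened through a `file://` URL. The names `html_to_file_url` and `render_html` are made up for this example and are not part of Selenium.

```python
import pathlib
import tempfile


def html_to_file_url(html):
    """Write the HTML to a temporary .html file and return a file:// URL for it."""
    with tempfile.NamedTemporaryFile(
        "w", suffix=".html", delete=False, encoding="utf-8"
    ) as f:
        f.write(html)
    return pathlib.Path(f.name).resolve().as_uri()


def render_html(html):
    """Open already-downloaded HTML in Firefox and return the page source after load."""
    # Imported here so the helper above can be used without selenium installed.
    from selenium import webdriver

    driver = webdriver.Firefox()
    try:
        driver.get(html_to_file_url(html))
        # page_source is the browser's current view of the document.
        return driver.page_source
    finally:
        driver.quit()


# Hypothetical usage, with `page` being HTML fetched earlier (e.g. via urllib2):
# rendered = render_html(page)
```

Two caveats with this approach: relative links and AJAX requests in the page may not resolve when it is loaded from `file://`, and `page_source` is read right after the page loads, so scripts that finish later may need an explicit wait.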