Screen Scraping a Javascript based webpage in Pyth

2019-01-19 23:55发布

问题:

I am working on a screen scraping tool in Python. But, as I look through the source of the webpage, I noticed that most of the data is coming through Javascript.

Any idea, how to scrape javascript based webpage ? Any tool in Python ?

Thanks

回答1:

Scraping javascript-based webpages is possible with selenium. In particular, try the Selenium WebDriver.



回答2:

I use webkit, which is the browser renderer behind Chrome and Safari. There are Python bindings to webkit through Qt.

And here is a full Python example to execute JavaScript and extract the final HTML.



回答3:

You can use the QtWebKit module of the PyQt4 library