Selenium: How to use Headless Chrome on AWS?

2020-05-24 04:55发布

问题:

Today I saw the message UserWarning: Selenium support for PhantomJS has been deprecated, please use headless versions of Chrome or Firefox instead.

I am willing to for for Chrome route. How do I install on AWS and run it on my remote AWS instance?

I will be using selenium in Python.

回答1:

Create a new EC2 instance.

SSH log into the machine.

Install python, selenium, chromedriver, chromium, and python packages what you need.

sudo apt install chromium-chromedriver

Copy your python script to the machine.

Edit the script and add an chromeoption.

import selenium as se

options = se.webdriver.ChromeOptions()
options.add_argument('headless')

driver = se.webdriver.Chrome(chrome_options=options)

Done!



回答2:

I prefer to use Firefox so this is my Python3 implementation

def createHeadlessFirefoxBrowser():
     options = webdriver.FirefoxOptions()
     options.add_argument('--headless')
     return webdriver.Firefox(options=options)

browser = createHeadlessFirefoxBrowser()