可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效，请关闭广告屏蔽插件后再试):

问题:

I am practicing selenium in python and I wanted to fetch all the links on a web page using selenium.

For example, I want all the links in 'a href" tag from this website : http://psychoticelites.com/

I've written a script and it is working. But, it's rather giving me the object address. I've tried using the 'id' tag to get the value, but, it doesn't work.

My current script :-

from selenium import webdriver
from selenium.webdriver.common.keys import Keys



driver = webdriver.Firefox()
driver.get("http://psychoticelites.com/")
assert "Psychotic" in driver.title
continue_link = driver.find_element_by_tag_name('a')
elem = driver.find_elements_by_xpath("//*[@href]")
#x = str(continue_link)
#print continue_link
#print elem
z = elem
print z

Any kind of clue/hint would be appreciated.

回答1:

Well, you have to simply loop through the list.

elems = driver.find_elements_by_xpath("//a[@href]")
for elem in elems:
    print elem.get_attribute("href")

find_elements_by_* returns a list of elements (Note the spelling of 'elements'). Loop through the list, take each element and fetch the required attribute value you want from it. (in this case href)

回答2:

You can import the HTML dom using html dom library in python. You can find it over here and install it using PIP:

https://pypi.python.org/pypi/htmldom/2.0

from htmldom import htmldom
dom = htmldom.HtmlDom("https://www.github.com/")  
dom = dom.createDom()

The above code creates a HtmlDom object.The HtmlDom takes a default parameter, the url of the page. Once the dom object is created, you need to call "createDom" method of HtmlDom. This will parse the html data and constructs the parse tree which then can be used for searching and manipulating the html data. The only restriction the library imposes is that the data whether it is html or xml must have a root element.

You can query the elements using the "find" method of HtmlDom object:

p_links = dom.find("a")  
for link in p_links:
  print ("URL: " +link.attr("href"))

The above code will print all the links/urls present on the web page

回答3:

You can try something like:

    links = driver.find_elements_by_partial_link_text('')

Fetch all href link using selenium in python

问题:

回答1:

回答2:

回答3:

收藏的人(0)

Fetch all href link using selenium in python

问题:

回答1:

回答2:

回答3:

收藏的人(0)

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮