How to get the hidden input's value by using p

Posted 2020-07-11 07:44

How can I get an input's value from an HTML page?

<input type="hidden" name="captId" value="AqXpRsh3s9QHfxUb6r4b7uOWqMT" ng-model="captId">

I know the input's name (name="captId") and need its value.

import re , urllib ,  urllib2
a = urllib2.urlopen('http://www.example.com/','').read()

Thanks.

Update 1

I installed BeautifulSoup and used it, but I get an error.

code

 import re , urllib ,  urllib2
 a = urllib2.urlopen('http://www.example.com/','').read()
 soup = BeautifulSoup(a)
 value = soup.find('input', {'name': 'scnt'}).get('value')

error

"soup = BeautifulSoup(a) NameError: name 'BeautifulSoup' is not defined"

Tags: python python-2.7 urllib2 findall

1 answer

叛逆

#2 · 2020-07-11 08:32

Using the re module to parse XML or HTML is generally considered bad practice. Use it only if you control the page you are trying to parse. Otherwise, either your regexes become awfully complex, or your script can break as soon as someone replaces <input type="hidden" name=.../> with <input name="..." type="hidden" .../> or makes almost any other change.
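
To make the fragility concrete, here is a minimal sketch using only the re module. The pattern is a hypothetical one written for this illustration (it is not from the question), and it assumes the attributes always appear in the order type, name, value. The second string is the same tag from the question with two attributes swapped.

```python
import re

# Hypothetical pattern: assumes type comes first, then name, then value.
PATTERN = re.compile(r'<input type="hidden" name="captId" value="([^"]*)"')

# The tag exactly as it appears in the question.
original = '<input type="hidden" name="captId" value="AqXpRsh3s9QHfxUb6r4b7uOWqMT" ng-model="captId">'
# The same tag with type and name swapped, which is equivalent HTML.
reordered = '<input name="captId" type="hidden" value="AqXpRsh3s9QHfxUb6r4b7uOWqMT" ng-model="captId">'

match_original = PATTERN.search(original)
match_reordered = PATTERN.search(reordered)

print(match_original.group(1))  # the value is found
print(match_reordered)          # None: the pattern no longer matches
```

The page still means the same thing to a browser, but the regex silently stops matching, which is the kind of breakage an HTML parser avoids.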

BeautifulSoup is an HTML parser that:

automatically fixes minor errors (unclosed tags, ...)
builds a parse tree of the document
allows you to browse the tree and search for specific tags with specific attributes
is usable with Python 2 and 3

Unless you have good reasons not to do it, you should use it rather than re for HTML parsing.

For example, assuming that txt contains the whole page, finding all hidden fields is as simple as:

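from bs4 import BeautifulSoup

# txt holds the HTML of the whole page
soup = BeautifulSoup(txt, "html.parser")
hidden_tags = soup.find_all("input", type="hidden")
for tag in hidden_tags:
    # tag.name is the tag name ("input"); attributes are read with .get()
    print("%s = %s" % (tag.get("name"), tag.get("value")))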
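
Applied to the field from the question, a minimal sketch could look like the following. It assumes the bs4 package is installed; the inline txt string stands in for the downloaded page, and hidden_value is a hypothetical helper named for this illustration. The import on the first line is what the NameError in the question is complaining about.

```python
from bs4 import BeautifulSoup  # without this import, BeautifulSoup is undefined (NameError)

# Stand-in for the downloaded page; in the question this would be the
# string returned by urlopen(...).read().
txt = '''
<form>
  <input type="hidden" name="captId" value="AqXpRsh3s9QHfxUb6r4b7uOWqMT" ng-model="captId">
</form>
'''

# Naming the parser explicitly avoids depending on whichever one is installed.
soup = BeautifulSoup(txt, "html.parser")


def hidden_value(soup, field_name):
    """Return the value attribute of the named hidden input, or None if absent."""
    # attrs= is used because the keyword "name" already means the tag name in find().
    tag = soup.find("input", attrs={"type": "hidden", "name": field_name})
    return tag.get("value") if tag is not None else None


capt_id = hidden_value(soup, "captId")
print(capt_id)
```

Returning None for a missing field, rather than letting .get() fail on None, keeps the script from crashing if the page layout changes.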
