Ruby on Rails, How to determine if a request was m

2019-01-22 08:33发布

I've Rails apps, that record an IP-address from every request to specific URL, but in my IP database i've found facebook blok IP like 66.220.15.* and Google IP (i suggest it come from bot). Is there any formula to determine an IP from request was made by a robot or search engine spider ? Thanks

标签： ruby-on-rails ruby-on-rails-3 search-engine web-crawler

4条回答

Luminary・发光体

2楼-- · 2019-01-22 08:57

Robots are required (by common sense / courtesy more than any kind of law) to send along a User-Agent with their request. You can check for this using request.env["HTTP_USER_AGENT"] and filter as you please.

0人赞添加讨论(0) 举报

干净又极端

3楼-- · 2019-01-22 08:58

I think you can use browser gem for check bots.

if browser.bot?
  # code here
end

https://github.com/fnando/browser

0人赞添加讨论(0) 举报

在下西门庆

4楼-- · 2019-01-22 09:05

Another way is to use crawler_detect gem:

CrawlerDetect.is_crawler?("Bot user agent")
=> true

#or after adding Rack::Request extension
request.is_crawler?
=> true

It can be useful if you want to detect a large various of different bots (more than 1000).

0人赞添加讨论(0) 举报

闹够了就滚

5楼-- · 2019-01-22 09:08

Since the well behaved bots at least typically include a reference URI in the UA string they send, something like:

request.env["HTTP_USER_AGENT"].match(/\(.*https?:\/\/.*\)/)

is an easy way to see if the request is from a bot vs. a human user's agent. This seems to be more robust than trying to match against a comprehensive list.

0人赞添加讨论(0) 举报

Ruby on Rails, How to determine if a request was m

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间