Calculate variation of IP addresses column using M

2019-04-15 06:42发布

I'm trying to detect people using proxies to abuse my website.

Often they will change proxies and so forth. But there is definitely a pattern of them using one proxy address many times. Much more than is normal for legitimate visitors.

Usually most accessing of my website is by unique ip addresses that have only visited once or a few times. Not repeatedly.

Let's say I have these ip addresses in a column:

89.46.74.56
89.46.74.56
89.46.74.56
91.14.37.249
104.233.103.6

That would mean there are 3 uniques out of 5. Giving a "uniqueness score" of 60%.

How would I calculate this efficiently using MySQL?

标签： mysql ip ip-address standard-deviation variance

1条回答

Bombasti

2楼-- · 2019-04-15 07:02

plan

get count grouping by ip

divide by ( cross-joining ) the total rowcount

take maximum repeat ratio from above

setup

create table example
(
  id integer primary key auto_increment not null,
  ip varchar(13) not null
);

insert into example
( ip )
values
( '89.46.74.56'   ),
( '89.46.74.56'   ),
( '89.46.74.56'   ),
( '91.14.37.249'  ),
( '104.233.103.6' )
;

query

select max(repeat_factor)
from
(
select ip, count(*) / rc.row_count as repeat_factor
from example
cross join ( select count(*) as row_count from example ) rc
group by ip
) q
;

output

+--------------------+
| max(repeat_factor) |
+--------------------+
| 0.6                |
+--------------------+

sqlfiddle

0人赞添加讨论(0) 举报

Calculate variation of IP addresses column using M

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间