How to apply DBSCAN algorithm on grouping of simil

2019-03-07 07:36发布

how to group similar url using the DBSCAN algorithm. I have seen many datasets but none were on url , I want to take similar type of urls and group it together. Here i am not able to know distance (eps) and minpoints can be the number of urls to be grouped.

标签： data-mining cluster-analysis dbscan

1条回答

迷人小祖宗

2楼-- · 2019-03-07 08:31

DBSCAN needs a distance function and a threshold for detecting similar objects.

So go ahead, first you need to define an appropiate distance function and a threshold, then we can help you with DBSCAN (but you should be able to find DBSCAN implementations that can be extened to arbitrary distance functions).

The key challenge is the distance, and this is up to you, because we do not know what you want to get out. This is very subjective, and we just don't know what you want or need.

0人赞添加讨论(0) 举报

How to apply DBSCAN algorithm on grouping of simil

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间