Googlebots Ignoring robots.txt? [closed]

Posted 2019-04-23 08:07

I have a site with the following robots.txt in the root:

User-agent: *
Disabled: /

User-agent: Googlebot 
Disabled: /

User-agent: Googlebot-Image
Disallow: /

Yet pages within this site are being crawled by Googlebot all day long. Is there something wrong with my file, or with Google?

4 Answers
我欲成王,谁敢阻挡
#2 · 2019-04-23 08:18

It should be Disallow:, not Disabled:.
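To see what the typo does in practice, here is a minimal sketch that feeds both versions of the file to Python's standard-library `urllib.robotparser`. That parser is only a stand-in: Google's own parser may differ in details, and the helper name `can_crawl` and the test URL are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# The file from the question, typo included.
AS_POSTED = """\
User-agent: *
Disabled: /

User-agent: Googlebot
Disabled: /

User-agent: Googlebot-Image
Disallow: /
"""

# Same file with the directive spelled correctly.
CORRECTED = AS_POSTED.replace("Disabled:", "Disallow:")


def can_crawl(robots_txt, agent, url="/some-page.html"):
    """Return True if `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


for label, text in (("as posted", AS_POSTED), ("corrected", CORRECTED)):
    for agent in ("Googlebot", "Googlebot-Image"):
        print(f"{label:10} {agent:16} allowed: {can_crawl(text, agent)}")
```

With this parser, the file as posted leaves Googlebot unrestricted because the unknown `Disabled:` lines are skipped, so only Googlebot-Image ends up blocked. The corrected file blocks both.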

成全新的幸福
#3 · 2019-04-23 08:23

Also, I believe that the bot goes down the file and takes the first group of directives that applies to it. In your case, Googlebot and Googlebot-Image would never see their specific directives because they would respect the "User-agent: *" group first.

Disregard this answer. I found information indicating that this is not the case: the bot should find the group specific to it and respect that one.
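The corrected behaviour (a specific group wins over "*" regardless of order) can be checked with a small sketch using Python's standard-library `urllib.robotparser`. The rules file and the `SomeOtherBot` name below are hypothetical, and this shows how Python's parser resolves groups, which is assumed rather than guaranteed to match Google's crawler.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical file: a catch-all group listed first, then a Googlebot group.
RULES = """\
User-agent: *
Disallow: /

User-agent: Googlebot
Allow: /
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())

# Googlebot matches its own group even though "*" appears earlier.
googlebot_allowed = parser.can_fetch("Googlebot", "/page.html")
# A crawler with no group of its own falls back to "*".
other_allowed = parser.can_fetch("SomeOtherBot", "/page.html")

print(googlebot_allowed, other_allowed)
```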

地球回转人心会变
#4 · 2019-04-23 08:33

Google has an analysis tool for checking robots.txt entries; read about it here.

You might also want to check the IP addresses of the "rogue" robots to see if they really are owned by Google.
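One common way to do that check is a reverse DNS lookup on the address followed by a forward lookup on the returned hostname. The sketch below assumes genuine Googlebot hosts resolve under `googlebot.com` or `google.com`. The function name `is_googlebot` and its injectable resolver parameters are illustrative, and the forward lookup used here only handles IPv4.

```python
import socket

# Assumed hostname suffixes for genuine Google crawlers.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def is_googlebot(ip, reverse=socket.gethostbyaddr, forward=socket.gethostbyname_ex):
    """Return True if `ip` reverse-resolves to a Google host that resolves back to `ip`."""
    try:
        # Reverse lookup: IP -> hostname (first element of the result tuple).
        host = reverse(ip)[0].lower().rstrip(".")
    except OSError:
        return False
    if not host.endswith(GOOGLE_SUFFIXES):
        return False
    try:
        # Forward lookup: hostname -> list of IPv4 addresses (third element).
        addresses = forward(host)[2]
    except OSError:
        return False
    # A spoofed PTR record fails here, because the forward lookup
    # will not point back at the visitor's address.
    return ip in addresses
```

Call `is_googlebot(ip)` with an address taken from the access log. The resolver parameters exist only so the logic can be exercised without network access.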

女痞
#5 · 2019-04-23 08:39

Maybe give the Google robots.txt checker a try.
