They don't respect robots.txt, and their search is worth shit on this site anyway. Why do they even bother to crawl it?
https://patrick.net/robots.txt
Note that the first thing I do is to tell Google to fuck off:
But Google disrespects the wishes of site owners and indexes anyway! Proof from my web server log:
And that is not a spoof of Google's bot, because 34.32.251.230 is really a Google IP:
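One common way to check whether a crawler address is genuine is a reverse DNS lookup followed by a forward lookup that has to return the same address. Here is a minimal sketch of that check, not necessarily the one used above. The function name `is_google_crawler` and the suffix list are assumptions for illustration, and the `reverse` and `forward` parameters exist only so the logic can be tried without network access.

```python
import socket

# Assumption: hostnames of genuine Google crawlers end in one of these
# suffixes. Check Google's own documentation before relying on this list.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def is_google_crawler(ip, reverse=None, forward=None):
    """Reverse-resolve ip, check the hostname suffix, then forward-confirm.

    reverse and forward are hypothetical injection points for testing;
    by default they use the system resolver (IPv4 only).
    """
    reverse = reverse or (lambda addr: socket.gethostbyaddr(addr)[0])
    forward = forward or (lambda host: socket.gethostbyname_ex(host)[2])
    try:
        host = reverse(ip)
        if not host.endswith(GOOGLE_SUFFIXES):
            return False
        # The forward lookup must list the original address.
        return ip in forward(host)
    except OSError:
        # socket.herror and socket.gaierror are subclasses of OSError.
        return False
```

Calling `is_google_crawler("34.32.251.230")` with the default resolvers would run the check against live DNS. An address that belongs to Google's cloud range but is not one of its crawlers could still fail this test, since the suffix list only covers crawler hostnames.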
So I wrote to google-cloud-compliance@google.com asking them to stop, but they have neither replied nor stopped.
Summary: Google is evil, and will try to index your site whether you ask them to stop or not.
PS I have taken measures on the server side now to block Google by IP address, returning a 403 Forbidden to them.
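For anyone who wants to do the same, here is a minimal sketch of the idea, not the actual code running on this server. The names `BLOCKED_NETWORKS` and `status_for` are made up for illustration, and the list holds only the single address from the log mentioned above; a real block list would use whatever ranges you decide to refuse.

```python
import ipaddress

# Assumption: a placeholder block list containing only the one address
# quoted in the post. Replace with the networks you actually want to block.
BLOCKED_NETWORKS = [ipaddress.ip_network("34.32.251.230/32")]


def status_for(client_ip, blocked=BLOCKED_NETWORKS):
    """Return the HTTP status to send: 403 if client_ip is blocked, else 200.

    Raises ValueError if client_ip is not a valid IP address.
    """
    addr = ipaddress.ip_address(client_ip)
    if any(addr in net for net in blocked):
        return 403  # Forbidden
    return 200
```

In practice this kind of check usually lives in the web server or firewall config rather than in application code, but the membership test is the same.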