The goal robots.txt
is not to distinguish between private areas, because robots do not even have access to them. Instead, if you have garbage or something else that you do not want to be indexed by search engines or so.
Say for example. I write Flash games for entertainment, and I use GitHub pages so that games check for updates. I have this file hosted on my GHP, the entire contents of which
10579
2.2.3
https://github.com/iBug/SpaceRider/tree/master/SpaceRider%202
It contains three pieces of information: the internal version number of the new version, the display name of the new version, and a download link. Therefore, it is certainly useless when indexing with scanners, so when I have it robots.txt
, I would not want to be indexed.
source
share