When I do a site: search, one of the pages Google has looks like this:
User-agent: * Disallow: /file/ (this is the title)
User-agent: * Disallow: /file/ (this is the description)
www.webmasterworld.com/robots.txt - 1k - Supplemental Result
Am I missing something here? It seems there must be some very simple explanation for why G cached my robots.txt, making up its own title and all. The original file is fine, and it's on two lines, unlike the cached version:
User-agent: *
Disallow: /file/
Maybe I made a mistake, or misspelled a word?
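One way to rule out a mistake or misspelling in the file is to run the two lines through a robots.txt parser and see how the rules are read. Below is a minimal sketch using Python's standard urllib.robotparser; the helper name is_allowed and the test paths are made up for illustration, and this only shows how one parser reads the rules, not how Google itself behaves.

```python
from urllib.robotparser import RobotFileParser

# The two-line file quoted above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /file/
"""


def is_allowed(robots_txt, url, user_agent="*"):
    """Return True if user_agent may fetch url under the given rules.

    is_allowed is a hypothetical helper name, not part of any library.
    """
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for path in ("/file/page.html", "/index.html", "/robots.txt"):
        url = "http://www.webmasterworld.com" + path
        print(path, is_allowed(ROBOTS_TXT, url))
```

Note that /robots.txt is not under /file/, so nothing in these rules tells a crawler to stay away from the robots.txt file itself.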
But barring pushing the envelope, only URLs with links stay in the index. That's how Google works; and why no site ever needs submitting; just link to it, and Google will, er, follow the links :)
There's never a need to link to a robots.txt file. Google will find that if the domain is indexed.
On the other hand, having that in Google's index really will do no harm (and no good). Best just to be normal, however - too much experimenting can be harmful to your income ;)
[edited by: Quadrille at 2:55 am (utc) on Jan. 30, 2007]
On the other hand, having that in Google's index really will do no harm (and no good). Best just to be normal, however - too much experimenting can be harmful to your income ;)
Yeah, but if Google has a robots.txt in its index, doesn't this mean that it does not look at it as a robots.txt but as just a simple text file? Will it obey the Disallow?
P.S. or I can submit a page through the sitemaps :)
Is your robots.txt listed in your sitemap file? It seems to me that *might* be considered a link.
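If you want to check that quickly, here is a rough sketch that scans a sitemap for a robots.txt entry. It assumes the sitemap uses the sitemaps.org 0.9 XML namespace (older Google Sitemaps files used a different one), and the function name and sample data are invented for the example.

```python
import xml.etree.ElementTree as ET

# Assumption: the sitemap follows the sitemaps.org 0.9 schema.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_lists_robots(sitemap_xml):
    """Return True if any <loc> entry in the sitemap points at a robots.txt.

    sitemap_lists_robots is a hypothetical helper name.
    """
    root = ET.fromstring(sitemap_xml)
    for loc in root.findall(".//sm:loc", SITEMAP_NS):
        url = (loc.text or "").strip()
        if url.endswith("/robots.txt"):
            return True
    return False


# Made-up sample sitemap, for illustration only.
SAMPLE = """\
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://www.example.com/</loc></url>
  <url><loc>http://www.example.com/robots.txt</loc></url>
</urlset>
"""

if __name__ == "__main__":
    print(sitemap_lists_robots(SAMPLE))
```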
I'm beginning to wonder if all this was and is related to the 950 penalty. Many members reported having their supplementals listed above their main pages when first hit by the 950 penalty... I wonder how many have a robots.txt indexed as well?
P.S. or I can submit a page through the sitemaps.
You surely can; you surely can. But it remains a pointless exercise. The way to effectively have a page indexed is to link to that page. Period. "Forcing" Google, repeatedly, to include a file that shouldn't be there does not sound like an appropriate use of sitemaps or of your time; indeed, getting a robots.txt indexed does not sound particularly useful, either.
Whether Google cares either way, I couldn't know. But if they do care, you can bet that's not 'care' as in 'fond affection'.
If you 'care' about your site, I'd suggest you stop playing games with it - sooner or later, the dragon will stir.
Never forget the Hogwarts motto: "Draco dormiens nunquam titillandus," which means "Never tickle a sleeping dragon." ;)
[edited by: Quadrille at 11:32 am (utc) on Jan. 30, 2007]