Forum Moderators: Robert Charlton & goodroi
in my robots.txt I have blocked javscript and was wondering why it is recommended not to block it.
A description for this result is not available because of this site's robots.txt – learn more.
Is it because there is a link somewhere else on the web for that page ?
[edited by: Robert_Charlton at 10:30 am (utc) on Jan 3, 2014]
why do I have for example my page www.mywebsite.com/administrator indexed ( with the message " description not available ) in google when I type site:mywebsite.com when in my robots it says
Disallow: /administrator/
<meta name = "robots" content = "noindex"> <FilesMatch "\.(js|txt|xml)$">
Header set X-Robots-Tag "noindex"
</FilesMatch>
I blocked everything in order to avoid having pages indexed but I am not sure it is the right thing to do ?
why is it bad to block javascript in robots ?
Is it because there is a link somewhere else on the web for that page ?
Does having pages like this diluate page rank ?
Could it be possible that it find those pages on server logs and considers those as duplicate content even if we added a 410 command on our site to remove those pages from google index ?
according to the webmaster tools it did remove almost all of them
"remove" from where?
note that besides anchor elements and the other usual places, a discoverable url may appear in a link element in the head of your document, for example.Even worse, Google would use the URL it sees in HTML comments for discovery. If it looks like URL, Google will try it.
we had a major bug with joomla about a year ago (...) My worry is that we can't get our ranking back because all those pages that it found and created are still on public server logs ?
I have a robots.txt / module but google still shows pages ( with " description not available " ) that include /module in the web address.
I have a robots.txt / module AND THEREFORE google shows pages ( with " description not available " ) that include /module in the web address.