how to block a page on a particular location
experienced msg:3420669 11:04 am on Aug 13, 2007 (gmt 0)
I want to block a particular file that sits under many different folders with the same file name.
domain.com/folder2/file.php
domain.com/folder3/file.php
domain.com/folder4/file.php
I have approx 15000 folders like this where I want to block file.php in every folder. What is the best way to block them? I don't want to make the robots.txt file so lengthy by putting all the URLs inside. Is there any other quick solution?
help appreciated in advance :-)
[fixed confusing typo in title]
[edited by: goodroi at 1:54 pm (utc) on Aug. 13, 2007]
goodroi msg:3421723 1:02 pm on Aug 14, 2007 (gmt 0)
An easy way to handle this with the big search engines is to use wildcards, a.k.a. pattern matching. IMPORTANT: this is not supported by most other bots.
Google Robots.txt Pattern Matching Explained
[ ...] google.com
Yahoo Robots.txt Wildcards Explained
[ ...] ysearchblog.com
experienced msg:3426850 2:20 pm on Aug 20, 2007 (gmt 0)
Will this expression block this file at any level?
If yes, then this would be really helpful for me.
experienced msg:3428558 4:49 am on Aug 22, 2007 (gmt 0)
pls reply if possible
new_seo msg:3428581 5:39 am on Aug 22, 2007 (gmt 0)
If you want to block a particular file which is located under various folders with the same file name, then I think you can do it simply by:
User-Agent: *
Disallow: /file.php
goodroi msg:3429141 6:01 pm on Aug 22, 2007 (gmt 0)
This will block Google from all filenames containing "my-file-name.php"
Please remember that most smaller search engines do not support this in robots.txt. Also if you want to test other combinations for Googlebot you can go to Google's Webmaster Central and use their robots.txt analysis tool.
g1smd msg:3431010 9:47 am on Aug 24, 2007 (gmt 0)
This will disallow any URL that begins with /file.php:
Disallow: /file.php
... and that only works for URLs in the ROOT, i.e. URLs that BEGIN with that.
You need the * to make it work for folders:
Disallow: /*file.php
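To see why the * matters, here is a minimal Python sketch of the Google-style matching described in this thread: a rule is matched from the start of the URL path, and * stands for any run of characters. The function names are made up for illustration. The trailing $ end anchor comes from Google's published pattern-matching rules rather than from this thread. Real crawlers handle more cases (Allow lines, rule precedence, percent-encoding), so treat this as an approximation, not Googlebot's actual code.

```python
import re


def rule_to_regex(rule):
    """Turn a Disallow value into a regex (hypothetical helper)."""
    anchored = rule.endswith("$")  # "$" pins the match to the end of the path
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and let each "*" match any run of characters.
    body = ".*".join(re.escape(piece) for piece in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(rule, path):
    """True if `path` is blocked by `rule`."""
    if not rule:
        return False  # an empty Disallow value blocks nothing
    # re.match only matches at the start of the string, i.e. a prefix match.
    return rule_to_regex(rule).match(path) is not None


for rule in ("/file.php", "/*file.php"):
    for path in ("/file.php", "/folder2/file.php", "/folder2/my-file.php"):
        print(rule, path, is_disallowed(rule, path))
```

Note that /*file.php also catches /folder2/my-file.php, because * can match any run of characters, including part of a file name.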
This rule MUST go in the User-Agent: Googlebot section.
Other bots do not understand the * wildcard.
If you have a User-agent: Googlebot section, then ALL of your rules for Googlebot must go in that section, as Googlebot will then completely IGNORE the User-agent: * section.
You do this even if it means duplicating a lot of stuff into both sections.
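The section rule described above can be sketched in Python as well. The parser below is a simplified illustration, not any crawler's real implementation: it only reads User-agent and Disallow lines and matches crawler names exactly. The helper names and the /private/ path are made up for the example.

```python
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/

User-agent: Googlebot
Disallow: /private/
Disallow: /*file.php
"""


def parse_groups(robots_txt):
    """Map each user-agent token (lowercased) to its list of Disallow values."""
    groups = {}
    agents = []        # user-agents named by the section being read
    in_rules = False   # True once the current section has a rule line
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:  # a User-agent line after rules starts a new section
                agents, in_rules = [], False
            agents.append(value.lower())
            groups.setdefault(value.lower(), [])
        elif field == "disallow":
            in_rules = True
            for agent in agents:
                groups[agent].append(value)
    return groups


def rules_for(groups, crawler):
    """Use the crawler's own section if present, otherwise the * section."""
    return groups.get(crawler.lower(), groups.get("*", []))


groups = parse_groups(ROBOTS_TXT)
print(rules_for(groups, "Googlebot"))
print(rules_for(groups, "SomeOtherBot"))
```

If the Googlebot section listed only /*file.php, rules_for would return just that one rule for Googlebot and /private/ would no longer be blocked for it, which is why the shared rules are duplicated into both sections.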