| 12:27 am on Jan 12, 2002 (gmt 0)|
Welcome to Webmaster World, Mud.
| 7:32 pm on Jan 12, 2002 (gmt 0)|
Hmm.. that's not quite what I meant! :)
I mean I want to have one line of code in my robots.txt to include all my pages that have that pattern.
There are lots of artists:
Is there a way to do this with one line of code in the robots.txt, or should I give up looking? :p
| 7:44 pm on Jan 12, 2002 (gmt 0)|
Sure, just disallow that mid-level subdir and the rule will cover everything in the dirs below it.
| 8:08 pm on Jan 14, 2002 (gmt 0)|
I'm having a hard time believing I read this here. See [robotstxt.org]:
|Note that there can only be a single "/robots.txt" on a site. Specifically, you should not put "robots.txt" files in user directories, because a robot will never look at them. If you want your users to be able to create their own "robots.txt", you will need to merge them all into a single "/robots.txt". If you don't want to do this your users might want to use the Robots META Tag instead. |
I read this as meaning you shouldn't expect any spider to ever look at robots.txt in subdirectories.
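To make the quoted rule concrete, here is a minimal Python sketch of how a crawler might work out which robots.txt to request for any page. The host name, the page path, and the helper name `robots_url` are made up for illustration; real spiders may differ in detail.

```python
from urllib.parse import urljoin

def robots_url(page_url):
    """Return the only robots.txt location a compliant robot would request.

    Joining with an absolute path ("/robots.txt") discards whatever
    directory the page lives in, so subdirectory copies are never seen.
    """
    return urljoin(page_url, "/robots.txt")

# Hypothetical page deep inside a user directory.
print(robots_url("http://www.example.com/~user/music/index.html"))
```

Whatever directory the page sits in, the result points at the site root, which is why a robots.txt placed in a subdirectory is ignored.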
| 8:10 pm on Jan 14, 2002 (gmt 0)|
Scareduck - welcome to WmW :)
> I read this as meaning you shouldn't expect any spider to ever look at robots.txt in subdirectories.
You got it. Only /robots.txt ever gets requested.
The line required in the above example to do this is
No directory below dir will be indexed by standards-compliant robots.
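The prefix behaviour can be checked with Python's standard `urllib.robotparser`. This is only a sketch: the rule `Disallow: /dir/`, the test paths, and the user-agent name `ExampleBot` are placeholders, not the actual lines from this thread.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: "/dir/" stands in for the directory being blocked.
rules = [
    "User-agent: *",
    "Disallow: /dir/",
]

parser = RobotFileParser()
parser.parse(rules)

def allowed(path):
    """True if a compliant robot (here called "ExampleBot") may fetch path."""
    return parser.can_fetch("ExampleBot", path)

# Disallow matches by path prefix, so every level below /dir/ is covered.
for path in ("/dir/page.html", "/dir/sub/deeper/page.html", "/other/page.html"):
    print(path, "allowed" if allowed(path) else "blocked")
```

Because the match is on the start of the path, a single Disallow line covers the whole tree beneath it, but it cannot match a directory name that appears in the middle of a path.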
| 3:21 am on Jan 17, 2002 (gmt 0)|
However, I still want:
to be spidered, just not:
I know I can do the:
It would just be a lot of work to get a list of 6,000 artists to block the /old/ from. But I guess it sounds like that's the way I'll have to go? :(
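If one line per artist is the only option, the list does not have to be typed by hand. A rough Python sketch, assuming the artist directory names can be pulled from a directory listing or a database: the names below, the `/<artist>/old/` layout, and the helper `build_robots_txt` are all invented for illustration, since the real paths are not shown here.

```python
def build_robots_txt(artist_dirs, blocked_subdir="old"):
    """Build robots.txt text with one Disallow line per artist directory."""
    lines = ["User-agent: *"]
    for artist in artist_dirs:
        # Strip stray slashes so each rule comes out as /<artist>/old/
        name = artist.strip("/")
        lines.append(f"Disallow: /{name}/{blocked_subdir}/")
    return "\n".join(lines) + "\n"

# Hypothetical artist directories; a real list might come from os.listdir().
artists = ["artist-one", "artist-two", "artist-three"]
print(build_robots_txt(artists), end="")
```

The output can be saved as the single /robots.txt at the site root and regenerated whenever artists are added.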