
Forum Moderators: goodroi


Robot.txt File

What is its importance?

     

Enigmatic

10:26 pm on Feb 24, 2004 (gmt 0)

10+ Year Member



Can anyone tell me the importance of the robot.txt file? And is it robot.txt or robots.txt?

I e-mailed my web host asking where mine was located on the server, and they told me they never use one. Should I be trying to make one for the SEs, and where should I upload it? Thanks,

Enigmatic

closed

8:12 pm on Feb 25, 2004 (gmt 0)

10+ Year Member



robots.txt is usually used to restrict robots from crawling and indexing parts of a site.

pr0purgatory

12:50 pm on Mar 15, 2004 (gmt 0)

10+ Year Member



If you have a META tag instruction for robots, do you need a robots.txt?

Alternative Future

12:55 pm on Mar 15, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



>>If you have a META tag instruction for robots, do you need a robots.txt?

Most SEs will/should obey a robots.txt file, while fewer will read the META tag.

I'm open to correction on this, but that is what I have come to understand.

-george

closed

5:38 pm on Mar 16, 2004 (gmt 0)

10+ Year Member



>>If you have a META tag instruction for robots, do you need a robots.txt?

It depends.

At the very least, you should put up an empty robots.txt to cut down on the 404s that robots generate when they request it from your site.

Using META tags allows you to control how robots index or follow links on your pages.

Using robots.txt restricts access to files (text and binary), as well as directories.
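As a rough sketch of the robots.txt side, Python's standard `urllib.robotparser` can show how a well-behaved robot would read a rule. The `/private/` directory and the example.com URLs are made up for illustration, and `allowed` is just a helper name for this example.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: keep every robot out of one directory.
RULES = """\
User-agent: *
Disallow: /private/
"""


def allowed(rules: str, url: str, agent: str = "ExampleBot") -> bool:
    """Return True if a compliant robot may fetch `url` under `rules`."""
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    # A page outside the disallowed directory.
    print(allowed(RULES, "http://www.example.com/index.html"))
    # A page inside the disallowed directory.
    print(allowed(RULES, "http://www.example.com/private/notes.html"))
    # An empty robots.txt restricts nothing.
    print(allowed("", "http://www.example.com/private/notes.html"))
```

Keep in mind this only models robots that choose to obey the file; it does not stop anything that ignores it.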

MarkHutch

5:46 pm on Mar 16, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I'm probably opening up a can of worms here, but here we go.

I have wondered for the past year whether there may come a time when spiders will not visit sites without a robots.txt file. The reason is the increasing number of privacy laws around the world. With email it's opt-in; with any information we gather on our sites it's a published privacy policy. In the future, new laws may require web spiders to get an opt-in (a robots.txt permission slip) before they visit and list pages. I'm just commenting on the way the wind is blowing these days. I'm sure many people disagree with me, but I don't think it hurts to tell all robots it's OK to spider your site. We've added this information to all our domains and subdomains over the past few months. I hope to hear some more comments on this.

User-agent: *
Disallow:
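For anyone who wants to check what those two lines mean, here is a minimal sketch using Python's standard `urllib.robotparser`. An empty `Disallow:` leaves nothing off limits. The bot name and URL are placeholders, not anything from a real site.

```python
from urllib.robotparser import RobotFileParser

# The two-line "permission slip": every robot, nothing disallowed.
ALLOW_ALL = ["User-agent: *", "Disallow:"]

parser = RobotFileParser()
parser.parse(ALLOW_ALL)

# "ExampleBot" and the URL are placeholders for illustration.
may_crawl = parser.can_fetch("ExampleBot", "http://www.example.com/any/page.html")
print(may_crawl)
```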

hannamyluv

8:52 pm on Mar 16, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



To tell you the truth, I don't think it will be an issue. It just isn't something the average user (aka politician) understands or even knows about. Heck, I think a survey of websites will show that 50% of webmasters don't know about them.

If nobody knows, nobody will make a law about it.

 
