homepage Welcome to WebmasterWorld Guest from 107.20.109.52
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Pubcon Platinum Sponsor 2014
Home / Forums Index / WebmasterWorld / New To Web Development
Forum Library, Charter, Moderators: brotherhood of lan & mack

New To Web Development Forum

    
Slurp looking for long gone web pages?
Boss didn't like 5,295 404 errors!
kicsibaba




msg:967405
 11:03 pm on Mar 16, 2003 (gmt 0)

Yesterday, I couldn't spell webmaster and today I are one! Long story how I got to be one, but it sure looks like I've come to the right place for help.

I renovated my client's website last August - same domain name with totally new content (not one page has the same name). I just got access to the log files and produced a report showing 5,295 404 Not Found errors since February 1. I knew I couldn't have been that careless...

Looking back through the logs to find all those 404 errors, a large portion of them are ones that look like (I hope this is OK to show this?)
GET /va.htm - 404 Mozilla/5.0+(Slurp/cat;+slurp@inktomi.com;+http://www.inktomi.com/slurp.html)
Note: This is a page from the old August 2002 site,
GET /scripts/..%5c%5c../winnt/system32/cmd.exe /c+dir 404
Note: Is this a hacker type object? Please forgive my ignorance, but why would anybody want my server's cmd.exe file?!
Also found GET /default.ida ones - from the postings I looked at here, this appears to be the Code Red worm but I didn't see any recent information...I'm not in charge of security for the server that my site is on, but how loud (if at all) should I yell?

How can I get inktomi/slurp to stop asking for pages that haven't existed in months? Thanks for any help and insight you can give a real newbie (and sorry about my lengthy first post).

 

pendanticist




msg:967406
 11:17 pm on Mar 16, 2003 (gmt 0)

INK isn't the only stupid bot.

[webmasterworld.com...] msg#3

<added>In fact:

cdev1000.inktomisearch.com - - [16/Mar/2003:15:58:17 -0800] "GET /1CrimDrugAw.html HTTP/1.0" 301 273 "-" "Mozilla/5.0 (Slurp/cat; slurp@inktomi.com; [inktomi.com...]

INK just now asked for a page I have not had in nearly a year....

How stupid can you get?

I mean, why should the site owner pay for the bandwidth used in repeated calls for old or outdated pages?

INK isn't the only one. When are they going to either design an intelligent bot, or revamp the ones they have? Grub isn't any smarter.

</added>

Pendanticist.

[edited by: pendanticist at 12:14 am (utc) on Mar. 17, 2003]

kicsibaba




msg:967407
 12:12 am on Mar 17, 2003 (gmt 0)

I gather from following your links to another post and on to other forums and posts from there, I obviously need to do a 301 redirect. I'm with you so far...

Can you point me in the right direction about how to do that, based on the fact that my site is on a Windows 2000 dedicated server that I have practically no administrative rights to.

I've searched and searched and searched on IIS 301 redirect stuff and still can't make heads or tails of it.

Thanks again for any help.

pendanticist




msg:967408
 12:17 am on Mar 17, 2003 (gmt 0)

Can you point me in the right direction about how to do that, based on the fact that my site is on a Windows 2000 dedicated server that I have practically no administrative rights to.

I know nothing of dedicated servers.

However, in my ignorance it would seem you could put them in your .htaccess file anyway, correct? Perhaps, I should ask: Do you have access to them?

Pendanticist.

kicsibaba




msg:967409
 12:32 am on Mar 17, 2003 (gmt 0)

pendanticist,

I am looking through the directories I have rights for via my FTP program (my only option) - so in answer to your question, I don't have a clue...there is nothing named .htaccess anywhere under the /mydomain/ directory where my site resides. Any idea where it usually (or should) reside (even vague directions would help)?

Thanks a million!

pendanticist




msg:967410
 12:47 am on Mar 17, 2003 (gmt 0)

Any idea where it usually (or should) reside (even vague directions would help

Mine resides near the top of the public_html pages just below the last folder.

Maybe you should start another thread. Title it as accurately as you can and see who bytes on it.

First Line: Windows 2000 dedicated server

Second Line: Need help with .htaccess

Something like that.

There are tons and tons of eminently more qualified folks here to help, so hang in there as someone will help you.

Sorry. I just don't know enough about your type of server rights.

<added> You are most Welcome!</added>

Pendanticist.

kicsibaba




msg:967411
 1:01 am on Mar 17, 2003 (gmt 0)

pendanticist,

Thanks for your advice - just one last stupid question...best to post that question on another forum? or stay on the New to Web Development one (where I belong!).

And thanks for a most pleasant first experience at webmasterworld.com!

pendanticist




msg:967412
 1:10 am on Mar 17, 2003 (gmt 0)

Thanks for your advice - just one last stupid question...best to post that question on another forum? or stay on the New to Web Development one (where I belong!).

Website Technology Issues [webmasterworld.com] "From webservers to text editors, dealing with the alphabet soup of programs and protocols."

This one looks more appropriate.

As for your compliment kicsibaba, I can only say - it was a pleasure :)

Happy Posting!

Pendanticist.

kicsibaba




msg:967413
 1:23 am on Mar 17, 2003 (gmt 0)

pendanticist (thank goodness that was on my clipboard!),

Just doing this first venture into posting has been a great learning experience.

Best regards and a very sincere "Thank you".
Kicsibaba

pendanticist




msg:967414
 1:25 am on Mar 17, 2003 (gmt 0)

Youbetcha! :)

Pendanticist.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / WebmasterWorld / New To Web Development
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved