Forum Moderators: DixonJones

Message Too Old, No Replies

Don't understand entry in log

GET...Yahoo ....

         

mayor

8:52 am on Nov 24, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Every so often I see a log entry like this:

61.187.156.251 - - [23/Nov/2002:21:29:36 -0500] "GET [yahoo.com...] HTTP/1.1" 200 6732 "-" "Mozilla/4.0 (compatible; MSIE 4.01; Windows 98)"

My site has nothing to do with Yahoo.

Notice it appears to be a successful hit, returning 6732 bytes to the visitor.

Can anyone decipher this hit for me?

Brett_Tabke

9:29 am on Nov 24, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



Looks like they are requesting a filenamed "http://www.yahoo.com".

I'd go double check your links. If you have a link to yahoo somewhere, there might be an html error there. Look for something like:

<a href="/http://www.yahoo.com">...</a>

Where the browser could be requesting that as the filename.

If your site is configured just right, it is probably returning your index page. (does that size match up with your index page?)

Someone could have also linked to you wrong. Might double check if there is ever a referrer associated with it.

Is it always the same ip? Then it might be a link on the users local homepage and they botched the link.

mayor

9:49 am on Nov 24, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Thanks for the reply, Brett.

There's no links to yahoo anywhere on the site.

The size of the home page is 6732 bytes, so that's what was served to them. Good sleuth, Brett.

"-" in the string indicates there is not a known referrer

Brett_Tabke

9:53 am on Nov 24, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



It might also be someones bot trying to find your 404 page. Given where that ip ends up at, I'd guess a botched page link. English and html would be a second language over there.

spock

10:08 am on Nov 24, 2002 (gmt 0)

10+ Year Member



GET [yahoo.com...] HTTP/1.1

This is how a proxy request for Yahoo's home page looks like. Someone is trying to use your web server as a proxy. I gets lots of those log entries; there appears to be many people scanning for open web proxies.

Looks like they are requesting a filenamed "http://www.yahoo.com".

No - the request would look like "GET /http://www.yahoo.com/ HTTP/1.1" (note the slash) if that was the case.

Brett_Tabke

10:22 am on Nov 24, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



It depends on the browser and the actual html error spock.

spock

10:45 am on Nov 24, 2002 (gmt 0)

10+ Year Member



You are saying there's a browser that will translate a link such as <a href="/http://... into a "GET [...."...] request? What browser is that?

Sure, there are some broken browsers out there, but I still think a proxy request is the most likely explanation for this. I get the exact same log entries even for small sites where I know all links to be correct.

mayor

5:18 pm on Nov 24, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



>> people scanning for open web proxies

This is over my head. Would this be some kind of hackers or what? Why would they want someone else's web server to be a proxy server, and why for Yahoo's home page? Are they trying to spoof the referrer?

jdMorgan

5:33 pm on Nov 24, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I could be either a malformed link or an open-proxy probe:

http://www.webmasterworld.com/http://www.mydomain.com/ [webmasterworld.com]

Jim

spock

5:55 pm on Nov 24, 2002 (gmt 0)

10+ Year Member



Why would they want someone else's web server to be a proxy server, and why for Yahoo's home page?

An open proxy server (that is, one that doesn't require authentication) can be used to conceal what address you are connecting from. There could be many reasons for wanting to do that - including hacking attempts, DoS attacks, and various other illegal acts. They're only using Yahoo for testing whether the proxy is open, it could just as well be any other site.

mayor

6:38 pm on Nov 24, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



That sure opens my eyes, Spock, thanks.

Can I tell from the log file whether their attempt succeeded or failed?