Forum Moderators: phranque

Message Too Old, No Replies

404 Code from Https link

         

StillTrying

12:12 pm on Jun 17, 2006 (gmt 0)

10+ Year Member



Morning Folks,

My website logs revealed an Amsterdam IP address (proxy?) that scanned all my webpages and then generated a 404 error code for about 15 webpages similar to:

"GET /https://mysite.com/contact.html HTTP/1.1" 404 972 "-" "Java/1.5.0"

Looking at other log files I have never seen a "GET /https: … " command.

Is the 404 a function of the "Get /https …" or is something fishy going on?

Comments much appreciated.

StillTrying

aaanativearts

12:57 pm on Jun 17, 2006 (gmt 0)

10+ Year Member



A 404 means "page not found". I don't know about the GET https: part. Perhaps they were looking for a backdoor to hack into your site? Some contact forms in some popular open source CMS systems are known to have an exploit hackers can use to gain entry if they haven't been patched.

jdMorgan

4:06 pm on Jun 17, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



"GET /https://mysite.com/contact.html HTTP/1.1" 404 972 "-" "Java/1.5.0"

It's likely that this is just a badly-written scraper or robot: The proper form for an HTTP(s) request header would be
"GET /contact.html HTTP/1.1" 404 972 "-" "Java/1.5.0"

That is, the HTTP(s) protocol or 'scheme' should not be included in the request header.

You could block all requests from that IP address or address range, block the Java/ user-agent, or simply let these requests go 404 -- They're not likely to be requests that would benefit you in any way should you 'correct' them using code on your server.

Jim

StillTrying

4:21 pm on Jun 17, 2006 (gmt 0)

10+ Year Member



Jim -

Thanks much - I am just paranoid that the person may be trying or / have hijacked my website somehow -

i.e. they hijack the site for certain IP addresses (not all, but many), then they use the above "get" command to check to see if their code is blocking access to the site.

Regards from "The New Guy" -

Still Trying