Welcome to WebmasterWorld Guest from 54.162.107.122

Forum Moderators: Robert Charlton & andy langton & goodroi

Message Too Old, No Replies

Google Indexing my Home Page as https://.

Why is this happening?

     
5:59 pm on Mar 29, 2006 (gmt 0)

Senior Member from US 

WebmasterWorld Senior Member pageoneresults is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Apr 27, 2001
posts:12172
votes: 61


Why is Google indexing my home page as https?

I do a site: search and right there in the #1 position is my home page with an https link.

And no, we do not link to the home page with https.

There are no inbounds that we can find linked to the https version. And I would have thought that by default, if the damn page and/or directory is not set up as https that there should be no reason for you to index it as https.

I'm on a Windows Server running IIS 6. I understand there is a way to prevent this on Apache, how do I do it on IIS?

9:51 pm on Mar 29, 2006 (gmt 0)

Administrator from US 

WebmasterWorld Administrator brett_tabke is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Sept 21, 1999
posts:38073
votes: 17


Maybe the question is: why is the server allowing https versions of the same pages?

I don't think there is anything new here pg1. The https indexing was first spotted in 2002:
[webmasterworld.com...]

Looks like an unfriendly linked to you with https in the url?. Same with the nonWWW and ip address versions of your homepage. It is a classic compeitor trick to mess with your pr values.

I say - tweak it at the server level and that should fix it up?

10:10 pm on Mar 29, 2006 (gmt 0)

Junior Member

10+ Year Member

joined:Nov 2, 2005
posts:140
votes: 0


Hi,

We just been through the same problem.

We fixed it by removing the SSL cert from the main site and serving the https pages on the .net version of our domain.

When the https page(s) show true 404 errors via the headers you can ask Google Help to remove the pages and they will do it quite quickly.

The Google remove tool will not work with https and would take ages anyway.

The next stage was to make ALL internal link absolute because if you use relative links you only need one site to link to your homepage with https to find EVERY page in your site indexed as https. It happended to us.

Two weeks after it happend we are now back with a http homepage site starting to get indexed and all the https pages removed.

Google Help can sometime be quite efficient they simply pick and choose subjects they are willing to assist with. Https pages is one of them!

Good luck.

3:54 am on Mar 30, 2006 (gmt 0)

Administrator from US 

WebmasterWorld Administrator brett_tabke is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Sept 21, 1999
posts:38073
votes: 17


How about putting your ssl on a sub domain like ssl.example.com or secure.example.com.
9:19 am on Mar 30, 2006 (gmt 0)

Junior Member

10+ Year Member

joined:Nov 2, 2005
posts:140
votes: 0


>>>just put your ssl on a sub domain like ssl.example.com or secure.example.com.<<<

Be careful if you use a windows server via IIS.

Windows servers can only have one SSL cert per IP address.

If the SSL cert is removed from the main domain and a new/ammended cert is added to a new sub domain with the same IP address a call for:

[maindomain.com...] will load [subdomain.domain.com...] as that is where the SSL cert now is in relation to the IP address - (after a warning message about the cert name being incorrect.)

Therefore incorrectly indexed https main domain pages will not show a 404 and may not be removed.

You can get around the above by ensuring that the secure pages on the sub domain have different page urls to the non secure pages.

The index page will be a problem as it will always be index and you may wish to delete the index page on the sub domain until Google have removed the incorrect [maindomain.com(...] listing from the index.

2:53 pm on Mar 31, 2006 (gmt 0)

Senior Member from US 

WebmasterWorld Senior Member pageoneresults is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Apr 27, 2001
posts:12172
votes: 61


I just can't believe how difficult this has been trying to find an answer for this. How is this handled on Apache?

These are the answers I've gotten so far which pretty much are inline with the above...

From the reading I've done so far, there is not an IIS server config solution to do this. You'll need to handle it one of 2 (or more ) ways.

1. Global include above header as include file in all HTTP only pages.

2. Separate out the https and http application, example, make new subdomain "secure.example.com"

Then you'll need cert for that domain, and all https traffic is written to [secure.example.com"...] and we disable SSL on the www domain, so you don't have to deal with the redirect code on top of each page or the competitor sabotage thing.

And then I have to come back to my original question, why would Google index an https link like that? This technical stuff makes my brain hurt! ;)

11:31 pm on Mar 31, 2006 (gmt 0)

Senior Member

WebmasterWorld Senior Member tedster is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:May 26, 2000
posts:37301
votes: 0


As far as I know, Google indexes an https: url because

1. somewhere there is a relative link from an https: page (so the protocol does not show in the base url)
... or
2. someone is intentionally pointing https: links at a url.

Using a subdomain for the secure protocol is the solution that all my long term clients have in place -- their http: urls will not resolve with the https: protocol, so those urls don't get indexed, even if they appear somewhere on the web. Then on secure pages, any links to http: pages need to use full, absolute urls.

This is one of those areas where setting it up well from the start is a lot easier than fixing things after trouble develops. I'm still scrambling to uncover best practices for a repair job.

[edited by: tedster at 5:26 am (utc) on April 1, 2006]

5:18 am on Apr 1, 2006 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member

joined:Apr 5, 2002
posts:884
votes: 2


Pg1, you may find this useful too - [webmasterworld.com...]