Forum Moderators: open

Message Too Old, No Replies

How to Block Static(html) HTTPS pages?

         

jameswatt

8:52 am on Jun 13, 2008 (gmt 0)

10+ Year Member



Hi folks,

My site has few pages in HTTPS version and few in HTTP version.
Problem is I have linked certain pages like home page, sitemap page and services page links in Footer section of HTTPS version pages, now Google has indexed my domain as [mydomain.com...] , yahoo and msn has indexed certain html HTTPS pages which are not linked in footer section of my HTTPS page

For example
Linked pages in footer section of HTTPS version pages
[mydomain.com...]
[mydomain.com...]
[mydomain.com...]

Pages which are not linked in footer section of HTTPS version pages but still got indexed in yahoo and msn
[mydomain.com...]
[mydomain.com...]
[mydomain.com...]

If you click on above pages all pages will redirect to respective HTTP version pages with 302 methods

Now big question how search engine (yahoo and msn) has indexed static html pages with HTTPS version without any link to any of my page.

How I can remove those [mydomain.com...] pages from robots.txt file or .htaccess file

Questions
How can I prevent indexing HTTPS version pages, excluding my landing page?
What should I do to stop crawling my main domain with HTTPS version (i.e. [mydomain.com)?...]

caveman

8:55 pm on Jun 18, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



You've probably created some paths for the engines to find the pages they've found...probably from one internal https page to another.

Just exclude the https pages in robots.txt.

Also, why are you 302-ing https pages to http pages (if I'm understanding the post correctly)? That's not good. The SE's will associate the https URL's with contents from the http target pages.

bilalseo

6:01 pm on Jun 19, 2008 (gmt 0)

10+ Year Member



use 301 redirect.. it would help you to target other pages having low worth or rare visited pages...