
Google indexing https pages as different pages?

     

helenp

9:47 am on Mar 23, 2013 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hi,
Google has been indexing many pages lately. Checking the indexed pages in Google, I saw results like this:

Widgets in placename with gizmo for sale
https://www.mysite.com/sales/widgets_for_sale_placename.htm?id=113...
Description 1

Widgets in placename with gizmo for sale
www.mysite.com/sales/widgets_for_sale_placename.htm?id=113
Description 1

Pages blocked by robots.txt that have no parameters are also duplicated like that (I had to click "view more pages" in Google search to see these).

Do I have something wrong in my htaccess file?
AddType application/x-httpd-php5 .htm .html
RewriteEngine On
RewriteCond %{SERVER_PORT} !^443$
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /([^/]+/)*index\.html?\ HTTP/
RewriteRule ^(([^/]+/)*)index\.html?$ http://www.mysite.com/$1 [R=301,L]
RewriteCond %{SERVER_PORT} ^443$
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /([^/]+/)*index\.html?\ HTTP/
RewriteRule ^(([^/]+/)*)index\.html?$ https://www.mysite.com/$1 [R=301,L]
# Redirect non-canonical to www
RewriteCond %{SERVER_PORT} !^443$
RewriteCond %{HTTP_HOST} !^(www\.mysite\.com)?$
RewriteRule (.*) http://www.mysite.com/$1 [R=301,L]
RewriteCond %{SERVER_PORT} ^443$
RewriteCond %{HTTP_HOST} !^(www\.mysite\.com)?$
RewriteRule (.*) https://www.mysite.com/$1 [R=301,L]
# Do not allow URLs such as mysite.com/pagina1.htm/maps/ etc.
RewriteRule ^((?:[^./]+/)*[^./]+\.(?:html?|php))/ http://www.mysite.com/$1 [R=301,L]



[edited by: Robert_Charlton at 4:39 pm (utc) on Mar 25, 2013]
[edit reason] removed specifics, per Charter [/edit]

helenp

5:01 pm on Mar 25, 2013 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Lucy and G1,
I have now changed the order and have it like this:

AddType application/x-httpd-php5 .htm .html

RewriteEngine On

# REDIRECT htm INDEX PAGES to index/
RewriteCond %{SERVER_PORT} !^443$
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /([^/]+/)*index\.html?\ HTTP/
RewriteRule ^(([^/]+/)*)index\.html?$ http://www.example.com/$1 [R=301,L]

RewriteCond %{SERVER_PORT} ^443$
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /([^/]+/)*index\.html?\ HTTP/
RewriteRule ^(([^/]+/)*)index\.html?$ [example.com...] [R=301,L]

# Get rid of extra path info
RewriteRule ^((?:[^./]+/)*[^./]+\.(?:html?|php))/ http://www.example.com/$1 [R=301,L]

# Redirect non-canonical to www
RewriteCond %{SERVER_PORT} !^443$
RewriteCond %{HTTP_HOST} !^(www\.example\.com)?$
RewriteRule (.*) http://www.example.com/$1 [R=301,L]

RewriteCond %{SERVER_PORT} ^443$
RewriteCond %{HTTP_HOST} !^(www\.example\.com)?$
RewriteRule (.*) [example.com...] [R=301,L]

lucy24

9:45 pm on Mar 25, 2013 (gmt 0)

WebmasterWorld Senior Member lucy24 is a WebmasterWorld Top Contributor of All Time Top Contributors Of The Month



# Get rid of extra path info

Does this problem occur only with http pages? Otherwise you'd need a pairing of rules, as with all the others.
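
A minimal sketch of what such a pairing could look like, assuming the https version of a page should stay on https. It reuses the pattern and the example.com placeholder from the post above, and the port tests simply mirror the other rule pairs in this thread. It is untested on the poster's server and is only meant to show the shape:

```apache
# Get rid of extra path info (http requests)
RewriteCond %{SERVER_PORT} !^443$
RewriteRule ^((?:[^./]+/)*[^./]+\.(?:html?|php))/ http://www.example.com/$1 [R=301,L]

# Get rid of extra path info (https requests)
# Assumption: https URLs should be redirected to their https form.
RewriteCond %{SERVER_PORT} ^443$
RewriteRule ^((?:[^./]+/)*[^./]+\.(?:html?|php))/ https://www.example.com/$1 [R=301,L]
```

Without the port conditions, the single rule as posted sends any https request with extra path info to the http URL.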

helenp

10:07 pm on Mar 25, 2013 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Does this problem occur only with http pages? Otherwise you'd need a pairing of rules, as with all the others.


If you mean the recent issue, /folder/page1.htm/page2.htm:
I only saw one page with the issue in Webmaster Tools. That does not mean there could not be more, and yes, it was http.

The previous issue, /folder/page.htm/folder/page1.htm, was also http.

I am not sure whether the rule covers the recent issue. I think not, since the issue appeared anyway, so I assume the rule handles the previous issue (/folder/page.htm/folder/page1.htm) and not the newly appeared /folder/page1.htm/page2.htm.

Thanks for everything, what a mess
 
