Apache Web Server Forum

    
.htaccess help to avoid index page duplicates
peego

10+ Year Member



 
Msg#: 4419629 posted 5:57 pm on Feb 20, 2012 (gmt 0)

Hi All,

We're a vendor on Clickbank, and this past week I can't find our index page in Google anymore, although the inner pages all rank fine as before. I've narrowed this down to possibly having multiple copies of our index page due to affiliate URLs through Clickbank, so I'm trying to fix this with an htaccess rewrite.

When I do a site:example.com search, the results don't show our plain index page. Instead, Google lists our index page with an affiliate's hop ID attached to the end. I think Google might be confused by all the different hop IDs from different affiliates, and this also leads to duplication issues.

For example, if our site URL was:

http://www.example.com

There would be many affiliate URLs that look like this:

http://www.example.com/?hop=#*$!XX (where #*$!xx is the affiliate's ID)

And http://www.example.com/?hop=#*$!XX is the type of URL we found had replaced our index page in Google's records (LOL), so I think there's some confusion and possibly a duplicate penalty. I totally overlooked this...

Now I'm trying to fix this through htaccess with a rule that would redirect all those types of URLs (http://www.example.com/?hop=#*$!XX) to just my base URL (http://www.example.com), without the ending ?hop=#*$!xx.

But I don't have any htaccess knowledge, and was hoping someone could please help me out and let me know what the 2 or 3 lines of code should be to achieve this?

Thanks so much in advance!

[edited by: tedster at 6:15 pm (utc) on Feb 20, 2012]
[edit reason] switch to example.com [/edit]

 

lucy24




 
Msg#: 4419629 posted 8:03 pm on Feb 20, 2012 (gmt 0)

Before you start redirecting:

Have you looked at the URL parameters area under Site Configuration in GWT? You can tell them to ignore certain parameters, so all versions of a page will collapse into one.

peego

10+ Year Member



 
Msg#: 4419629 posted 8:22 pm on Feb 20, 2012 (gmt 0)

ok, I looked in GWT under URL parameters, and changed the "HOP" parameter to:

"No: Doesn't affect page content (ex: tracks usage)"

So now Crawl is set to "Representative URL". There were a few hundred of them. I assume that now that I've changed this setting in GWT, Google would/should only crawl/index our original index page, correct?

So can someone help me with the htaccess code to now redirect all the URLs ending in "?hop=#*$!" to just the root domain? Thanks!
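
For reference, here is a minimal sketch of the kind of rule being asked for. It assumes Apache with mod_rewrite enabled, an .htaccess file in the site's document root, and www.example.com as the canonical hostname; none of that is confirmed in the thread. It 301-redirects a request for the root URL whose query string is only a hop parameter to the bare root URL:

```apache
# Sketch only: assumes mod_rewrite is available and this file sits in the document root.
RewriteEngine On

# Match only when the whole query string is a single hop=<id> parameter.
RewriteCond %{QUERY_STRING} ^hop=[^&]*$ [NC]

# In .htaccess the root URL is matched as the empty path (^$).
# The trailing "?" on the target drops the original query string.
RewriteRule ^$ http://www.example.com/? [R=301,L]
```

With this in place, a request for http://www.example.com/?hop=anything should answer with a 301 to http://www.example.com/, while requests with other or additional parameters are left alone. On Apache 2.4 the QSD flag can be used instead of the trailing "?". The sketch also assumes nothing on the landing page itself needs the hop value, so it is worth confirming that affiliate tracking still works before relying on it.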

g1smd




 
Msg#: 4419629 posted 8:29 pm on Feb 20, 2012 (gmt 0)

With the GWT settings in place you don't really need the redirect.

peego

10+ Year Member



 
Msg#: 4419629 posted 8:35 pm on Feb 20, 2012 (gmt 0)

OK, I see, thanks.

How about other search engines like Bing, though? I assume all these URLs ending in ?hop=#*$! could cause confusion in Bing just as they did in Google for us?

I was just browsing through our Bing WMT but can't seem to find a similar setting to tell Bing to ignore the "?HOP" parameter.

Would it be good practice to just redirect all those to our base domain anyway?

peego

10+ Year Member



 
Msg#: 4419629 posted 8:36 pm on Feb 20, 2012 (gmt 0)

Why does WebmasterWorld change all the "X X X" to "# * $!"? Strange.

lucy24




 
Msg#: 4419629 posted 4:40 am on Feb 21, 2012 (gmt 0)

Not half as strange as the fact that "D*****T****" is a dirty word :)

Bing has a parameter-tweaking function that is similar to Google's. I don't know about the other search engines.

WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved