Forum Moderators: Robert Charlton & goodroi
Well, Google and others crawled and caught many of the dups before we even knew it!
Since then (30 days ago) a permanent redirect was put in the .htaccess file of A to re-direct each page from A to the same page on B.
BUT,
- Google still shows all the A and B pages (although the B pages are indexed with title and description as they should be).
- Yahoo's index corrected itself fastest 14 days) for the internal pages, but it still shows the home page of A in it's index.
- MSN's index corrected itself quickly also (20 days) for the internal pages, but it also still shows the home page of A in it's index.
QUESTIONS:
1) How long will it take for Google to use the permanent re-direct and remove the site A pages from it's index?
2) Why does Yahoo & MSN remove all site A internal pages, but continue to leave (index) the site A index page?
Just so you have it, here is the .htaccess entry on site A:
Options +FollowSymLinks
RewriteEngine on
Redirect permanent / [site_B.com...]
When you use the correct rewrite rule to redirect a whole site, it usually takes several months for everything to work right. The first effects are usually seen within days.
For example page I can tell you that this page
[site_A.com...] is indexed in Google and if you click on it you WILL get directed to
[site_B.com...]
(2) Thanks for the several months estimate. I'll try to be patient.
(3) Now, if say the re-direct is only redirecting the index page of A to the index page of B, why is it that Yahoo & MSN have already removed all pages BUT the index page?
Just so you have it, here is the .htaccess entry on site A:
Options +FollowSymLinks
RewriteEngine on
Redirect permanent / [site_B.com...]
Options +FollowSymLinks
RewriteEngine on
RewriteCond %{HTTP_HOST} ^theoldsite.com [NC]
RewriteRule ^(.*)$ http://www.thenewsite.com/$1 [L,R=301]
RewriteCond %{HTTP_HOST} ^www.theoldsite.com [NC]
RewriteRule ^(.*)$ http://www.thenewsite.com/$1 [L,R=301]
and this on the new site:
Options +FollowSymLinks
RewriteEngine on
RewriteCond %{HTTP_HOST} ^thenewsite.com [NC]
RewriteRule ^(.*)$ http://www.thenewsite.com/$1 [L,R=301]
I learned the method I had for the permanet re-direct somewhere in this forum and thought I had it right.
Now you have me confused and wondering? I'm not sure what to do. Thanks for trying to help though.
Is there a book out that covers this stuff? I find stuff on the forum and the web, but the solutions are always VERY different...
Options +FollowSymLinks
RewriteEngine on
Redirect permanent / http://www.site_B.com/
Redirect permanent / http://www.site_B.com/
Options +FollowSymLinks
RewriteEngine on
RewriteCond %{HTTP_HOST} ^(www\.)?theoldsite\.com [NC]
RewriteRule (.*) http://www.thenewsite.com/$1 [R=301,L]
Jim
Just to make sure... a final check... using what you said:
Options +FollowSymLinks
RewriteEngine on
RewriteCond %{HTTP_HOST} ^(www\.)?theoldsite\.com [NC]
RewriteRule (.*) [thenewsite.com...] [R=301,L]
Will a vist to the following "oldsite" URL get properly re-directed to the same page on the "new site" URL :
1) [theoldsite.com?...]
2) [theoldsite.com?...]
3) [theoldsite.com...] ?
4) [theoldsite.com...]
Google loves to keep 404 and forwarded pages for ages. I know I'm losing money because the deleted page shows on the indent, and sometimes people pick that one...only to be surprised by the 404.html page. Average Joe is not going to know what "supplemental" means.
I ended up recreating the 404 pages and redirecting them to similar pages to avoid this. A pain if you have lots of pages, but at least users get something when they click on the link.
The code I posted is intended for use in .htaccess for the case where both domains resolve to the same DocumentRoot directory. If you were able to use the Redirect directive, which makes no provision for conditional execution based on domain (HTTP_HOST) name, then this implies that your domains resolve to different DocumentRoots. Otherwise, you would have created an 'infinite' redirection loop.
So, since Redirect worked for you, your domains must resolve separately.
jgilbert,
Rather than waiting around for me (or someone) to answer, it might be faster to test the code, or analyze how it works by using the Apache documentation. It will do exactly what you describe, but of course, comes with no warranty.
Jim
All of the pages from site A have now been removed from the indexes at Google, Yahoo and MSN leaving only the pages indexed of site B as wanted ---- EXCEPT THE DEFAULT PAGE!
Redirect permanent / [site_B.com...]
(in the .htaccess file definitely works.)
But, for the life of me I can't figure out why all 3 SEs didn't remove the default (www.site_A.com) from their indexes.
Yahoo took about 15 days, MSN about 20 and Google about 45.
Appreciate all the help guys, but will continue to question (until it's removed) why the SEs removed all the internal pages, but left the default?
For a site that has had a 301 redirect in place for a year, the "olddomain" is still listed in Google as a URL-only result. I guess that their spider tests the URL every so often to see if it still redirects or has gone 404 or has now got its own content. Seems reasonable to me.
Seems reasonable to me. Well, seems reasonable he would check, but not reasonable he would continue to index the "URL-only result".
The "URL-only result" you refer to is simply a URL with an associated page name (like index.html, etc.) which makes it really NO different than an internal page.
The SEs (and I hope they read this) should not (under a proper 301 re-direct) treat the default URL page any differently than they would the internal pages when it comes to showing them in the index.