Forum Moderators: open
a) will google see those new urls (assuming the update is not within the next 2 days)
b) even tho deep got some of the old ones, (which still work), will it recognise the new ones and move over
c) IMPORTANT: Should i use
www.widgets.com/something/index.htm
or just
www.widgets.com/something/
It's amazingly refreshing to no-longer display dynamic urls!
I did the same thing and here are my observations (although I did it with a servlet mapping system).
1) leave the old pages up for one deepcrawl update cycle and then phase them out. Seems that Google may get unhappy if your site radically changes.
2) It resulted in URLS that were easier to send to people and that was a Good Thing
3) It does not seem to have made any overall difference to the number of URLs in Google at the end of the day. Basically Google was indexing the parameter versions too.
Exactly the info I was after tho, thanks, I'm just hoping in the long run it'll also pay off not having any file extensions showing up on my site.. as in that URI guidlines document (all urls should stay the same etc etc)
Every shift in tech has always meant a new set of extensions.. getting messy, im out of that game now..
perhaps if i PERM redirect the old urls instead of letting them through...
<MUTTER><MORE REWRITTING!>
<edit>I Got the deepcrawl time wrong</edit>
[edited by: hitchhiker at 3:55 pm (utc) on April 6, 2003]
I think most IIS admins should look into this, it's just one of those things. As much as GG assured us that they try to crawl all pages, (and i'm sure they will soon) my aspx pages never seemed to do that well, I know this will change things.
Regular Googlebot has come a couple times since our change, but we haven't been so lucky, none of the new urls have been picked up yet, only thing I see different, is they picked up some new text on our index page.
Catch all old URLs that are not new URLs:
RewriteCond %{REQUEST_URI} !^/sitearea(.*)$
RewriteRule ^/cgi-bin/script.ext(.*)$ /sitearea$2 [R=301,L]
Now catch all new addresses and bring them back to the original ones internally:
RewriteRule ^/sitearea(.*)$ /cgi-bin/script.ext$1 [L]
so all existing google links turn smoothly into the new addresses, and Googlebot is already busy following the 301s too.
BTW: what exactly does google do with a 301? consider it a link to the target page? index target page but don't consider link?
SN