Forum Moderators: open
A few weeks ago, we did a mod_rewrite, and have our new links off our index page.
How is Google and other spiders finding and crawling these pages? From search engine results?
We want our new pages crawled. (new pages are off the index page, and on our site map)
How is Google and other spiders finding and crawling these pages?
While you may no longer have links to those pages, other sites may.
I've noticed some pages which were removed years ago getting crawled. This is the only reason I can come up with. Someone is still linking to them.
If the pages which contain the link(s) are a below a PR4, it may be hard to located them and ask them to change URL's.
Doing a search via link: to any of the ones their spidering, produces no results
I guess I won't worry about it like mcavic
I will just look at it as doorway pages :)
I will just look at it as doorway pages :)
Yes, absolutely. If the page returns either a 404 with a custom message, or a 301 redirect, then it acts like a doorway, but it's legitimate and can't be penalized.
On my site, when I changed all my links, I used a 301 redirect to get the spiders to update the urls, then when the spiders are done, I'll use a 404 with a link to get people to update their bookmarks.