Forum Moderators: open
My sites got some of its best pages in a folder 3 sub dir's down but they haven't been indexed, and google's been really great about crawling the whole site. And it s a biggy too!
It hasn't crawled anything from domain.com/sub/sub/thisparticularfolder.
Its crawled other pages in other folders that were 3 tiers down, and we were stumped royally as to why it hadn't crawled them. Especially when there were links to pages in the uncrawled folder on the home page & other key site pages. (PR 6's & 7's)
The only thing we could find that was different about this uncrawled sub dir, was that it didn't have an index page.
Could this be the problem? Is Google crawling hierarchically and not crawling folders unless they can start with the index?
Your thoughts are warmly appreciated
The thing that bugs me is that its crawled other pages that are 3 tiers down in other folders, which seem to have very simialr link relationships with the major pages on the site.
Interestingly enough it seems to be doing a daily crawl of everything in fisrt and second sub dir's, and thats enormous.
Like I said...stumped...royally!
My site is all dynamically generated. There was a strong relation between the PR of the site and the number of pages shown in the google-index.
chronix over the past months:
PR4-5 -> around 4400 pages
PR2-3 -> around 2200 pages
PR5-6 -> around 6600 pages
Th interesting thing is, I got a link from a PR7-page mid-month, and a couple days later the number of pages in the index went up and stayed there until the actual google-index-update, when I got the new higher PR myself.
The concrete numbers may well have to do with the structure/link-depth of our pages, so that differently organized pages get different numbers, but I think the general idea that the PR relates to the indexed-pages is definetely applied IMHO.