Forum Moderators: open
The site home page has a PR4. It was a PR5 before Dominic/Esmerelda so I wanted to add the pages to boost PR,traffic and sales. There are thousands of new pages. Each is indexed by an index page with 100 links. The master index is one level below the home page. The master index was already in Google. The new pages have not appeared in the index. I've been waiting since late July.
Here's a diagram:Home Page (PR4)-
¦
New Pages Master Index (PR3)-
¦
Index Page X (PR0)-
¦
Any page from any index Page (PR0)
In theory,the 5000 pages at the lowest level should have a PR1. They all link to their index page and the Master index. The first 1,000 pages are in Google's index. 4,000 are not. The first 1,000 pages have been in place for over a year. I added 4,000 from June-August.
My webhost and I spent weeks figuring out the right way to exclude Googlebot from the virtual urls like #1 and to do a permanent redirect to the domain urls #2. Google should have deleted all the #1 type urls and redirected to the #2 urls. This should have forced an add for each virtual type url that was in the index. Once in,all the urls should have been crawled and indexed.
These are all static html files. I've only just started looking into php sites.
so I wanted to add the pages to boost PR
Arnett, from what I understand, you need to get links to increase PR. Increasing content just gets you more real estate in the search engines.
There are many single page sites out there that have decent PR, and it comes from backlinks. (www.pr10.com, for example)
As far as your main problem goes - has GoogleBot spidered all the new pages? And if so, when?
My logs don't show the pages that Googlebot spiders. The latest logs show Googlebot with over 10,000 hits to the domain. They may finally be getting around to adding the new pages.
I put an SSI date call in the footer of pages to check the cache image in Google to gauge a crawl date. As I recall from the Pre-Dominic days,if a page has a PR less than 4 it gets visited every 90 days. If it has a PR4 or higher it gets visited more often. This may be the cause of the delay.
Well, in your original post you said GoogleBot had been all over your site
"All over" means that the Googlebot has spidered files in every directory in the site. My logs don't report who accessed the file,just that it has been accessed. Since June I have added 4,000 static pages to the site. I have been watching the record of Googlebot accesses and they have increased monthly right along with the number of new pages added. Even though the number of Googlebot accesses has grown monthly with the number of new pages added NONE of the new pages have been added to the index.
...but when it will appear in the index is anyone's guess lately.
At least you're addressing the actual issue. Thanks for your valuable insight.
I put an SSI date call in the footer of my pages. The cache shows dates from late August to early this month. In the past I could tell the spider date by this even though the page didn't show up in the index until the "dance". The pages are getting spidered now and will probably be done entering the index in the next few weeks. Now all I have to wait for is for Google to get around calcuating all the backlinks and to update PR.
Thanks to all of you for your helpful comments.
I thought about it some and did a search for "site:www.domain.com domain". The search results header said that there were 3350 pages found. I did another search for "site:domain.com domain". This time the search results header said that there were 7850 pages found. Only 1000 are listed so I checked them out. There were some www. results included.I put an SSI date call in the footer of my pages. The cache shows dates from late August to early this month. In the past I could tell the spider date by this even though the page didn't show up in the index until the "dance". The pages are getting spidered now and will probably be done entering the index in the next few weeks. Now all I have to wait for is for Google to get around calcuating all the backlinks and to update PR.
Thanks to all of you for your helpful comments.