Forum Moderators: open
It takes more PageRank for a large site to be indexed fully, so using clean URLs (eg. no "?" characters") is important. Often, it's useful to 'promote' the most important bottom level pages up a level. This helps Googlebot (and human users) find them more easily. Sometimes, a 'specials' or 'most popular' link on the home page is appropriate.
* I'm not clear on whether Googlebot is programmed to follow PageRank, or whether Googlebot's crawling and PageRank's 'random surfer' model just happen to coincide as one would expect.
New ecommerce site launched 2 months ago has 300 static html pages and 3,000 template driven pages.
ie - 3,300 pages to crawl - link structure was ok I thought with each of the 3,000 dynamic pages being well-linked to.
However only the main 300 pages have been crawled.
Looks like I should get some PR...
Any other way to encourage Gbot via on-page linking structures?
J
Very important however is to organize the site map in a manner readily usable for the user, that it makes sense. If it makes sense to the user then it'll make sense to the bot.
I enjoy making site maps.
Thing is though, theres so many pages on the site that a static list of 200 urls is only 2 clicks from the home page and I can't get these 200 urls crawled.
I agree that site maps are vital (& easy to do) but I just don't think it will help here because the link structure is exactly the same as the most perfect sitemap I could wish for.
Am a bit lost :(
J
"?" characters are not mandatory in the URLs of any World Wide Web context I'm aware of. I would hassle the server vendor, hosting company, programmer, CMS creater or whoever is responsible to get it changed. Currently, I have two programmers working on applications that normally have "?" characters, making them run smoothly without the "?" regardless of cookies and Javascript support in the browser. A little costly, but worthwhile.
If you don't have the PR to overcome the slight "?" barrier to getting crawled; then IMO you need more PageRank.
I am using this to let most crawlers see my internal pages.
If you are on apache use .htaccess
if you are on IIS then move to apache ;). There is a tool for IIS but it is not as valuable as mod_rewrite
Then add some listing pages to those template driven pages and add the URL of the listing pages to your site map.
So you have this structure
sitemap
listing pages (with navigation and html url if possible)
emulated template driven pages into simple html address
Luck!
takagi is right about just getting more links; the PR should flow around the site (assuming that there's enough of it to flow). DaveN's suggestion will do more than give you PR, those links coming straight into the most valuable deep pages are worth more than the PR benefit the deep pages would get via the home page.