Forum Moderators: open

Message Too Old, No Replies

Will Google Index All 50,000+ Pages Of My New PR6 Site?

         

wchan07

12:29 am on Jun 11, 2004 (gmt 0)

10+ Year Member



Hi,

I just created a website and submitted it to Yahoo Directory. It currently has PR6 with 10 inbound links from Various Yahoo Directory Links.

All Links now point to the INDEX page of my site.

I want to eventually build 3 levels deep with about 10,000+ Pages

I would like my site to look like

index -> directory 1 (8 pages) -> directory 2 (26 pages A-Z alphabetical navigation) -> directory 3 (25-100 items under each letter)->lowest level (5 pages)

So with and average of 50 items in the last directory i'm looking at about 8x25x50x5 pages or 50,000+ pages

If I build all the pages will google crawl AND index them?

I plan to USE PHP but NO DYNAMIC content.

I've heard rumors that google will only take the first 500-1000 pages starting from the index page if you don't have a lot of inbound links going to different parts of you site.

Is that true? I've never built anything with more than a few hundred pages before, let alone more that 50,000 pages. Can I build the site and trust that Google will get all my pages in a reasonably quick amount of time (i'm hoping less than 6 months)

Thanks

doc_z

11:16 am on Jun 11, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



If I build all the pages will google crawl AND index them?

Yes (normally, PR6 should be high enough).

index -> directory 1 (8 pages) -> directory 2 (26 pages A-Z alphabetical navigation) -> directory 3 (25-100 items under each letter)->lowest level (5 pages)

I would add some deep links.

I've heard rumors that google will only take the first 500-1000 pages starting from the index page if you don't have a lot of inbound links going to different parts of you site.

That's incorrect. I've seen sites with 100.000 pages with just one PR7 incoming link.

Google will get all my pages in a reasonably quick amount of time (i'm hoping less than 6 months)

Normally, there shouldn't be a problem. I would gess that most of the pages are spidered within 2 months.

mars9820

11:21 am on Jun 11, 2004 (gmt 0)

10+ Year Member



probably most of your page will be crawled over time however no idea about time frame.

You don't go that many levels deep so it should be possible to get everything spidered.

But as you said you only have few links. Try to get some deeplinks when you building your site. That will really help you.

Also sitemaps will help however 50,000 pages...uhmmm...not easy to create sitemaps for them.

suggy

5:35 pm on Jun 11, 2004 (gmt 0)

10+ Year Member



This kind of question always staggers me. How the heck do you go about creating 50,000 pages of WORTHWHILE content? Yes, I understand that you have lots of products, etc. But in the final analysis unless it's riddled with duplicate content and a dozens of combinations of the same database fields, how can you get anywhere near this number?

From your maths, it appears that you have an average of 50 items per letter of the alphabet and you want 5 pages about each. That's 26 x 50 X 5 = 6500. Do you then have 8 separate areas of your business, to make 52,000 products?

I guess what I am asking is, do you really have 52,000 products or are you just trying to make a 'fat' site?

Nothing against you, but if you don't, then I personally don't think Google should bother. The algo needs to get better at discriminating between real content and 'fat'.

Just my opinion...

Suggy

ALbino

7:23 pm on Jun 11, 2004 (gmt 0)

10+ Year Member



I have over 50,000 pages with 50,000 items and genuinely unique content on each page describing each item. It's not that farfetched. Think of how many Amazon has.

wchan07

4:32 am on Jun 12, 2004 (gmt 0)

10+ Year Member



Oh,

I'm an avid music fan and have lots of free time. I am creating a lyrics database. It's mostly text based.

Thanks for your insight

ThomasB

9:42 am on Jun 12, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I would start thinking about getting a higher PR when you have more than 100k pages, but 50k shouldn't be a problem and imho they should be indexed within 6 weeks if the code is spider friendly.