homepage Welcome to WebmasterWorld Guest from 54.166.228.100
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Visit PubCon.com
Home / Forums Index / Google / Google News Archive
Forum Library, Charter, Moderator: open

Google News Archive Forum

    
621 Pages Indexed But Only 190 Show
Are the other pages accessible to Google searchers?
Whoa




msg:41020
 7:36 pm on Mar 19, 2004 (gmt 0)


A site I work on has over 600 pages of content covering personal finance, real estate, and consumer issues.

When I do a search for "site:www.sitename.com wordthatisoneverypage" is says there are 621 pages, but when I start going through the SERPs, I can only get to about 190 of them.

Does that mean the other 431 pages are not accessible to Google searchers? Why aren't they all displayed?

Thanks for any insights and/or advice concerning this site. It's a really great resource and deserves to do better in the search engines.

p.s. one thing I had heard is that id= in .asp pages is not liked by Google? Why on earth would that be?

Thanks!

 

mcavic




msg:41021
 9:42 pm on Mar 21, 2004 (gmt 0)

It means that the remaining pages are so similar to the ones already displayed that Google doesn't think you need to see the rest.

If you search for a word that's only on some pages, you should see those pages. Also, you can add &filter=0 to the end of the Google URL to see all of them.

id= in .asp pages is not liked by Google? Why on earth would that be?

id= is disliked in all URLs because it looks like a Session ID. Session IDs are bad because they can cause crawlers to see tons of duplicate content on your site.

abates




msg:41022
 3:04 am on Mar 22, 2004 (gmt 0)

I discovered a while ago that some pages on my site weren't showing up for a site: search.

"site:sub.domain.com" showed 190 of 243 pages
"<wordoneverypage> site:sub.domain.com" showed about the same amount, but a different subset of the pages!

And I could search for the missing pages seperately as well, and they came up.

Robert123




msg:41023
 3:11 am on Mar 22, 2004 (gmt 0)

I noticed the same phonomenon this week with my site--or since i checked since the most recent update.

Whoa




msg:41024
 12:08 pm on Mar 22, 2004 (gmt 0)

With &filter=0 I still only see the 190 pages. That actually doesn't display ALL of the results.

Also, the "missing" pages are completely unique - every article is entirely different from the other articles on the site. So it seems unlikely that Google would deem them as being too similar to other pages to be worth showing.

I've put the site in my profile if that helps.

Thanks also for the information on ID= -- I wouldn't think that would stump Google but I will probably change it -- better safe than sorry.

My own theory is just that the pages are too fat, and that if we thin them down, maybe Google will show more than 190.

Whoa




msg:41025
 10:52 pm on Mar 22, 2004 (gmt 0)


Oh well, I take it nobody really understands why this happens - why only 191 of the 621 results can be accessed.

If you find out, please let us know.

Robert123




msg:41026
 6:08 am on Mar 24, 2004 (gmt 0)

I wanted to bump this back up. Any idea of why pages show in the index but are not counted in an allurl: search?

AzCowboy




msg:41027
 6:08 pm on Mar 24, 2004 (gmt 0)

This may be a symptomatic result surrounding a larger issue affecting many sites. Please review thread
[webmasterworld.com...]

jazzx




msg:41028
 7:18 pm on Mar 24, 2004 (gmt 0)

whoa

hi,
the javascript menu is a bit disturbing to me as long as we don't know that it is really indexed.
split the sitemap in several parts

Whoa




msg:41029
 11:15 pm on Mar 28, 2004 (gmt 0)


Good advice. I will split the site map up. Been meaning to do it for a while because it's over the 101k limit.

We are also going to try to shrink down the page size and clean up the files - pull the menus into a .js file.

Thanks to all who looked at this.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google News Archive
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved