Home / Forums Index / Google / Google SEO News and Discussion
Forum Library, Charter, Moderators: Robert Charlton & aakk9999 & brotherhood of lan & goodroi

Google SEO News and Discussion Forum

    
Some new pages not being indexed
gsmith




msg:3894884
 8:52 pm on Apr 17, 2009 (gmt 0)

On my site I cover a number of businesses, each having its own website. I had a writer produce my own original text describing each business, consisting of one paragraph containing around 120 words. GWT reports that there are no duplicate content issues. Nevertheless, many of these pages have not been accepted into the index, while some others have.

The material for these one-paragraph descriptions was derived mainly from information available on each business's own website, so the word "rehash" might be appropriate.

Could this explain why many of these pages have not been accepted into the index?

 

tedster




msg:3894985
 11:29 pm on Apr 17, 2009 (gmt 0)

Just to clarify, are you saying that these missing pages are not showing up in the site: operator results?

Additionally, has Googlebot even spidered these new pages? There's a big difference between spidered but not included, and never spidered.

gsmith




msg:3895002
 12:22 am on Apr 18, 2009 (gmt 0)

Just to clarify, are you saying that these missing pages are not showing up in the site: operator results?

Additionally, has Googlebot even spidered these new pages? There's a big difference between spidered but not included, and never spidered.

They're not showing up using the site: operator.

I've submitted a Sitemap file, which has been accepted without error, and Googlebot has been through my site.

On my GWT Sitemaps page it displays

Total URLs: 316
Indexed URLs: 63

Interestingly, the site: operator shows 46 - 50 pages, but I know that this issue has been discussed elsewhere.

Thanks for your interest.

tedster




msg:3895003
 12:27 am on Apr 18, 2009 (gmt 0)

How new is the site, and how recently were these new pages added? Have you got some backlink love from around the web?

gsmith




msg:3895011
 1:12 am on Apr 18, 2009 (gmt 0)

The site is brand new, and I do have some backlinks, but only to the home page. The home page shows up on page 6 for the most relevant keyword.

I first submitted the sitemap two and a half weeks ago, and have been resubmitting the sitemap as I get new input from the writer since then. While attempting to answer your question I realized that the only business-specific pages that have been indexed are those that Google knew about from before I started submitting the sitemap, but certainly not all of them. The order that they're being added is in accordance with the order that internal links to them were added; so it looks like it's a simple matter of patiently waiting.

gsmith




msg:3895020
 1:29 am on Apr 18, 2009 (gmt 0)

PS: I added a sitemap file because almost all of those business-specific pages are accessible to the user only via some Web 2.0 stuff. There are no links to most of those pages, either internal or from outside.

Does this mean that Google might decide not to index them? Googlebot has been through all of the pages in my sitemap file, so it definitely knows that they're out there.

tedster




msg:3895059
 3:20 am on Apr 18, 2009 (gmt 0)

Is this some kind of Ajax scripting where the content only gets served to the visitor after an on-page action makes a new server request?

Whatever the technology is, I'd suggest running Xenu Link Sleuth on the site and seeing if it can find your new pages.
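
For anyone who wants to script the same kind of check, here is a rough sketch in Python. It is not Xenu Link Sleuth, just an illustration of the idea: follow plain `<a href>` links from the home page the way a simple spider would, then list the sitemap URLs the crawl never reached. The function names (`sitemap_urls`, `crawl`, `unlinked_pages`) are made up for this example, and the `fetch` argument is an assumption: you supply whatever function downloads a page and returns its HTML. No JavaScript is executed, so pages reachable only through scripted actions should show up as unlinked.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag, urlparse
import xml.etree.ElementTree as ET


class LinkParser(HTMLParser):
    """Collects href values from plain <a> tags, as a simple spider would."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def sitemap_urls(sitemap_xml):
    """Return the set of <loc> URLs in a sitemap, ignoring the XML namespace."""
    root = ET.fromstring(sitemap_xml)
    return {
        el.text.strip()
        for el in root.iter()
        if el.tag.endswith("loc") and el.text
    }


def crawl(start_url, fetch):
    """Breadth-first crawl of same-host pages reachable through <a href> links.

    fetch(url) is a caller-supplied function (hypothetical here) that returns
    the page's HTML as text, or None if the page cannot be retrieved.
    """
    host = urlparse(start_url).netloc
    seen = {start_url}
    queue = deque([start_url])
    while queue:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        parser = LinkParser()
        parser.feed(html)
        for href in parser.hrefs:
            # Resolve relative links and drop any #fragment part.
            link = urldefrag(urljoin(url, href)).url
            if urlparse(link).netloc == host and link not in seen:
                seen.add(link)
                queue.append(link)
    return seen


def unlinked_pages(sitemap_xml, start_url, fetch):
    """Sitemap URLs that the link-following crawl never reached."""
    return sorted(sitemap_urls(sitemap_xml) - crawl(start_url, fetch))
```

Anything `unlinked_pages` returns is a page that is listed in the sitemap but has no ordinary link path from the home page. Passing `fetch` in as a parameter keeps the crawl logic separate from the downloading, so it can be tried against a small dictionary of test pages before pointing it at a live site.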

wizboy




msg:3916865
 12:41 am on May 20, 2009 (gmt 0)

I can't believe the spiders can read and differentiate a re-written paragraph of content. That seems very difficult to do with a computer.


All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved