homepage Welcome to WebmasterWorld Guest from 54.161.246.212
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Google / Google News Archive
Forum Library, Charter, Moderator: open

Google News Archive Forum

    
Reasons why Google might ignore a site?
Mike521

10+ Year Member



 
Msg#: 20765 posted 5:35 pm on Dec 22, 2003 (gmt 0)

Hello all, I'm new to these forums

Just looking for information on why Google might decide to ignore a site for several months. I'll give some info --

Currently google only looks at the index page and goes NO further. index page has text links, image links, etc. It's PR is 5/10, and most internal pages have a PR of 4/10. However a saturation check comes up showing only the homepage listed in Google. Furthermore I've checked the logs, and googlebot has only ever requested robots.txt (there is none) and the homepage. There are no strange or robot meta tags, only keyword, description, and abstract, which are all probably ignored by google anyway.

I've emailed google and asked if the site was banned, and they replied that it was not banned, and to just wait for the googlebot to crawl the site.

Nevertheless, since roughly june of this year, the site has never been deeply crawled by google.

Other engines seem to crawl it ok, but not spectacular. Inktomi is the best so far, with 40 pages crawled. Alta Vista and Alltheweb only index the pages I submit, but go no further. Ask Jeeves (Teoma) seems to crawl all pages.

Any other info... the site is roughly half html and half cfm pages.. it has a site map in text, linked immediately in text from the index page..

anyway any information and suggestions would be greatly appreciated. This has been stumping me for a while now. Thanks in advance

 

icebane

10+ Year Member



 
Msg#: 20765 posted 10:11 am on Dec 23, 2003 (gmt 0)

I'm currently experiencing the same thing... however it has only been 3 weeks. Only the front page is indexed.
The site will definitely have a PR of 6 given the other sites I own that link to it.

SlyOldDog

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 20765 posted 10:16 am on Dec 23, 2003 (gmt 0)

You say internal pages are PR4 but then you say they are not listed in Google. Which is it to be?

Is your site dynamic? That can cause major headaches.

Netizen

10+ Year Member



 
Msg#: 20765 posted 10:37 am on Dec 23, 2003 (gmt 0)

ou say internal pages are PR4 but then you say they are not listed in Google. Which is it to be?

The toolbar estimates the PR of pages not in the index - typically PR of one less than the page.

As for the original problem - are the links of the front page static or dynamic? If they are dynamic do they have id=blah in the links, or do you use sessionids?

Essex_boy

WebmasterWorld Senior Member essex_boy us a WebmasterWorld Top Contributor of All Time 10+ Year Member



 
Msg#: 20765 posted 10:48 am on Dec 23, 2003 (gmt 0)

I had this problem with my site, submitt ALL your pages to google for indexing that was the only way I found.

SlyOldDog

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 20765 posted 10:55 am on Dec 23, 2003 (gmt 0)

>>The toolbar estimates the PR of pages not in the index - typically PR of one less than the page

I haven't seen this since 2002 except for dynamic sites.

superscript



 
Msg#: 20765 posted 11:02 am on Dec 23, 2003 (gmt 0)

Essex_boy

Submit all the pages - that's a blo*dy good idea - certainly worth a try. I'm having this index-only crawl problem at the moment. Index is PR 7, rest of site PR6. It's been up for two years and previously deep crawled, but nothing for months. I was having non-www, www duplicate probs., so I searched for and corrected all the non-www links on the web I could find, and also set up a proper 301. What's happening is the pages listed as non-www are gradually being dropped, but without a proper crawl they're not being reinstated as www. I'm now down to only 12 pages in the Google index. It's been one hell of an end to the year :(

p.s. all static, nothing fancy, standard links, intuitive navigation.

p.p.s.
The toolbar estimates the PR of pages not in the index - typically PR of one less than the page

Concur - missing pages still have an extimated PR along these lines.

superscript



 
Msg#: 20765 posted 11:50 am on Dec 23, 2003 (gmt 0)

I've just manually submitted a small selection of 'pivotal' sub-pages to try to get the bot to remember there's more to my site than the index page - I'll get back with the results - if any.

Mike521

10+ Year Member



 
Msg#: 20765 posted 1:22 pm on Dec 23, 2003 (gmt 0)

Thanks for all the suggestions, much appreciated :)

about the links, all links are static, just regular old html, nothing fancy at all

about the PR issue, yep I guess the internal pages are just an estimate, because they're not listed when I do a search for them in google, yet they all have a PR of 4 or so

perhaps I'll submit the site map and see if that gets me anywhere..

essex boy, how long was your site in that state for, before you decided to submit each page individually? what have your results been since then?

I had been under the impression that submitting a whole slew of pages under one domain could be viewed as spam?

superscript



 
Msg#: 20765 posted 1:38 pm on Dec 23, 2003 (gmt 0)

I had been under the impression that submitting a whole slew of pages under one domain could be viewed as spam?

Yeah, this worried me a bit. But it's likely to be an automated process, and while the guidance says it's only necessary to submit the index (the spider will do the rest ;) - there's nothing in the guidance that actually prohibits the submission of sub-pages / multiple pages as far as I can see. Anyway, to be on the safe side I just submitted a selection of half a dozen select pages that are well linked to the rest of the site. Added: Don't forget its a manual submission tool provided by Google - they frown on automated submission software, but it's highly improbable you could be penalised for using something they provide.

nileshkurhade

10+ Year Member



 
Msg#: 20765 posted 1:47 pm on Dec 23, 2003 (gmt 0)

Does the site have a sitemap.? Maybe this will also help.

SlyOldDog

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 20765 posted 1:53 pm on Dec 23, 2003 (gmt 0)

I don't really get it. How can the pagerank estimation still be working?

Unless someone is using a different toolbar from me, this gives PR0.

[yahoo.com...]

I see PR0 when I type any unindexed page on any existing URL, where it has a special page for 404s or not.

What you are seeing must be something else.

superscript



 
Msg#: 20765 posted 2:02 pm on Dec 23, 2003 (gmt 0)

Hi SlyOldDog, I just tried one of my pages which is definitely not in the index and it shows PR6. In the past I've noticed odd things about PR display, for example landing on one of my pages with something like /?src=fav appended would show zero PR, but navigating to that same page from within the site would suddenly show it up.

Does this page

[yahoo.com...]

actually exist? If not, surely this is an entirely different case.

SlyOldDog

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 20765 posted 2:13 pm on Dec 23, 2003 (gmt 0)

No, that Yahoo page doesn't exist. That's the point. It has not been crawled yet but still shows PR0 - not an estimation based on PR10 from the home page.

I've noticed that if you go to a page with a real variable name which has PR, like:

www.blah.com/index.php?id=25

if that page has pagerank, you can type in an imaginary value, and Google will still give a PR score. eg.

www.blah.com/index.php?id=anynumber

But if you type:

www.blah.com/index.php/imaginarypage

you will get PR0

superscript



 
Msg#: 20765 posted 2:24 pm on Dec 23, 2003 (gmt 0)

I thought we were discussing PR showing on pages that actually exist, within an indexed site, but that are not currently indexed in Google. It's of little surprise that a non-existent page has no PR.

Interesting though - if the PR of a page can be estimated from the general PR of a site, how does Google 'know' whether that page does actually exist? Unless there's some remanant of its prior existence in the DB somewhere.

SlyOldDog

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 20765 posted 2:28 pm on Dec 23, 2003 (gmt 0)

In the past, Google would show a grey bar (unindexed) for unindexed or unranked sites. PR0 was stritcly for indexed pages.

Netizen

10+ Year Member



 
Msg#: 20765 posted 2:32 pm on Dec 23, 2003 (gmt 0)

I just added a new page on my site and it shows zero PR. This is using toolbar 2.0.102-big/en (GGLD). Got to keep up :-)

superscript



 
Msg#: 20765 posted 2:36 pm on Dec 23, 2003 (gmt 0)

It's been my understanding that the distinction between grey/white toolbars hasn't been meaningful for months.

Essex_boy

WebmasterWorld Senior Member essex_boy us a WebmasterWorld Top Contributor of All Time 10+ Year Member



 
Msg#: 20765 posted 3:01 pm on Dec 23, 2003 (gmt 0)

Ok, my site is HTML nothing fancy the site was set up earlier this year around March and only the index page was listed although it was updated by Google it didnt go any deeper.

I submitted individual pages via googles own submit page - never use a commercial submitter, I did this on the 8th of December and by the 15th they were listed. Breath takingly quick.

Google does NOT punish the multiple submission of a single URL so I see no reason why you would be punished for submiting several URLS from the same site.

I had no problem. All pages rank well athough im yet to receive any page rank for them.

superscript



 
Msg#: 20765 posted 5:01 pm on Dec 23, 2003 (gmt 0)

Essex_Boy

Confirmed: Google states: "We do not penalize sites for 'over-submitting'"

Whilst at the same time hinting 'but don't bother with submitting' ;)

Essex_boy

WebmasterWorld Senior Member essex_boy us a WebmasterWorld Top Contributor of All Time 10+ Year Member



 
Msg#: 20765 posted 5:05 pm on Dec 23, 2003 (gmt 0)

well yes that was probably true pre MAY update since then Google hasnt been anything other than unpredictable. Has it?

Submit away and see what happens.

div01

10+ Year Member



 
Msg#: 20765 posted 5:56 pm on Dec 23, 2003 (gmt 0)

I got a couple of sites in the same predicament. I guess I'll be hand submitting all the pages this afternoon.

Essex_boy

WebmasterWorld Senior Member essex_boy us a WebmasterWorld Top Contributor of All Time 10+ Year Member



 
Msg#: 20765 posted 6:37 pm on Dec 23, 2003 (gmt 0)

Yeah its great fun really! Put a CD on and submit away!

superscript



 
Msg#: 20765 posted 6:51 pm on Dec 23, 2003 (gmt 0)

LOL! Pity you can't burn it to CD and send it to the Plex

Mike521

10+ Year Member



 
Msg#: 20765 posted 7:10 pm on Dec 23, 2003 (gmt 0)

LOL

thanks again guys, glad that I'm not the only one experiencing this problem, and glad to see that google was relatively fast for you with indexing your hand submitted pages

my site does have a site map, so I'll submit that, and a handful of other important ones, and see if the spider makes up its mind to dig a little further this time. If not, then I'll be hand submitting every page in the site lol

slydog, I have no idea why the pages inside the site are showing up with a PR but not in the index, but nevertheless, that is what is happening. The idea about it being an estimate sounds good to me. The pages are definitely not indexed though, google confirmed this when I emailed them and asked them if they had a problem with the site.

Mike521

10+ Year Member



 
Msg#: 20765 posted 5:18 pm on Jan 5, 2004 (gmt 0)

well, I tried submitting pages within the site and still they are being ignored by google. I sent them an email today to try to get to the bottom of this :/

DJFlite

10+ Year Member



 
Msg#: 20765 posted 5:45 pm on Jan 5, 2004 (gmt 0)

Every page on my site gets indexed almost every week, but as of two weeks ago I stopped being listed on google completely. Even if I type in www.mydomain.net it doesnt come up. Whats the deal with that?

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google News Archive
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved