Forum Moderators: open
I have emailed Google's help address about re-inclusion, and although the reply is a standard form letter, it does state that we do not have a penalty on our site.
I wouldn't expect Googlebot to index very deep if the highest PageRank on the site is 4. On the other hand, PR4 would normally be enough to get 200 pages indexed with static-looking URLs.
If the URLs are quite complex (e.g. /index.pl?1=foo&2=bar&3=yin&4=yang) then Google isn't likely to crawl so deep. If the URLs have a CGI parameter called id, then I don't think they'll be followed.
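For anyone who wants to sanity-check their own URLs against that folklore, here's a rough Python sketch. The parameter-count threshold and the special-casing of id are assumptions based on what's been reported in these forums, not anything Google has published:

```python
from urllib.parse import urlparse, parse_qs

def crawl_issues(url, max_params=2):
    """Flag URL features that crawlers of this era are said to avoid.

    max_params=2 is an arbitrary guess at a threshold, not a documented limit.
    """
    params = parse_qs(urlparse(url).query)
    issues = []
    if len(params) > max_params:
        issues.append("has %d query parameters" % len(params))
    if "id" in params:
        issues.append("has an 'id' parameter (may look like a session id)")
    return issues
```

Running it on the example above, crawl_issues("/index.pl?1=foo&2=bar&3=yin&4=yang") flags the parameter count, while a static-looking path returns an empty list.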
1) What is the PR of your site?
2) How many backlinks do you have?
3) Any knowledge of having duplicate content?
4) What is the PR of pages that link to you?
5) Any back links from a penalised site?
6) Somebody dropping links to you on blogs or forums?
7) Java on site that may be putting Googlebot off?
8) Previous problems with the web Host or IP?
Sticky me and I will send the URL.
All URLs are static.
Many sites seem to have this problem, the previous thread that was edited is at: [webmasterworld.com...]
One thing you might try doing is spidering the site yourself with Xenu, and doing spider simulator, robots.txt, & server header checks using the tools linked from here. Perhaps they will reveal some clue.
Check out your robots.txt manually, too, to be sure there's no weirdness there. One condition that results in URLs only being indexed (no titles & descriptions) is when a page is linked to but is put off limits by robots.txt. Google indexes the URL, but doesn't spider the page and hence has no more info.
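If you'd rather script those robots.txt checks than click through the tools, here's a minimal Python sketch. The choice of "Googlebot" as the agent and the emphasis on the status code are assumptions based on this thread; check your own server's behaviour:

```python
import urllib.error
import urllib.request
from urllib import robotparser

def robots_allows(robots_txt, url, agent="Googlebot"):
    """Check whether a robots.txt body would block a URL for a given agent."""
    rp = robotparser.RobotFileParser()
    rp.parse(robots_txt.splitlines())
    return rp.can_fetch(agent, url)

def fetch_robots(site):
    """Fetch /robots.txt and report the HTTP status; 403 vs 404 vs 200 matters."""
    req = urllib.request.Request(site.rstrip("/") + "/robots.txt")
    try:
        with urllib.request.urlopen(req) as resp:
            return resp.status, resp.read().decode("utf-8", "replace")
    except urllib.error.HTTPError as err:
        return err.code, ""
```

A 403 on /robots.txt has historically been treated by some crawlers as "disallow everything", which is why the status code alone is worth checking, not just the rules inside the file.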
<edit: Sorry, no email quotes please.>
Well, now I'm lost as to what to do, and I'm facing bankruptcy after five years on the net.
I hope Google get what they deserve for trying to be "GOD". Who do they think they are, "GATES"?
It was people and small companies like myself that got them where they are today; without us there would be no Google.
[edited by: ciml at 12:18 pm (utc) on Jan. 28, 2004]
Previously, you were returning 403 for /robots.txt. Have you seen Googlebot requesting /robots.txt since it started returning 404 (and more recently 200)?
jdurkin, do you see Googlebot in your logs? I don't know if there's a good test for Google crawler connectivity to a server.
Perhaps someone has experience of this from the GoDaddy <-> Google problem a while ago. Maybe Google Translate or a WAP gateway is a good test to see whether the robot can reach a URL?
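One way to answer "do you see Googlebot in your logs?" is to filter on the user-agent. A sketch that assumes Apache's Combined Log Format (adjust the regex if your server logs differently):

```python
import re

# Apache Combined Log Format is an assumption; adapt the regex to your server.
LOG_RE = re.compile(
    r'^(\S+) \S+ \S+ \[([^\]]+)\] "(\S+) (\S+) [^"]*" (\d{3}) \S+ "[^"]*" "([^"]*)"'
)

def googlebot_hits(lines):
    """Return (path, status) pairs for requests whose user-agent mentions Googlebot."""
    hits = []
    for line in lines:
        m = LOG_RE.match(line)
        if m and "Googlebot" in m.group(6):
            hits.append((m.group(4), int(m.group(5))))
    return hits
```

Feeding it your access log shows not just whether Googlebot visited, but which paths it asked for and what status codes it got back, which is exactly the 403/404/200 question above.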
Tested my server for crawling - OK.
I'm knackered, chaps, if something doesn't happen soon. I've just started paying server fees and a lot more bills out of my own bank account, laid off two of my staff, and Google doesn't give a <snip>.
No more sob stories from me chaps.
The worst thing of it all is pure frustration.
[edited by: ciml at 8:46 pm (utc) on Jan. 28, 2004]
I took the site away from our crap hosting company last week, and all of a sudden Googlebot is everywhere and our pages are returning to the index.
Conclusion is that our particular problem was being caused by our host. I am thinking they must have blocked Googlebot but never bothered to tell us.
The lesson I have learned here is how important hosting is.
Thanks to everyone at webmasterworld who took an interest in my nightmare.
Sorry, GoogleGuy, for some of my posts over the last couple of months; it was nothing personal, just pure frustration.
We help you, you just ignore us.
Googlebot had been hitting the IP which I stopped using almost a year back. After I left the old IP... I had several deep crawls on the new IP.
I wonder what made Googlebot go to old IP again.
Whatever; several sites are screwed like this without any clue, and Google says nothing about it.
[edited by: ciml at 1:00 pm (utc) on Feb. 9, 2004]
[edit reason] Please see StickyMail. [/edit]
So this is exactly what happened to you: you change IP, Google spiders it, and then for some unknown reason it starts going back to the old IP. Surely this is a problem.
Google, maybe you should give a little more weight to the actual domain and not the IP.
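A quick way to check whether anything still points at the old address is to compare the domain's current A records with what you expect. This sketch just asks the resolver your machine is configured with, so a cached or stale zone elsewhere could still answer differently to Googlebot:

```python
import socket

def check_a_records(domain, expected_ips):
    """Resolve a domain's IPv4 addresses and list expected ones that are gone."""
    infos = socket.getaddrinfo(domain, 80, socket.AF_INET, socket.SOCK_STREAM)
    current = sorted({info[4][0] for info in infos})
    stale = sorted(set(expected_ips) - set(current))
    return current, stale
```

If stale is non-empty, an address you thought was live has dropped out of DNS; if current still contains the old IP, your host (or a slow zone update) may be the one keeping Googlebot on the wrong server.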
Having pages in with "supplemental result" is worth what? They don't show in searches and, as in my case, were several months old.
With all due respect, GG has been actively involved in the discussions about the multiple URL listings in the SERPs. He seemed very surprised that this was occurring.
He has already flagged the appropriate people at G, and they have agreed that a problem occurred, but it will take a few days for the listings to be removed from the current SERPs.
[webmasterworld.com...]
This thread is about "Pages still in with PR but no Description or Title!" GG has never commented on this issue, which has affected many sites and is still an ongoing problem.
My reference to GG and the same URL appearing on each page of the SERPs was meant to highlight that it is unfair that GG asks webmasters for help but seldom comments on the problems webmasters face.
As far as the same URL appearing on many pages goes, I personally could not care less, but Google cannot have this; it might make it seem that Google is having problems.
*A chance to test my flame retardant armor ready and preparing for any fire to come* :D
Here is the exact quote from GG...
"I talked to the person about identical urls showing up and they found the problem. It will take a binary push, but should be fixed pretty soon (I would guess early to middle of next week)."
Try reading this thread from the start; it's only 2 pages, so it should not take you long. We have all seen the thread about duplicate URLs in the SERPs and don't need a link to find it or to be shown the quote he made. Again, this thread is about "Pages still in with PR but no Description or Title!"
I've done some detailed research on this: if you search the MT-Blacklist you will find a lot of sites (including mine) on this blacklist (added for blog comment spamming) that have plenty of links to the site (as shown in AllTheWeb's backlink check) but problems in Google - either no backlinks, no PR, or no cache on indexed pages, or some of the above while other things (like PR) are fine.
My site's problems started in mid-December 2003 (I'd just got the traffic to 7,000 unique visitors a day). Prior to the problems the home page peaked at PR7 for one month, but dropped to PR6, then PR5; backlinks also decreased in Google, but nothing to be overly worried about.
The site had, and still has, over 6,000 backlinks to the home page in AllTheWeb. In Google it dropped to 2 backlinks and then to 0 over a couple of days. Traffic halved over a few weeks as pages were removed from the index.
At one point every page was indexed (2,500+); it dropped to as low as 200, and now shows 600, but lots have no cache (about 180 do).
The PR remained stable at PR5 until January, when it dropped to PR1 (it still is PR1), which it clearly shouldn't be, as I know there are PR5+ links to it.
I understand SEO (I offer SEO services) and so know what does and doesn't work with Google. Other than the links to the site, nothing else could cause this.
So, has anyone else who has seen their site's PR/backlinks drop also got a lot of spam-type links to their site?
David
1. allintitle
2. allintext
3. allinurl
4. allinanchor
The only thing the Top 10 resemble or equal is allineverywhere :D
Not one site in the Top 10 showed up under any of the advanced operators.
Interestingly enough, all of the Top 10 are heavily supported by spam links and anchors coming from spammed blogs, spammed forums, free web pages, all kinds of free accounts, rampant interlinking, and multiple throw-away domains.
You would think these tricks would have been filtered out a long time ago. One spammy site might have slipped the filter(s), but the entire Top 10? I didn't bother checking the rest of the SERP.
The Top 10 is enough to convince me that the algo is way off target.
BTW, I've been refreshing it from time to time hoping that this is just a local data center problem but it's sticky.
What can I say - spam links are the future of Google?
Reminds me of the now-defunct Snap Directory.