Forum Moderators: Robert Charlton & goodroi
It is still getting spidered heavily by mozilla googlebot. It still gets traffic from google every day just not hardley as much, maybe 10% of before.
It doesn't seem to be banned because the pages it has left still rank the same as before?
I'm stumped? This is new to me.
Was over 200K in the index, good traffic
Now down to under 300 pages, being spidered at a rate of 6000+/day for weeks but nothing appearing.
My pages each have unique content, generated pages by php but similar in that they have the same templates (6 different templates) and format.
There are some pages which I could understand if G decided were not unique enough, yet they are still there! - can't understand it.
My pages were mostly created in the last few months - is that true of anyone else who has this problem?
Sent google an email today - see what happens.
Morever, the traffic has gone down by almost 70% this really sucks.. The problem has hit in last 2 weeks reducing/deteriorating day by day.
Whats wrong exactly here?
Don't know if anyone saw this overnight, but one of my "barometer" sites (www.cnn.com) lost 1.1 million indexed pages ... However, as of 15 minutes ago, they have gotten 500,000 of them back ... WOW!
Maybe google is dumping pages that get very few clicks
Not so sure of that as of the 130 or so pages still indexed of the 1300 for my site some only get 5-10 page views a day and most of those dont come from G.
the pages that are left are current and top 1-2 tier of the site
Definately not the case at my end, the pages are randon, 26 directory folders with 50 pages in each, it's like a lucky dip.
Today I got an increase in my indexed pages, another 40, they are supplemental though.
I personally dont believe there is anything we can do. I submitted a thread on a simillar subject and in that I mentioned that on checking the errors in the sitemaps pages I seen 7 very old directories from 2002 on the errors list. Very strange as they have long gone. Like three years gone.
I think it's google fixing itself. One of my forums lost 50k pages, however I notice the pages that are left are current and top 1-2 tier of the site. SO I think it's reindexing things from the top down.
all my main pages are shown by a site: search, too.
the number increased since yesterday by 5%. hopefully, they go up.
Obviously, Google is testing some new stuff, but I hope the Eastern is the dead line for this flop test :)Sunday's weekly indexing will show the final results and what will happen further. Hopefully, nothing bad for us, because there's no actual reason for it!
The site is an ecommerce shop and all of the product details have been dropped. We still have categories indexed. We have taken great lengths to make sure content is unique and also stopped google indexing print pages etc. so that we are left with one url one page of content on the site.
Its a bit of an arse at the moment as we had only just moved our site to a dedicated server based on increased traffic.
It seems that anything 3 levels deep has been stripped.
we have a google site map too and it doesn't report any problems spidering the site. If you would like a can email the url?
(won't be around for the next couple of days because of Easter so apologies in advance if you don't get any replies for a while.)
We can hope they follow the christian holidays here. Good Friday sees the death of the old index and easter shows a nice resurrection.
One thing is for sure ... I can't wait to see what comes out of the Sunday re-indexing myself ...
BTW ... Does anyone remember what was going on last year, at about this same time? Sure seems quite similar to me ... I'm one of those "hoarders", who saves every log archive from every site that I have ... I looked over some of the archives from last year, and the similarities are quite recognizable ...
a) Cache summer 2005
b) Supplmental
c) canonical issues (non www. variety, etc)
I sure as heck this gets fix but I am glad to see this isn't an isolated issue. I was freaking out and immediately came to WebmasterWorld for comfort and solace.
most of the web pages on both the websites has been dropped from the google index, don't know the reason.
Google search results shows me the results which is one month old, yes , the new updated web page cache has reversed back to the old one.
You can find hijackers by looking at your inurl:yoursite.com command in Google and checking every listing manually to see if it redirects to your site and if so run a server header check on the link to see if it's a 302 redirect). If people are framing your site put a pop out of frames script on every page.
To find content thiefs you need to search for content from your pages in Google in quotes (1 sentence at a time).
My site should have about 130,000 now seems to have settled at around 250.
I think the problem is duplication, a few of my 3rd and 4th level pages are still in there and holding well (fresh cache), and I can now see that they all seem to have a little extra info in there - sometimes just 5 extra words than all the others.
I guess they are trying to cut down on the overall index size by attempting to kick out obvious trash, but are obviously not detecting what is trash very well.
IF a site sells widgets and uses urls such as widgets.com/usa/sc/greenville - the difference in the pages in this section will only be maybe a small description, a photo and a price - but surely that does not make it junk and not worth including?
I am getting closer to the "0"
Hi I have 6 sites, of a site with 300 pages only 68 indexed, a site with 600+ pages only 4 indexed, a site with 1500 pages only 14 indexed, etc. etc. googlebot still crawling all my sites, and all my pages daily, (what for?)
I can see the end for me very near, good luck to all with Google results
We lost our #1 ranking for our search phrase on MSN at the same time, now we don't even show up at all.
The traffic has plummeted 80% since the 12th April.
It's a bad way to treat the folks who make your business. Without websites Google would not exist.
The sites are photography and information based with unique content that I wrote myself.
Martin.