Welcome to WebmasterWorld Guest from 54.197.171.28

Message Too Old, No Replies

Has the removal of the 5 billion page site hurt you?

or am I the only one?

   
3:10 pm on Jun 22, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



My primary site is a pretty large site (> 250,000 pages of user generated content) and has done well in Google through Big Daddy and the subsequent tweaking until this past week. Honestly I'm not sure I had ever even seen a supplimental listing until this week. We've not paid a lot of attention to SEO since whatever we have been doing has seemed to work well.

Until Sunday June 18th at around 12:00 PM CST.

Sunday afternoon I noticed a sharp dip in traffic and when I investigated, all I could find was the post from Adam about the 'bad data push' and how they were working on fixing the problem. Is there some way that my site got trapped in this fix?

Something that I should add is that I use GoogleAlerts to notify me of when my site name appears in a new listing on Google, and for weeks I have been getting almost daily notices from GoogleAlerts showing the "5 billion page site" with content scraped from my pages. I don't know that any of these pages actually linked to my site, I never thought to look.

Now traffic is down 40% from the norm and I've also lost about 2 thirds of the pages that were shown by doing a "site:domain.com" command. I wonder if by removing this site caused the backlinks to my site to be recalculated, though currently a 'link:domain.com' shows the same number of backlinks as it has for the past month or so.

Just to be clear, I have no connection with the '5 billion page site' other than they scraped a lot of my content.

Any ideas or suggestions or am I alone on this?

12:09 am on Jun 23, 2006 (gmt 0)

10+ Year Member



Google seems to have dropped a number of my pages sometime within the last week or so. The largest of my subdomains also has a great deal of pages listed as supplimental now. It could be related? I don't know anything about the scraper site you mention though, so I don't know if it was copying my content...
1:05 am on Jun 23, 2006 (gmt 0)

WebmasterWorld Senior Member billys is a WebmasterWorld Top Contributor of All Time 10+ Year Member



>>GoogleAlerts showing the "5 billion page site" with content scraped from my pages.

I got a lot of those too. What a mess. All their brain power is distracted by beta launches. No one pays attention to search anymore and it shows.

4:26 pm on Jun 23, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I don't know anything about the scraper site you mention though

It's funny how this was such a big issue that it required immediate attention at the plex but it has hardly been discussed here at WW, unless I missed it or it just didn't effect that many people.

4:40 pm on Jun 23, 2006 (gmt 0)

WebmasterWorld Senior Member tedster is a WebmasterWorld Top Contributor of All Time 10+ Year Member



All their brain power is distracted by beta launches.

It's a big company -- different brains work on different areas and they've been hiring a lot.

5:36 pm on Jun 23, 2006 (gmt 0)

WebmasterWorld Senior Member netmeg is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month



On that same Sunday (June 18), some of my client sites went Suddenly Supplemental on account of the page titles going corrupt - somehow text from the pages was appended to the titles, and the meta description tags were ignored entirely. Most of the pages are still supplemental, today, on Friday. I don't know if this is the bad data push or what, but nothing seems to be happening as far as fixing it. It's not *all* of our sites - maybe 15 out of 200. I haven't been able to pinpoint why some sites were affected and others weren't.

Still more sites have completely lost their page titles and are only using titles from DMOZ.

It hadn't occurred to me that any of this might be related to the five billion page site, but it's an interesting thought.

I just want to get my page titles back and my descriptions read, though.

6:05 pm on Jun 23, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I had reported some other sites that were doing the 5 billion page thing using text from my site (among others) a long while back. This was pre-BD, and after almost 6 months, G removed them from the index. Some of them still exist, and still keep popping up once in a while.
6:24 pm on Jun 23, 2006 (gmt 0)

5+ Year Member



how can i setup googlealerts to warn me if my site makes it's presense known?
10:33 pm on Jun 23, 2006 (gmt 0)

10+ Year Member



>There was a very interesting thread about that >site and other spamming activities but it >suddenly vanished from this forum.

There were too many specifics in that thread and the thread was removed as a violation of TOS.

11:04 pm on Jun 23, 2006 (gmt 0)

5+ Year Member



dataguy -

you are definitely not alone in this. one of our sites has close to 30 million pages in the index. we have witnessed about a 40% decrease since the same day you mentioned. our site: is also showing weird data...the numbers returned are all over the place. however, the number of backlinks has remained the same.

4:19 am on Jun 24, 2006 (gmt 0)

10+ Year Member



i had a similar problem
the pages dropped from
2 million to half million now
traffic is lower by 30 - 35 %
4:25 am on Jun 24, 2006 (gmt 0)

10+ Year Member



< somehow text from the pages was appended to the titles

Yes! I noticed that too... on many of the pages that went supplemental, the titles displayed by Google were all screwed up.

4:29 am on Jun 24, 2006 (gmt 0)

10+ Year Member



And even stranger, the page text that is appended to the title tags on the supplemental pages, has not been on the site for about 3 months.
12:41 pm on Jun 24, 2006 (gmt 0)

5+ Year Member



To implement the Google ALerts:

1) Go to Google'sHome Pahe
2) On Top of the search box, you'lll see Google services:

Web Images Groups News Froogle Maps more

Click on "MORE"

3) The next page that pops up, click on Alerts

4) In the search entry box, enter your site name example.com

5) In the pulddown menu select NEWS & WEB.

6) Finalize by clicking on the Create ALert Button.

Then Google will alert you every day of your site name appearing on other sites, or if it is the subject of a news article.

3:23 am on Jun 25, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



My indexed pages have bottomed out and started climbing back up in the early Saturday morning... anyone else having the same experience?
9:18 am on Jun 25, 2006 (gmt 0)

10+ Year Member



@ Dataguy

I can fully backup that. Big change here. Do you think there is a correlation between our increased saturation and the removal of the 5 billion page(s)?

Regards

itloc

11:12 am on Jun 25, 2006 (gmt 0)

5+ Year Member



"one of our sites has close to 30 million pages in the index."
Good for Google if start cutting thoce mega usless sites ,and BTW may I ask you what was your subject that needed 30 million pages? another wiki? ...don't tell me that you sale widgets because not even TESCO SPAR or ASDA has 30 million items to sale ,....conclusion probably your 30 mils was another "you search for blue widgets in galapagos" scraper site .Keep the good job Google cut them B$%^&s
11:30 am on Jun 25, 2006 (gmt 0)

10+ Year Member



>>>My indexed pages have bottomed out and started climbing back up in the early Saturday morning... anyone else having the same experience?<<<

Notice this also... Pages started to decline a week or so ago. Now I see the count starting to go back up.

11:30 am on Jun 25, 2006 (gmt 0)

10+ Year Member



>>>My indexed pages have bottomed out and started climbing back up in the early Saturday morning... anyone else having the same experience?<<<

Notice this also... Pages started to decline a week or so ago. Now I see the count starting to go back up.

12:54 pm on Jun 25, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Good for Google if start cutting thoce mega usless sites

That's alright, we'll forgive your ignorance, toothake. The site owners that I have been communicating with all have user generated content sites, and it's not uncommon for UGC sites to be quite large.... and they have nothing to do with selling widgets or anything else for that matter. Besides, with Google, having "30 million pages in the index" does not mean at all that jonathan's site has 30 million pages. My 250,000 page site shows over six times that amount normally in Google's index.

Back OT:
My site is now about 50% restored to the index this morning. About half of you who stickied me your URL's are growing as well, the other half is still shrinking.

2:41 pm on Jun 25, 2006 (gmt 0)

10+ Year Member



So lets get this clear - are the 30 million pages that jonathannelson states just one of his sites has/had in google all good quality user generated content?

[edited by: futuresky at 2:48 pm (utc) on June 25, 2006]

2:45 pm on Jun 25, 2006 (gmt 0)

5+ Year Member



Some of my sites indexes were almost back to normal but in the last hour I had one lose it all and is starting all over again. This pattern of building then losing seems to never end.
3:00 pm on Jun 25, 2006 (gmt 0)

5+ Year Member



Exactly.

I started climbing about 2 weeks ago (almost fully indexed) and in the last 3 days its the complete opposite. I am slowly dropping pages yet again.

3:11 pm on Jun 25, 2006 (gmt 0)

5+ Year Member



futuresky et al-

just to clear the air here. yes, we have lots of user generated content. yes, we have millions of products. our direct competitor is froogle and yahoo shopping.

4:30 pm on Jun 25, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I am experiencing the same thing. About a 50% decrease in traffic. About a 66% drop in indexed pages.
4:53 pm on Jun 25, 2006 (gmt 0)

10+ Year Member



I for one would like to know a little more about the "5 billion pages removed" comment? Are these google index pages, a large site, multiple spam/scraper sites?

Please enlighten me...

4:58 pm on Jun 25, 2006 (gmt 0)

10+ Year Member



The sstory is here: [digg.com...]
5:01 pm on Jun 25, 2006 (gmt 0)

5+ Year Member



"and they have nothing to do with selling widgets "
Well the same old junk sites(write a review about x place)really sorry for those lonely poor web sods out there ,they write fresh free content for you and they haven't got an idea about your revenues ,while other webmasters sit down there A%%% and write content by them self ,right? Mr dataguy...
5:39 pm on Jun 25, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Do you think there is a correlation between our increased saturation and the removal of the 5 billion page(s)?

I can't say for certain, of course, but we monitor traffic hour per hour, with a weekly differential so we can tell within an hour when Google hiccups, and the hiccup began for us last Sunday at around noon CST, which is pretty much exactly when Adam Lasnik was saying this issue was being addressed, over at threadwatch: [threadwatch.org...] (this is a pretty good read if you have the time to read through it...)

6:45 pm on Jun 25, 2006 (gmt 0)

5+ Year Member



I gotta wonder. What the heck does someone need 30 million pages for?

One of my sites has only 200 pages, the ither site has 31.

What could you possiblye need 30 miilion pages for? What kind of site is it?

This 36 message thread spans 2 pages: 36
 

Featured Threads

Hot Threads This Week

Hot Threads This Month