Forum Moderators: phranque

Message Too Old, No Replies

A Call for Content

Do we really need more mindless dribble?

         

zulufox

11:38 pm on Sep 4, 2004 (gmt 0)

10+ Year Member



On another thread, a user discussed his "380,000 page site" and complaining that it had fallen out with google.

I have not seen his site, but let us assume he does not have a staff of 4000 writers working full time for the past 10 years.

Do we, as internet users, WANT this autogenerated crap site on our net? Did google do wrong by eliminating them (in some places) from their ranking?

NO and NO!

These site are complete crap, filled with autogenerated pages with affiliate products, ads, and newsfeeds. They do not input anything useful, to anybody.

Sure they get clicks, and some have said that this shows that they are useful to users, but this is mistaken.

Just because a user clicks a link on these pages doesnt mean they serve a purpose, it only means that most likely the keyword they googled was too crowded with SEOed crap pages that anything good was on page 291, thus they clicked on the one that SEEMED to be what they wanted, NOT becuse it was infact useful.

A small of 2 pages of original content provides more than 380,000 pages of autogenerated search engine optimizatext.

I know many WebmasterWorld members have sites such as these, so I ask you, what do you think, can you justify your exsistence in the grande scheme and goals of the internet?

encyclo

12:20 am on Sep 5, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



the grande scheme and goals of the internet

I'm not sure that there ever was a grand scheme, and the internet is not an entity and has no real goals. However, the essence of your argument is well-founded. When a spam site such as your 380,000 page site falls out of the SERPs, it is usually a victory for the end user.

It is possible to have a 380,000+ page site with real content on every page, of course - just look at the site you're on now! These kind of sites tend to be very strong and steady in the results.

Spamming the index is always a short-term strategy, but it can pay hansomely. However, it is the good content sites which win in the long run.

ronin

3:40 pm on Sep 5, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



The sad fact is that a well-written, well-informed, 1000 page website probably costs a lot more to put together and makes a lot less money than a hastily written, derivative 100,000 page website.

The web will change for the better when the majority of people who build websites become more concerned over the quality of their output than they are about short-term (or even long-term) profits.

Until that time (ie. never) the best we can hope for is for search engines to develop algorithms which return quality websites in the SERPS and bury the rest of the rubbish.

digitalv

3:49 pm on Sep 5, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I have to say I agree, yet I don't believe I've ever found myself on such a site. Google's results have always been pretty good to me, but maybe I just "search better" than other people? I've never come across one of these spammy crap article sites you all are always talking about. Perhaps it's not as bad as you think?

ergophobe

4:17 pm on Sep 5, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Actually, lately I find that .... I'm guessing here ... at least 1% (I would say 5% but that's just my aggravation that gets me to say so) of my Google searches results in a hit on a page that is just a directory of auto-generated terms.

One that I come across often because of my research comes up if you type in the name of a French village and then a word like "histoire". Even if the village has valid pages for it, there is somebody with a good domain name who has created pages with names of a village as the <h1> and then just a few hundred common secondary search terms. I have never found a page on this site with actual information about one of the towns.

More recently, I come across pseudo-directory sites. You type in a search like "pseudo-directory" in Google and it gives you pages that consist of nothing but ad banners, your search term and variations on your search term and supposedly related terms (pseudo-psychology, pseudo-science and then from there on to parapsychology and occult). These links just take you to a similar auto-generated pages.

The last one I remember (yesterday) was a search that had the word "separation" in it and got one of these pages that had nothing to do with my search phrase, but it did have links for "divorce" and "breaking up" (nothing to do with the search, but okay if they led to actual content pages, which they don't. Then there were links for "degrees of separation" which triggers inclusion of a "Kevin Bacon" link and, having found an actor in the list of related auto-generated terms, the page also had a link for "Clint Eastwod" and so on. I can't remember the search itself because the only thing the terms I mention had in common were their utter disconnectedness from the original search.

The page then has banner ads hawking books on overcoming divorce, DVDs of Clint Eastwood movies, etc.

I've been wondering about a Google proxy where you can build in spam filters for your searches. If you click the "Spam" button in the browser bar, your proxy will henceforth filter out Google results including that site...

Okay, I'm ranting and getting carried away, because this phenomenon has been really aggravating me lately.

Tom

BTW, I wrote two really good (I think) pages yesterday - significant research, including several articles in the American Journal of Sports Medicine and other scholarly studies. If I can do that 190 days per year, that 380 pages. So it would only take one hundred of us working ten years to get 380,000 pages. If I let quality suffer a little, we can probably bring that down to 50 of us. I just need to find 49 more guys and another ten years and I'm set! Realistically, though, the topic in question will likely be utterly exhausted in about 1000 pages.

Stefan

1:05 am on Sep 6, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Okay, I'm ranting and getting carried away, because this phenomenon has been really aggravating me lately.

It wasn't a rant. It was identifying a problem that makes the internet next to useless at times.

The one site I manage always does well in the serps, because it's a content site with a specific reason for its existence. BUT, when I use the internet to find information for my own purposes, the serps are often a joke. It p*sses me off to no end. Google, Yahoo, whatever, mainly present a load of garbage that needs much sifting to find anything other than dross. A pox on all their houses; they're primarily giant databases full of spam.

freshfish

1:14 am on Sep 6, 2004 (gmt 0)

10+ Year Member



The only reason that guy is complaining about his multitude of filler pages being dropped is that they were listed in the first place. The problem is, is that a site like Google becomes a household name in no time and a project like DMOZ is only used as a filler for the larger engines.

Just my 2...uhm 1 cent worth :)

Rosalind

1:25 am on Sep 6, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I came across an entry in my logs just now that was a referral spammer peddling both referral spammer software and software to auto-generate huge websites which "grabs real and relevant information to feed to the search engines", ie snippets of the carefully-written content from decent sites.

The mindless dribble mostly dominates location-based searches, but what if people start using software like this to target most other keywords?

There is some evidence that many users don't look at the site descriptions, or at least they look then are easily distracted by off-topic results. I see this in my logs when my sites are accidentally well-optimised for something irrelevant. So these sites will be getting hits, followed by people leaving to get what they really want, and possibly clicking Adsense or banners on the way out. So if the webmaster's measure of success is these clicks then those sites will not go away.

encyclo

1:37 am on Sep 6, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



This thread made me think of another:

[webmasterworld.com...]

Especially post #5 and Brett's reply (post #7). It's the fundamental truth brought about by click-through advertizing such as Adsense or similar - if can be advantageous to have crap rather than decent content as it pushes the user away from your site - and if you've got it set up just right, the "away" will be through an ad.

Aside: Now with three Adsense panels permitted, the technique has even better chances of being a good earner.

dcheney

3:19 am on Sep 6, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Of course it is possible to have large-ish auto-generated content-rich sites. My major site is over 40k pages, almost completely auto-generated based on a custom database system. (Granted, it has taken me about 8 years of adding data into those databases - and its definately a niche audience and non-commercial.)

Its been in Google for well over 2 years and continues to do quite well.

zulufox

4:18 am on Sep 6, 2004 (gmt 0)

10+ Year Member



I took a look at your site dcheney, cool site, this threa d isnt really about sites such as yours.

You created original content by inputting data, such as names, dates, descriptions which was then autogenerated and compiled into useful pages.

Many ecommerce sites have thousands of pages and are created much in the same way as your site.

The sites this post discusses are "crap sites" without any REAL content.

mincklerstraat

3:28 pm on Sep 6, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



no dcheney, completely in agreement with zulufox here. I think 'auto-generated' has more to do with grabbing content from other sites and organizing it in an automated manner. You need to have a really, really, really smart bot / site organizer to design a site like this that's also worth visited, and I don't think I've really seen any. Btw, almost laughed off my chair seeing what yer favorite song is.

ergophobe

3:35 pm on Sep 6, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



DCheney,

As the previous poster said, I am emphatically not talking about sites like the one in your profile. Eight years of entering data is not exactly auto-generated! What you have is a dynamic, database-drive, content-oriented site with real value. I'm a historian - your site is the kind of think that I'm usually looking for.

I'm talking about sites where people basically enter something like the crossword dictionary that cross-links 20K terms. They then create pages like the ones I mentioned. So for example, I would enter "Bishop of Digne" and I would get a page with <h1>Bishop of Digne</h1> and then I would get links like
Bishop of New York
Bishop of Boston

which sound okay, except the pages linked to have NO INFORMATION about those bishops.

Then I would get links like

->chess, checkers, games, poker, gambling (keying on "bishop")
->rook, pawn, queen (keying on "chess")
->Freddie Mercury, Bob Dylan, REM (keying on "Queen" the rock group).
-> -> Las Vegas, Atlantic City... (keying on "gambling")

Not a single one of these links would lead to a real page. Every link leads to another set of auto-generated links on the same domain with no content, but liberally sprinkled with advertisements. As someone said higher - a POX ON THEM. They are wasting my time.

Tom