Welcome to WebmasterWorld Guest from 54.224.44.168
Forum Moderators: Robert Charlton & goodroi
I realize that many of the list items would logically just be a perceived "signal of quality" reversed but if we all dig down into our experience then this exercise just might bring up a few unique and useful insights. Here's a start:
25 Signals of Crap
I'm not claiming any of these are definitely a signal of crap, but they make my list of possibilities based on my own conversations and observation.
Please add/subtract/modify and let's see if we can find a new perspective and learn something.
They are generally 300-500 words long with multiple repetitions of keywords/phrases but a low unique word count.
eg
APPLES
Apples are a type of fruit. They grow on trees and can be red apples or green apples. Apples can be eaten raw or cooked. etc etc
These articles almost always contain within them 2 or more large adsense or other advertising units.
Reciprocal link request pages.
Depends on what you are linking too, just adding any link because the the other site has done the same then I can agree. I use such pages to gather links for my sites but will only link to sites that are of the same nature. My sites are also aimed primarily at locals so I may include some links that may be of interest to the local visitors.
No Privacy policy.
Don't have one... :p Then again the only e-mail addresses I gather are through my forums and the user only ever gets email they request through the forum. I haven't ever sent a single e-mail to anyone from any of these e-mail addresses and never will. Just because the site has no privacy policy doesn't necessarily make it crap, in my case your information is more secure than most places because I'm never going to use nor will I ever share it. What percent of users do you think ever read that anyway? That's besides the fact that most of the "biggies" will use some big long privacy policy that appears to protect your information right up until the last paragraph or so.... Who would you rather trust your info too, me or the guy with the big long privacy policy your not going to read anyway?
Stagnation (Site never changes)
I have one site that hasn't changed in 3 or 4 years. It's for a single niche product and has ranked at the top for the last 2 years. There is simply nothing to add that will add any value to the end user. I will be updating it shortly but they will mostly be cosmetic changes.
No / very few inbound links.
The above mentioned site is so niche that there is very few places that would have any interest in linking to it, certainly doesn't make it crap though.
For the most part the rest I can agree with.
Just because the site has no privacy policy doesn't necessarily make it crap
No, but a search bot is not going to sit down and have a nice friendly chat with you about your attitude to privacy, it will just note that it didn't find a policy and may (who knows?) chalk that up against you.
Aside from which it is a legal obligation in many juristictions to publically disclose on your site how you use personal information provided by the user.
Similarly how does "No published contact address, email address or phone number" have the slightest relevance to quality? But more to the point, how is a bot going to read a graphic showing an email address?
Ya gotta think like a search bot looking for quality, not some paranoid widget buyer. The two have little in common.
On websites that don't collect information privacy policies are irrelevant.
Very few sites should be without an "honest" privacy policy.
Do you allow the user to contact you by email/form?
Do you set a cookie?
Do you track your sites visitors, what info is collected in your stats program?
Do you require registration for posting comments etc?
Privacy policies are not only for sites which collect personally identifying information. I think they are a good idea for any site.
Ya gotta think like a search bot looking for quality, not some paranoid widget buyer.
I didn't realize this thread was only about "bots" and not real visitors?
Similarly how does "No published contact address, email address or phone number" have the slightest relevance to quality?
Depends what type of site you are discussing. If it's a "real" business/organization it better have a complete address.
Also, many "human" edited directories will not list a site in any regional categories unless there is a identifiable address listed on the site - why would they?
[edited by: The_Contractor at 11:35 am (utc) on Jan. 27, 2007]
Profanity or explicitly adult language on a non-adult site.
May I take exception, please?
I assume you really mean the "inappropriate" use of profanity or explicitly adult language. My own slang web site naturally contains profanity, is not an adult site, but nor is the profanity gratuitous - it is merely defined.
Within literature and art, profanity can add impact. I remember reading a poem when I was young, suddenly out of nowhere the narrative refered to his emotions being told to "F**** off!", it added creative and artistic impact.
The inappropriate or gratuitous use of profanity, I agree is totally unneccessary and lowers the value of a piece of work.
Matt
Privacy policies are not only for sites which collect personally identifying information. I think they are a good idea for any site.
Yes of course they are but as Steveb says they are hardly a quality indicator. While we are on the subject, I agree with some of your assertions but IMHO some of the others are not really quality related. I suppose it depends on whether you mean signals of crap to the casual visitor or to another designer doing an analysis of the site.
Reciprocal link request pages.
Why is this a problem? Many quality sites offer this and done properly it is not a problem.
Outdated copyright date or last modified date visible on the pages.
This could be a minor oversight. It is quite common and it would not necessarily indicate to me that the site is of poor quality.
Massive numbers of incoming links from link farms.
How would a site visitor recognise this?
No / very few outbound links.No / very few inbound links.
Related to quality? Not in my view.
All inbound links are to homepage only.
Why is this a problem?
joined:Oct 27, 2001
posts:10210
votes: 0
Adsense
Well, there go THE WASHINGTON POST and THE NEW YORK TIMES. :-)
Well yes, but to be fair, most people already have opinions on the value (or quality) of the NYT and WP. The fact that they carry adverts is probably neither here nor there, particularly where advertising revenue provides the main part of their hard copy income.
Quite simply: The standard URL for your compact privacy policy is:
/w3c/p3p.xml
and a link to your human-readable privacy policy is included in that file.
Jim
joined:Oct 27, 2001
posts:10210
votes: 0
- A large number of template-based, keyword-driven, computer-generated pages that contain little or no content, and which are waiting to be filled with user content. (Such pages may not be populated with content, but you can be sure that they're filled with AdSense ads, affiliate links, and/or price-comparison logos and links.)
Google could go a long way toward cleaning up its index by penalizing sites that have a significant percentage of "placeholder pages."
-- The drop list for the user to select their credit card expiry date still includes last year.
I think this one needs a bit more qualification. Making these sites disappear would remove a LOT of reference sites and LOTS of valuable information.
How does one define stagnation anyway? (sounds like something putrid or decaying).
Now, get rid of those pond scum sites and I'd be happy
I think there's a major flaw in threads like this. Vague questions that attract vague answers.. (whoops!) Though unspecified I was referring to things that could potentially be determined during a search engine bot visit and the subsequent processing and use of that information (Hence my posting in this forum).
Remember that I'm not claiming to 'know' anything here - I certainly know nothing about what goes on in the search engines - though I do have my opinions..
A lack of a privacy policy is complete nonsense. Having a privacy policy obviously is not a signal of quality, and not having one certainly does nothing to imply lack of quality.Similarly how does "No published contact address, email address or phone number" have the slightest relevance to quality? But more to the point, how is a bot going to read a graphic showing an email address?
You may be right, but isn't it possible that a bot can determine whether your site is collecting information through cookies and forms and such? How many privacy policies have you ever seen without the phrase "Privacy Policy" in them? Aside from that, how hard would it be for an algorythm to identify a TOS or PP simply based on the content of the page?
As for my guess that a lack of either an address, email, or phone number could possibly hurt... I agree that's on the edge of reasonability, but maybe it would depend on the site in question, and what 'kind' of site the engine determines it to be?
A good question to ask at this point may be just how complex of a system are search engines using to rank sites? Do they know if you're a store? Do they link sites together by webmaster (where they're able?) Do they look for 'sets' of things? (What if they find a cart but no privacy policy or SSL?) Certainly some of these signs of crap may look like crap when taken by themselves - but what about when they're combined with other 'signs' that the bot can recognize and catalog?
Maybe every site is not measured by the same things.
Ya gotta think like a search bot looking for quality, not some paranoid widget buyer. The two have little in common.
But isn't this the point? Are search engines not trying to increase their ability to analyze pages and sites to the point where their bots and algo's can see them more or less as users do? What other direction could you choose when tweaking a SE algo that exists largely to deliver the best site to every "paranoid widget buyer"?
Reciprocal link request pages.Why is this a problem? Many quality sites offer this and done properly it is not a problem.
Apologies again, I wasn't specific enough. I should have coupled that along with existing recip links to non-topical or relevant sites.. If I were the big G I would take that as a sign of trying to game me and may or may not decide to throw a dunce cap on the site.
What else? C'mon is this all you got?
:)
Check out the forum, dude. This is the Google forum.
We aren't talking about real visitors here. Real visitors are completely irrelevant to the concept of "signals of quality" and of crap too. Real visitors for example can read "Geore washington was the 37th President of the US" and know this is a signal of crap. A search engine bot on the other hand can't judge a fact like that. Bots look for signals of quality (like links from respected sites) or signals of spam (like having only links from banned sites).
Addresses or privacy policy are useless to bots a signals of quality. Likewise, any content that has to be coherent and accurate is no signal of quality or crap to a bot. Bots only can look for signals that make it likely something is good quality or not.
And that is why they are called "signals", not "answers".
Adsense on a website rather than a site's own advertising is a signal of lower quality in general, but that doesn't mean a high quality site can't use adsense. An element that is low quality is just a signal, not a definitive answer. In this case, those two sites have a near infinite number of signals of quality that overwhelm the few signals of crap.
grep "w3c" years.logs
1 single spammy? bot.
The w3c policy isn't linked from the sites pages,
but w3c policy pages does link to the main/user privacy page which is obviously seen by the bot as it follows the on page links.
Does any SE actually even check for a "w3c" folder privacy policy?
Example: "Many people get the i and e the wrong way around and write 'freind' when the rule 'i before e except after' would indicate that it would more likely to be written as 'friend'"
Breaking one rule possibly doesn't indicate crap quality, but breaking a number possibly does!
veto.
cf. Dmoz.
> How about crappy html coding!
<fnord>
Yes, trinorthlighting, we both are the last dinosaurs checking w3c-conformity. As paranoids by passion, we know that google's own policy in this respect is just a test;)
</fnord>