Forum Moderators: open
”I think I would try wrapping the "terrible" words in non-breaking spaces, e.g. sexy ”. - KeyMaster
Has anybody tried this before and does it work. Would blue widgets be seen by Google as one big word? If not, then how does Google interpret it?
Also, does Google look down on this as a form of deception like it does by using “$” within words in place of “S”? I am wondering because I can think of a couple of ways that some people could abuse the use if it were to lump words together. It could be used to artificially increase your keyword density or to hide duplicate content etc.
Sometimes I want to use certain keywords as examples, like blue widgets, even though my pages are not relevant to blue widgets. I don’t want to waste the searchers time on viewing my page that is irrelevant to the keyword and I don’t want to waste my time when combing through my log files. Is the answer? Also, will Google penalize sites for abusing ?
Cut & paste search URL into address bar of browser for link to work.
<http://www.google.com/search?q=%26nbsp%3Bsexy%26nbsp%3B>
Try to search a little deeper for keywords listed on the page that pops up in the search results and Google comes up dry:
[google.com...]
Remove the word "Sexy" and the listing pops back up:
[google.com...]
hehe! ;)
I don't think you would be penalized for it, you just won't get much traffic for those keywords (which is what you want).
[edit]Repaired link[/edit]
[edited by: Key_Master at 10:09 pm (utc) on Sep. 23, 2002]
They are not the web design police. The day they start deciding which web page parameters and commands are allowed/not-allowed will be a very sad one indeed.
I don't think you have a problem. I use and many other similar devices for a variety of valid purposes and have never had a problem.
http://www.google.com/search?q=egocentric+Evokation+Dwarf - Key_Master
If I followed this example correctly then the page you are referring to actually has the visible when looking at it with my browser. The source code is not Sexy but rather &nbsp;Sexy&nbsp;".
It appears that Google resolves & into & and thus allows the search for sexy to return the page containing &nbsp;Sexy&nbsp;. Based on this example I conclude that a source code of “blue widgets” will be viewed by Google as just “blue widgets”.
If the conclusion in the above paragraph is correct then the questions in the title and description of this thread have been answered: The can’t be used to stop unwanted log entries and there is no way anybody can abuse this and thus there is no reason for Google to penalize sites.
Well, I am still looking for a way to avoid unwanted log entries for keywords. Does anybody have any other ideas. Perhaps somebody else can make a better case for because I have given up hope for using it to stop unwanted log entries.
But if you really want to be on the safe side, don't use keywords that do not relate to the content of your site. If you must use them, disallow spiders from indexing that page via robots.txt
Key_Master’s suggestion about using robots.txt is a good one for many cases where you want to avoid unwanted log entries, but in my case there is plenty of other relevant content on the page for keywords that I do want referrals from. I think I will borrow the underscore in Key_Master’s member name and write my examples in the form of blue_widgets. That ought to cut back on unwanted log entries as well as highlight that it is just an example:)
It would be nice to know how Google indexing and search handles special characters like ­ (soft hyphen
It would seem that Google doesn't handle the soft hyphen very well.
http://www.google.com/search?q=floccinaucinihilipilification
http://www.google.com/search?q=floc%ADci%ADnau%ADci%ADni%ADhil%ADi%ADpil%ADi%ADfi%ADca%ADtion
http://www.google.com/search?q=floc+ci+nau+ci+ni+hil+i+pil+i+fi+ca+tion
Too bad. Scandinavian words tend to be quite long and it would be nice to be able to hyphenate them without loosing any keywords in Google.
Anyway, assuming one of the techniques above can be successfully implemented in a way to break up “blue wi​dgits”, then is there any risk of upsetting Google by feeding Googlebot the equivalent of “blue wi dgits” and feeding the people “blue widgets”? I know that it seems like a silly concern but the technique might be able to be abused by somebody trying to hide duplicate content or something.
Maybe Hagstrom’s problem with soft hyphens will be my solution:)
Hopefully not, non-english webmasters will probably ask Google to handle soft hyphens properly until they comply. (Some of us will probably ask for compliance with other unicode stuff as well ;))
zero width non breaking space
You're right, I forgot the breakability of the zero width space. Please note that there are some issues related to the zero width non-breaking space and a word joiner would probably be more appropriate.