Will Google ignore urls that use tildes?

     
7:51 am on Nov 17, 2008 (gmt 0)

5+ Year Member



I've come across a site whose URLs typically look like the one below:

http://www.example.com/d~c-home_appliances~b-96648.aspx

Can anyone tell me whether Google will ignore this type of url, so that the page may never rank for any keyword? I ask because there are a lot of tilde signs (~) in the url structure.

What do you think?

[edited by: tedster at 8:25 am (utc) on Nov. 17, 2008]
[edit reason] switch to example.com - it can never be owned [/edit]

9:16 am on Nov 17, 2008 (gmt 0)

tedster (WebmasterWorld Senior Member, Top Contributor of All Time, 10+ Year Member)



In the earlier days of the web, the tilde character (~) was explicitly NOT allowed in a url: RFC 1738 (1994) classed it as "unsafe", so it had to be encoded as %7E. This restriction was later relaxed (RFC 2396, in 1998, put the tilde among the unreserved characters), and Google will index a url that contains a tilde.

It's been quite a few years since the absolute prohibition on using tilde in a url was relaxed, and standardization has moved ahead rapidly. But it's still a bad idea, for many reasons - here are a few:

1. Although today's Google may handle the tilde, this doesn't mean that other programs will not have trouble. I'm not sure about the most recent versions, but Adobe's PDF Reader used to choke on urls that included tildes.

2. Log analysis software also comes to mind. It will probably be OK if character conversion is performed at the browser or server level first, as happens with "most" of these apps. Otherwise, who knows.

3. Webmasters actually typing your urls, rather than doing a copy/paste, may also not get it right. For one thing, not all keyboards even HAVE the stand-alone tilde character and these will require keystroke combinations. This could cost you backlinks.

4. The tilde is not widely known as a stand-alone character, but only as a diacritic mark placed ABOVE a basic character in some languages -- such as the widely recognized Spanish (ñ) character. If your link is lucky enough to get a press mention in a newspaper or magazine, a stand-alone tilde might well be typeset as a hyphen.

5. The tilde is among the ASCII characters whose positions are sometimes replaced by regional/national alphabet letters. For example, the code position that tilde is assigned in international ASCII has u umlaut (ü) in several variants of ASCII, the German sharp "s" (ß) in German ASCII, etc. So beyond the keyboard problems I mentioned earlier, this oddity can sometimes cause incorrect representations on both screen and paper.

Even the %7e substitution can be a problem. When a % is handwritten, it might later look like an ampersand (&) for instance.
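Since the same address can turn up spelled with a literal ~, with %7E, or with %7e, anything that counts or compares urls (log analysis, link checking) may want to fold those spellings together first. Here is a minimal Python sketch of that idea. The function name normalize_tilde and the sample urls are made up for illustration, and it only touches the tilde, not any other percent-encoded character.

```python
import re

# Matches a percent-encoded tilde in either case: %7E or %7e.
_ENCODED_TILDE = re.compile(r"%7e", re.IGNORECASE)


def normalize_tilde(url: str) -> str:
    """Return url with every %7E / %7e replaced by a literal tilde.

    Hypothetical helper for illustration only; other percent-encoded
    characters are left exactly as they are.
    """
    return _ENCODED_TILDE.sub("~", url)


# Two spellings of the same address, as they might appear in a log file.
urls = [
    "http://www.example.com/d~c-home_appliances~b-96648.aspx",
    "http://www.example.com/d%7Ec-home_appliances%7eb-96648.aspx",
]

# After normalization both entries collapse into a single url.
normalized = {normalize_tilde(u) for u in urls}
print(normalized)
```

Run as is, the set should end up holding just one url, the one written with literal tildes.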

12:06 pm on Nov 17, 2008 (gmt 0)

5+ Year Member



What do you suggest then? Would a url rewrite solve this type of problem, or do you suggest some other means?

11:05 pm on Nov 18, 2008 (gmt 0)

WebmasterWorld Senior Member 5+ Year Member



We use the tilde and haven't had any issues, but then again we're a 12-year-old site.

2:45 am on Nov 19, 2008 (gmt 0)

g1smd (WebmasterWorld Senior Member, Top Contributor of All Time, 10+ Year Member, Top Contributors Of The Month)



The tilde is heavily used in folder names on many old-skool sites with multiple user accounts - especially in the .edu area. There are no real issues other than it appearing as %7E in many URLs.

2:07 pm on Nov 20, 2008 (gmt 0)

10+ Year Member



It looks to me as though Google treats the tilde the same way as a hyphen; in other words, it views it as a space between two words.
 
