Google News Archive Forum

    
Google and Special Chars
Trax




msg:95362
 6:15 pm on Apr 26, 2004 (gmt 0)

How does GoogleBot treat sites like www.ä.com or www.δ.com?

I was offered a nice domain using such language-specific chars. However, I get redirected to VeriSign first. Does anyone know how Google crawls those?

Thanks

 

cabbie




msg:95363
 3:22 am on Apr 27, 2004 (gmt 0)

As far as I know, VeriSign has talked about supporting special characters in .com and .net domains but hasn't done so yet, which is why they resolve back to VeriSign.
Also, I don't think # or & are allowed in domain names.
However, .info TLDs are about to allow registration of special characters: https://www.domainregistry.de/multilingualdomains2.html
I also know that .nu domains support some Western European characters, but all the ones I have seen have a grey bar in Google's toolbar, which suggests to me that Google has trouble reading them, e.g.:
[omvärlden.nu...]

GoogleGuy




msg:95364
 6:25 am on Apr 27, 2004 (gmt 0)

Cool. I learn something new every day. If the url escaping uses hyphens, 0-9, a-z, or A-Z, then you should be quite golden. Other special characters or escaping methods might work too, but I can believe that it would be less likely.
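As a rough illustration of the "hyphens, 0-9, a-z, or A-Z" rule of thumb, here is a minimal Python sketch that checks whether a hostname consists only of those characters. The function name `is_ldh_hostname` and the label-length limit applied here are my own assumptions for the example, not anything Google has confirmed about its crawler.

```python
import re

# One DNS label: 1-63 letters, digits or hyphens, not starting or ending with a hyphen.
_LDH_LABEL = re.compile(r"(?!-)[A-Za-z0-9-]{1,63}(?<!-)")

def is_ldh_hostname(hostname: str) -> bool:
    """Return True if every dot-separated label uses only a-z, A-Z, 0-9 and hyphens."""
    labels = hostname.rstrip(".").split(".")
    return all(_LDH_LABEL.fullmatch(label) is not None for label in labels)

print(is_ldh_hostname("www.example.com"))  # True
print(is_ldh_hostname("www.ä.com"))        # False
```

A hostname that fails this check would first need converting to an ASCII-only form before it could be used in the DNS.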

adfree




msg:95365
 7:50 am on Apr 27, 2004 (gmt 0)

Special characters are still an issue with IE as well. As long as the top browser does not support them (and MS does not seem to move that way), there won't be too many developers owning and using them, ergo: G and others will not feel any heat on the topic.

Just my $0.02

Trax




msg:95366
 8:02 am on Apr 27, 2004 (gmt 0)

sad :(
The omvärlden.nu example proves me right.
Sadly, the domain is useless to me if Google isn't able to read it.

GG: any chance of changes at Google in the next few months regarding those chars?

zgb999




msg:95367
 8:29 am on Apr 27, 2004 (gmt 0)

I guess Google will treat those special characters in a domain correctly one day, but keep in mind that although those domains are being sold now, most people won't be able to use them with their browsers.

This will probably change in the coming years, but right now those domains are not what they promise to be.

cabbie




msg:95368
 9:55 am on Apr 27, 2004 (gmt 0)

If I understand GoogleGuy correctly, how Google reads the domain depends on how the characters are translated for resolving.

At present, I think, a domain with special characters must first be converted into a normal character string without special characters. I think the powers that be are working on an industry standard.
Here is a link to what's involved in IDNs:
[verisign.com...]

takagi




msg:95369
 10:47 am on Apr 27, 2004 (gmt 0)

See also the thread: æøå domains - When will it be safe to start using non-english characters? [webmasterworld.com]

tschild




msg:95370
 11:20 am on Apr 27, 2004 (gmt 0)

At present, I think, a domain with special characters must first be converted into a normal character string without special characters. I think the powers that be are working on an industry standard.

That's already standardized: the encoding is Punycode (RFC 3492), which IDNA (RFC 3490) builds on. IDNs cannot work without this ASCIIfied representation (the ACE string), as the DNS uses it as the "real" domain name. For example, www.ä.com is "really" (i.e. as far as the DNS is concerned) www.xn--4ca.com. If you use this representation in links etc., all browsers can use the domain.
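To see the conversion for yourself, here is a small Python sketch using the `idna` codec from Python's standard library, which implements the 2003 IDNA rules on top of Punycode. The helper names `to_ace` and `from_ace` are made up for this example.

```python
def to_ace(hostname: str) -> str:
    """Convert a Unicode hostname to its ASCII (ACE) form, label by label."""
    # The built-in "idna" codec adds the "xn--" prefix to each non-ASCII label.
    return hostname.encode("idna").decode("ascii")

def from_ace(hostname: str) -> str:
    """Convert an ACE hostname back to its Unicode form."""
    return hostname.encode("ascii").decode("idna")

print(to_ace("www.ä.com"))          # www.xn--4ca.com
print(from_ace("www.xn--4ca.com"))  # www.ä.com
```

Plain ASCII hostnames pass through unchanged, so the ACE form is safe to use in links regardless of whether the browser understands IDNs.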

GoogleGuy




msg:95371
 2:59 pm on Apr 27, 2004 (gmt 0)

I believe what tschild said makes a lot of sense. A "special" character gets escaped into a longer string of normal characters. I think we can handle those domains just fine, but I'm not positive. The next time I see someone who works on i18n, I'll ask them to check out this thread though. :)

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved