Forum Moderators: open

Message Too Old, No Replies

Special characters.

How does Google handle files that are almost completly special characters?

         

Jesse_Smith

5:45 am on Jun 19, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



How does Google handle pages that are almost completly special characters using &####; codes to create the characters.....

וַיֹּאמֶר יְהוָה, אֶלאַהֲרֹן, אַתָּה

bufferzone

5:02 pm on Jun 19, 2004 (gmt 0)

10+ Year Member



In my view, badly. Google, and a lot of other SE’, have problems with ordinary dynamic linking, the links you describe would not help the SE. or I would be very surprised. If you read the Google help pages, you will se that they actively limits the number of pages the bot will index if the site is dynamic. This to prevent eternal loops and overloading the bot. I think the same would hold true for your type of links.

If you want to do well in Google I would advice that you find a way to change these links to something more standard

WebWalla

5:27 pm on Jun 19, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I don't see what it has to do with dynamic pages. If you do a search for the first line of characters you mention, Google brings up several pages no problems in what I take to be hebrew but may of course be completely wrong.

I don't think you'll have problems.

Abdelrhman Fahmy

5:29 pm on Jun 19, 2004 (gmt 0)

10+ Year Member



I think Jesse dosen't mean the special characters in the links URL ,it's about using the corresponding Unicode values from characters table to compose the html page content ,I think it's not a problem as long as all the web browsers parse it and read it as a normal characters so Google and other search engines should read it as a normal characters also
you may search Google for any of your characters ו [google.com] and you'll find a lot of pages indexed has the character in it's source code while in the viewers seen code you'll find the corresponding character.

Abdelrhman,

Jesse_Smith

6:20 pm on Jun 19, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Not the URL, but the content being almost completly text, the most populer book in the histery of the world in a different language, that has the whole text created using those codes. The URL, title, header and footer is 100% normal characters, while all the content uses the special characters.

*Looks at search.* Yep, they get indexed! Though I'm guessing the odds are one in a million of some one making a search with that character code or the actual text in that language. When you search with the actual text it get's turned into the other code and gives out? in the title and

The summary for this Hebrew page contains characters that cannot be correctly displayed in this language/character set.

in the description, in the English Google.