Greek problems

I can find no way to copy and paste Greek text

         

jetteroheller

5:55 pm on Feb 6, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I have a Microsoft Word document in Greek.

I have to copy the content into my CMS (content management system).

My CMS is Perl-based and uses MSIE as its GUI.

So I copy from the document and paste it into a TEXTAREA of my CMS.
Since I added

<meta http-equiv="Content-Type" content="text/html; charset=windows-1253">

the pasted content in the TEXTAREA looks Greek.

The next step is that my CMS reads the content of the TEXTAREA and puts it in the database.

After this, I rebuild the screen by assigning a new innerHTML.
I put the just-copied content back into the TEXTAREA, but now it no longer looks Greek.

Any idea how to solve my Greek problem?

appi2

6:05 pm on Feb 6, 2007 (gmt 0)

10+ Year Member



Clutching at straws, but maybe your database charset?

jetteroheller

8:30 pm on Feb 6, 2007 (gmt 0)

I tested the same thing with Notepad:

<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=windows-1253">
</head>

<body>

te?a?dat?

<textarea>
</textarea>

</body>
</html>

The strange te?a?dat? should be Greek text.
When I open this as a page in MSIE, I can paste the same text from the clipboard into the TEXTAREA and it looks Greek.

But I have no idea how to read the TEXTAREA so that the text remains Greek.

bcolflesh

8:44 pm on Feb 6, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



What database? You'll probably want to store the text data as Unicode for it to work the way you are expecting.
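
A minimal sketch of why the storage encoding matters, written in Python rather than the thread's Perl so it can be run on its own. The sample word is illustrative, and nothing here is the poster's actual CMS code: it only shows that bytes written as windows-1253 turn into accented Latin letters when read with a different code page, while UTF-8 round-trips cleanly.

```python
# Illustrative sample word; any Greek text behaves the same way.
text = "ύδατος"

greek_bytes = text.encode("cp1253")  # bytes as a windows-1253 page would carry them
utf8_bytes = text.encode("utf-8")    # bytes as a Unicode (UTF-8) column would hold them

# Reading windows-1253 bytes with the wrong code page yields Latin look-alikes.
wrong = greek_bytes.decode("cp1252")

# Decoding with the matching code page, or using UTF-8 end to end, is lossless.
right = greek_bytes.decode("cp1253")
roundtrip = utf8_bytes.decode("utf-8")

print(repr(wrong))
print(right, roundtrip)
```

The design point is that the encoding used to write the bytes has to be known, and reused, when reading them back; storing Unicode removes the need to track a per-language code page.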

jetteroheller

9:09 pm on Feb 6, 2007 (gmt 0)

I simply read the textarea,
named for example "display_description", in Perl:

$value = $main::window->document->all->display_description->{'Value'};

But the resulting string is not Greek.
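
One possible explanation, which the thread does not confirm, is that the value is converted to a non-Greek code page somewhere between MSIE and Perl. The Python sketch below (not the poster's Perl) shows that such a conversion is lossy: characters with no equivalent in the target code page become "?" and cannot be recovered afterwards.

```python
# Illustrative sample word; the conversion target (cp1252) is an assumption.
text = "ύδατος"

# Forcing Greek text into a Western European code page replaces every
# character that has no equivalent with "?".
lossy = text.encode("cp1252", errors="replace").decode("cp1252")

# A code page that contains Greek letters keeps the text intact.
intact = text.encode("cp1253").decode("cp1253")

print(lossy)
print(intact)
```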

LifeinAsia

9:41 pm on Feb 6, 2007 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



MS Office products are notoriously bad for cutting/pasting "non-standard" content into non-Office applications.

What we usually have to do for the Asian languages we work with is to first use the "Save As" function in Word and save the file as type "Encoded Text", which saves it as a text file with the proper encoding for that character set. You should then be given a list of options for which encoding format to use. (You may need to try several types before you find what works best.) After you save the file, close it, then open it with Notepad. Copy and paste text from that file into your form.
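
Once the file has been saved with a known encoding, it can also be converted by a script instead of being pasted by hand. A minimal Python sketch, assuming the file was saved as windows-1253; the file names and the `convert_to_utf8` helper are hypothetical, and the temporary file only stands in for Word's output.

```python
import tempfile
from pathlib import Path


def convert_to_utf8(src: Path, dst: Path, source_encoding: str = "cp1253") -> str:
    """Read src using source_encoding and write the same text to dst as UTF-8."""
    text = src.read_text(encoding=source_encoding)
    dst.write_text(text, encoding="utf-8")
    return text


with tempfile.TemporaryDirectory() as tmp:
    src = Path(tmp) / "greek_cp1253.txt"  # hypothetical name for the saved file
    dst = Path(tmp) / "greek_utf8.txt"    # hypothetical name for the converted copy
    src.write_bytes("ύδατος".encode("cp1253"))  # stand-in for Word's "Encoded Text" output
    converted = convert_to_utf8(src, dst)
    utf8_copy = dst.read_bytes().decode("utf-8")

print(converted)
```

If the wrong `source_encoding` is chosen, the text comes out as the wrong letters or raises a decoding error, which is the scripted equivalent of trying several encoding types in the "Save As" dialog.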

jetteroheller

5:48 pm on Feb 7, 2007 (gmt 0)

I can save the Word document as HTML. In this HTML file, everything is encoded like this:

&tau;&epsilon;&chi;&nu;&omicron;&lambda;&omicron;&gamma;&#943;&alpha; &#973;&delta;&alpha;&tau;&omicron;&sigmaf;
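
These are ordinary HTML character references, so any HTML-aware decoder can turn them back into Greek letters. A short Python sketch (again, not the poster's Perl) that decodes the exact line above with the standard library:

```python
import html

# The entity-encoded line from the saved HTML file, copied as-is.
encoded = (
    "&tau;&epsilon;&chi;&nu;&omicron;&lambda;&omicron;&gamma;&#943;&alpha; "
    "&#973;&delta;&alpha;&tau;&omicron;&sigmaf;"
)

# html.unescape resolves both named (&tau;) and numeric (&#943;) references.
decoded = html.unescape(encoded)

print(decoded)  # τεχνολογία ύδατος
```

Because the references are pure ASCII, they survive any code page unchanged, which is why this form displays correctly regardless of the charset declared on the page.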

LifeinAsia

5:57 pm on Feb 7, 2007 (gmt 0)

Saving directly from MS Word to HTML is way beyond evil- avoid it like the plague (unless that's the only way to get it to work)! The conversion adds all kinds of junk to the file.

Did you try my suggestion?

jetteroheller

3:28 pm on Feb 11, 2007 (gmt 0)

Saving directly from MS Word to HTML is way beyond evil- avoid it like the plague

My first idea was to extract only the character codes from the HTML file.

Now I have an even better way:

I select and copy the text in the Word document.

I open a new email and switch the format to Rich Text HTML.

I paste into the email.

I switch to "show source".

Now I have all the &#nnn; codes to copy into my database.
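
The same &#nnn; codes can be produced without the email detour once the text is available as Unicode. A minimal Python sketch, assuming the Greek text is already held as a proper Unicode string; `to_numeric_refs` is a hypothetical helper name, not part of any CMS.

```python
import html


def to_numeric_refs(text: str) -> str:
    """Replace every non-ASCII character with its decimal &#nnn; reference."""
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")


refs = to_numeric_refs("ύδατος")
print(refs)  # &#973;&#948;&#945;&#964;&#959;&#962;

# The references decode back to the original text.
restored = html.unescape(refs)
```

The result is plain ASCII, so it can be stored in a database column of any charset and written into a page through innerHTML without depending on the page's declared encoding.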