Forum Moderators: bakedjake

Message Too Old, No Replies

PS/PDF to HTML

         

dotancohen

12:59 pm on Dec 25, 2005 (gmt 0)

10+ Year Member



Hi all, I need a program that will convert PS and PDF to HTML. I run Fedora Core 4 at home with KDE. Thanks.

Dotan Cohen

[edited by: bakedjake at 9:33 pm (utc) on Dec. 25, 2005]
[edit reason] no sigs, thanks [/edit]

dotancohen

6:32 pm on Dec 30, 2005 (gmt 0)

10+ Year Member



Bump for something important.

Is there no way to convert PDF (and GS) to HTML on linux?!? I'm looking to get _away_ from windows- not to go back!

brakthepoet

8:41 pm on Dec 30, 2005 (gmt 0)

10+ Year Member



Have you tried pdftohtml?
[sourceforge.net ]

It should be available in the Fedora Extras repository (yum search pdftohtml). Haven't used it myself, but it's probably worth a shot.

dotancohen

1:02 pm on Jan 1, 2006 (gmt 0)

10+ Year Member



Thank you, I jsut tried it. No good at all- the HTML it produced was just a bunch of numbers, each on it's own line. That may be because the PDF was in Hebrew?

Dotan Cohen

brakthepoet

9:26 pm on Jan 2, 2006 (gmt 0)

10+ Year Member



You will probably need to set the encoding option. While the program does not have a man page, starting the program without any options present will list the possible options. Just type pdftohtml and hit enter.

I assume that UTF8 is what you need, but I don't have much understanding of the various encoding options. I'll presume that you have a better understanding of Hebrew and the encoding that you need.


pdftohtml -enc UTF8 OLDFILE.pdf NEWFILE.html