
How to convert PDF to DOC or HTML?


Delphi123
May 30th, 2007, 06:29 PM
Dear friends:

I need to convert a PDF document into MS Word DOC or HTML. How do I do this?

I own a copy of Adobe Acrobat version 5 (the program, not just the reader), but unless I am mistaken, there is no option in it to convert a PDF document into MSWord, RTF or HTML. How do I do this, please? What free program or web service would you recommend?

Thank you.

Benjamin

rockboy
May 30th, 2007, 07:19 PM
Send it to yourself as an attachment using Gmail. When it arrives, open the attachment as HTML, then save the page that opens in the new window as an HTML file.
http://mail.google.com/support/bin/answer.py?answer=30719&topic=1517

If you don't have a Gmail account, you can now open one without an invitation.

Delphi123
May 30th, 2007, 09:03 PM
Dear rockboy:

Thanks so much for the tip. Unfortunately, the HTML document I got back was gibberish. I tried different Cyrillic encodings, but it still yielded only gibberish.
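
For anyone who wants to repeat the encoding trial in a script rather than by hand, here is a minimal sketch. It assumes the garbled text is available as raw bytes in one of the common legacy Cyrillic encodings; the candidate list, the function names, and the scoring rule (share of lowercase Russian letters) are my own illustrative choices, not anything Gmail or Acrobat provides. It may not help at all if the PDF's embedded font uses a non-standard character mapping.

```python
# Candidate encodings to try (an illustrative list, not exhaustive).
CANDIDATES = ["utf-8", "cp1251", "koi8_r", "cp866", "iso8859_5"]


def lowercase_russian_share(text):
    """Share of non-space characters that are lowercase Russian letters.

    Ordinary Russian prose is mostly lowercase, while a wrong Cyrillic
    decoding tends to flip case or produce symbols, so a higher share
    suggests a more plausible decoding. This is a heuristic only.
    """
    chars = [ch for ch in text if not ch.isspace()]
    if not chars:
        return 0.0
    hits = sum(1 for ch in chars if "\u0430" <= ch <= "\u044f" or ch == "\u0451")
    return hits / len(chars)


def rank_decodings(raw):
    """Decode raw bytes with each candidate and sort by plausibility."""
    results = []
    for enc in CANDIDATES:
        try:
            text = raw.decode(enc)
        except UnicodeDecodeError:
            continue  # this encoding cannot represent these bytes
        results.append((lowercase_russian_share(text), enc, text))
    results.sort(key=lambda item: item[0], reverse=True)
    return results


if __name__ == "__main__":
    sample = "Привет, мир".encode("cp1251")
    for score, enc, text in rank_decodings(sample):
        print(f"{enc:10s} {score:.2f} {text}")
```

If every candidate scores poorly, the problem is probably not a simple encoding mismatch.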

But there is a better solution offered by Adobe itself. In fact, they offer several free PDF to HTML and PDF to TXT conversion services. Click below:

http://www.adobe.com/products/acrobat/access_onlinetools.html

Unfortunately, there is a file size limit.

Benjamin

Delphi123
May 30th, 2007, 09:27 PM
Dear friends:

Here is a better solution from PDF995:

http://www.software995.com/

Be sure to download all four PDF files, including the PDF Free Converter.

This program is legally free to use, although it displays a nag screen.

Benjamin

Delphi123
May 30th, 2007, 09:57 PM
Dear friends:

An explanation is in order:

The Russian Cyrillic text in the PDF is NOT in contemporary Russian orthography. It is in pre-Revolutionary (i.e., pre-1917) orthography. This is why any attempt to convert the PDF to HTML, DOC, or TXT (including the PDF-to-TXT export in the recent Acrobat Reader 8) produces distorted computer garbage. I may just have to print it out, edit it, and then save it as HTML.

Benjamin

smurfy
May 30th, 2007, 10:02 PM
Benjamin,
What happens if you open it in Acrobat 5, select all, and copy and paste into Word?

Delphi123
May 30th, 2007, 11:26 PM
Dear smurfy:

Nope, the copy and paste works fine, no question about it. The problem is that what is copied and pasted remains in the pre-Revolutionary Russian orthography, which included certain letters that have since been eliminated and are not used today. The result: computer gibberish.

The only solution, short of retyping, is to convert the pre-Revolutionary Russian text to post-Revolutionary Russian. Then, you can do everything.

Fortunately, the text in question is rather short: only six pages. So, I have already started retyping it and will have it done within an hour or two. Better that than spending hours looking for this special kind of code converter. Naturally, if I have tons of PDF texts in this orthography, I will pursue the conversion issue further.
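
For reference, the core of such a converter can be sketched in a few lines. This is only a rough illustration under a big assumption: the text must already be correctly encoded as Unicode, which was not the case for the pasted text in this thread. The names `LETTER_MAP`, `FINAL_HARD_SIGN`, and `modernize` are hypothetical, and the sketch handles only the retired letters and the word-final hard sign, not the changes the reform made to grammatical endings.

```python
import re

# Letters retired by the orthography reform, mapped to modern equivalents.
LETTER_MAP = str.maketrans({
    "\u0463": "\u0435",  # yat (lowercase) -> ye
    "\u0462": "\u0415",  # yat (uppercase) -> Ye
    "\u0456": "\u0438",  # decimal i (lowercase) -> i
    "\u0406": "\u0418",  # decimal i (uppercase) -> I
    "\u0473": "\u0444",  # fita (lowercase) -> ef
    "\u0472": "\u0424",  # fita (uppercase) -> Ef
    "\u0475": "\u0438",  # izhitsa (lowercase) -> i
    "\u0474": "\u0418",  # izhitsa (uppercase) -> I
})

# A hard sign at the very end of a word, directly after a Cyrillic letter.
# A hard sign in the middle of a word is left alone.
FINAL_HARD_SIGN = re.compile(r"(?<=[\u0430-\u044f\u0451])\u044a\b", re.IGNORECASE)


def modernize(text):
    """Replace retired letters, then drop word-final hard signs."""
    text = text.translate(LETTER_MAP)
    return FINAL_HARD_SIGN.sub("", text)


if __name__ == "__main__":
    for word in ["міръ", "хлѣбъ", "объявленіе"]:
        print(word, "->", modernize(word))
```

The output would still need proofreading by hand, since spelling rules for some endings and prefixes also changed.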

Thank you all.

Benjamin