View Full Version : How to convert PDF to DOC or HTML?
Delphi123
May 30th, 2007, 06:29 PM
Dear friends:
I need to convert a PDF document into MSWord Doc or HTML? How do I do this?
I own a copy of Adobe Acrobat version 5 (the program, not just the reader), but unless I am mistaken, there is no option in it to convert a PDF document into MSWord, RTF or HTML. How do I do this, please? What free program or web service would you recommend?
Thank you.
Benjamin
rockboy
May 30th, 2007, 07:19 PM
Send it to yourself as an attachment using Gmail. When received, open the attachment as HTML. Save the new window as HTML.
http://mail.google.com/support/bin/answer.py?answer=30719&topic=1517
If you don't have Gmail you can now open a new account without an invitation.
Delphi123
May 30th, 2007, 09:03 PM
Dear rockboy:
Thanks so much for the tip. Unfortunately, the HTML document I got back was gibberish. I tried different Cyrillic encodings, but it still yielded only gibberish.
But there is a better solution offered by Adobe itself. In fact, they offer several free PDF to HTML and PDF to TXT conversion services. Click below:
http://www.adobe.com/products/acrobat/access_onlinetools.html
Unfortunately, there is a file size limit.
Benjamin
Delphi123
May 30th, 2007, 09:27 PM
Dear friends:
Here is a better solution from PDF995:
http://www.software995.com/
Be sure to download all four PDF files, including the PDF Free Converter.
This program is free. It has a nag screen but it is legally free.
Benjamin
Delphi123
May 30th, 2007, 09:57 PM
Dear friends:
An explanation is in order:
The Russian Cyrillic text in PDF format is NOT in contemporary Russian orthography. It is in pre-Revolutionary (i.e. pre-1917) orthography. This is the reason why any attempt to convert the PDF to HTML or Doc or TXT ( as the recent Acrobat Reader v. 8 does -- i.e. PDF to TXT) results in distorted, computer garbage. I may just have to print it out and edit it and then save it as HTML.
Benjamin
smurfy
May 30th, 2007, 10:02 PM
Benjamin.
What happens if you open in Acrobat 5 and select all, copy and paste into Word?
Delphi123
May 30th, 2007, 11:26 PM
Dear smurfy:
Nope, the copy and paste works fine, no question about it. The problem is that what is copied and pasted remains in the pre-Revolutionary Russian orthography (which included certain letters that are not used today and others that have since been eliminate). The result: computer gibberish.
The only solution, short of retyping, is to convert the pre-Revolutionary Russian text to post-Revolutionary Russian. Then, you can do everything.
Fortunately, the text in question is rather short: only six pages. So, I have already started retyping it and will have it done within an hour or two. Better that than spending hours looking for this special kind of code converter. Naturally, if I have tons of PDF texts in this orthography, I will pursue the conversion issue further.
Thank you all.
Benjamin