PerlMonks  

Re^2: Convert PDF file into HTML file

by elef (Friar)
on Dec 22, 2010 at 12:23 UTC (#878497)


in reply to Re: Convert PDF file into HTML file
in thread Convert PDF file into HTML file

Well said.

This probably won't be of any use, but here goes anyway: pdftotext (part of the xpdf PDF viewer) can programmatically convert PDF to "formatted" text. All it takes is system("pdftotext -layout -enc UTF-8 \"$infile\" \"$outfile\""); It approximates the original layout by inserting spaces in the text output.
As you need HTML, you're probably better off with pdf2svg; this is just a note in case pdf2svg fails or whatever.
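
For what it's worth, here is a slightly more defensive sketch of the same pdftotext call. It assumes pdftotext is installed and on your PATH; the sub names (pdftotext_command, pdf_to_text) and the file names in the usage comment are made up for illustration. It uses the list form of system, which skips the shell, so file names with spaces or quotes need no escaping.

```perl
use strict;
use warnings;

# Build the argument list for pdftotext.
# -layout keeps the original physical layout; -enc sets the output encoding.
sub pdftotext_command {
    my ($infile, $outfile) = @_;
    return ('pdftotext', '-layout', '-enc', 'UTF-8', $infile, $outfile);
}

# Run the conversion (hypothetical helper, not part of any module).
# Dies if pdftotext cannot be started or exits with a non-zero status.
sub pdf_to_text {
    my ($infile, $outfile) = @_;
    my @cmd = pdftotext_command($infile, $outfile);
    # List-form system() bypasses the shell, so no manual quoting is needed.
    system(@cmd) == 0
        or die "pdftotext failed for '$infile' (status $?)\n";
    return $outfile;
}

# Example usage (hypothetical file names):
# pdf_to_text('report.pdf', 'report.txt');
```

Checking the return value of system matters here, since a missing or failing pdftotext would otherwise go unnoticed and leave you with no output file.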
