
Re^2: Convert PDF file into HTML file

by elef (Friar)
on Dec 22, 2010 at 12:23 UTC (#878497)

in reply to Re: Convert PDF file into HTML file
in thread Convert PDF file into HTML file

Well said.

This probably won't be of any use, but here goes anyway: pdftotext (part of the xpdf PDF viewer) can programmatically convert PDF to "formatted" text. All it takes is system("pdftotext -layout -enc UTF-8 \"$infile\" \"$outfile\""). It approximates the original layout by inserting spaces in the text output.
As you need HTML, you're probably better off with pdf2svg; this is just a note in case pdf2svg fails or whatever.
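
For what it's worth, here is a slightly more defensive sketch of the same pdftotext call. It uses the list form of system, so file names containing spaces or quotes need no manual escaping, and it checks the exit status. The sub names pdftotext_command and pdf_to_text and the file names in the usage comment are made up for illustration, and it assumes pdftotext is installed and on your PATH.

```perl
use strict;
use warnings;

# Build the argument list for pdftotext (hypothetical helper name).
# -layout keeps the approximate page layout, -enc UTF-8 sets the output encoding.
sub pdftotext_command {
    my ($infile, $outfile) = @_;
    return ('pdftotext', '-layout', '-enc', 'UTF-8', $infile, $outfile);
}

# Run pdftotext and die with a message if it cannot be started or fails.
sub pdf_to_text {
    my ($infile, $outfile) = @_;
    my @cmd = pdftotext_command($infile, $outfile);

    # List form of system: the arguments bypass the shell entirely.
    my $rc = system(@cmd);
    if ($rc == -1) {
        die "could not run pdftotext: $!\n";
    }
    elsif ($rc != 0) {
        die "pdftotext failed, exit status " . ($? >> 8) . "\n";
    }
    return $outfile;
}

# Usage (placeholder file names):
# pdf_to_text('input.pdf', 'output.txt');
```

The list form is the main point here: the original one-liner goes through the shell, so a file name with a double quote or a dollar sign in it could break the command.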
