Beefy Boxes and Bandwidth Generously Provided by pair Networks
Just another Perl shrine

Re^2: Convert PDF file into HTML file

by elef (Friar)
on Dec 22, 2010 at 12:23 UTC ( #878497=note: print w/replies, xml ) Need Help??

in reply to Re: Convert PDF file into HTML file
in thread Convert PDF file into HTML file

Well said.

This probably won't be any use, but here it goes anyway: pdftotext (part of the xpdf pdf viewer) can programmatically convert pdf to "formatted" txt. All it takes is system (\"pdftotext -layout -enc UTF-8 \"$infile\" \"$outfile\"") It approximates the original layout by inserting spaces in the txt.
As you need HTML, you're probably better off with pdf2svg, this is just a note in case pdf2svg fails or whatever.

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://878497]
and all is quiet...

How do I use this? | Other CB clients
Other Users?
Others scrutinizing the Monastery: (3)
As of 2018-03-18 22:05 GMT
Find Nodes?
    Voting Booth?
    When I think of a mole I think of:

    Results (231 votes). Check out past polls.