in reply to Re: Convert PDF file into HTML file
in thread Convert PDF file into HTML file
Well said.
This probably won't be any use, but here it goes anyway: pdftotext (part of the xpdf pdf viewer) can programmatically convert pdf to "formatted" txt. All it takes is system (\"pdftotext -layout -enc UTF-8 \"$infile\" \"$outfile\"") It approximates the original layout by inserting spaces in the txt.
As you need HTML, you're probably better off with pdf2svg, this is just a note in case pdf2svg fails or whatever.
This probably won't be any use, but here it goes anyway: pdftotext (part of the xpdf pdf viewer) can programmatically convert pdf to "formatted" txt. All it takes is system (\"pdftotext -layout -enc UTF-8 \"$infile\" \"$outfile\"") It approximates the original layout by inserting spaces in the txt.
As you need HTML, you're probably better off with pdf2svg, this is just a note in case pdf2svg fails or whatever.
|
---|
In Section
Seekers of Perl Wisdom