
Re^2: PDF File Merging Data

by LanX (Chancellor)
on Dec 22, 2010 at 20:54 UTC

in reply to Re: PDF File Merging Data
in thread PDF File Merging Data

>I tried figuring out how pdf::reuse works, even looked at the PDF::Reuse::Tutorial, and have no clue how you could find the tags and replace them on the fly using that.

I can't say much about this. The docs seem to talk mostly about form fields and mention JavaScript, so it might not be what the OP wanted.

Anyway the OP was talking about "replacing placeholders".

Given a version of the PDF in which every fillable area is completely filled with blind text (dummy filler) set in a dedicated font, pdftohtml -xml could easily be used to parse the file and grep the absolute positions of these text boundaries.
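
A minimal sketch of that parsing step, written in Python only for illustration (the same logic ports directly to Perl). It assumes the XML has the usual `pdftohtml -xml` shape, with `<text>` elements carrying `top`, `left`, `width`, `height` and `font` attributes. The sample document, the font id "1" and the function name `placeholder_boxes` are made up for this example.

```python
import xml.etree.ElementTree as ET

# Hand-written sample in the shape `pdftohtml -xml` typically emits.
# The content and the choice of font id "1" for the filler are assumptions.
SAMPLE = """<pdf2xml>
<page number="1" position="absolute" top="0" left="0" height="1263" width="892">
<fontspec id="0" size="12" family="Times" color="#000000"/>
<fontspec id="1" size="12" family="Courier" color="#000000"/>
<text top="100" left="80" width="200" height="14" font="0">Dear customer,</text>
<text top="130" left="80" width="300" height="14" font="1">XXXXXXXXXXXXXXXXXXXX</text>
<text top="150" left="80" width="300" height="14" font="1">XXXXXXXXXXXXXXXXXXXX</text>
</page>
</pdf2xml>
"""

def placeholder_boxes(xml_text, font_id):
    """Return one dict per <text> element set in the dedicated filler font."""
    root = ET.fromstring(xml_text)
    boxes = []
    for page in root.iter("page"):
        for text in page.iter("text"):
            if text.get("font") != font_id:
                continue  # ordinary document text, not a placeholder
            boxes.append({
                "page": int(page.get("number")),
                "left": int(text.get("left")),
                "top": int(text.get("top")),
                "width": int(text.get("width")),
                "height": int(text.get("height")),
                # itertext() also collects text nested in <b> or <i>
                "text": "".join(text.itertext()),
            })
    return boxes

for box in placeholder_boxes(SAMPLE, "1"):
    print(box)
```

Note that these coordinates are measured from the top left corner of the page in the converter's own units, while PDF drawing coordinates start at the bottom left, so they need flipping and rescaling (the `<page>` element reports the height and width to scale against) before being used to place text.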

And plotting text at absolute positions is trivial with PDF::Reuse (into a copy of the PDF without the blind text, of course).

But as I already said, one has to take care of line breaks and of staying within the bounding boxes, because PDF is not HTML, it's a print format!
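
To illustrate the kind of check this means, here is a rough sketch (again Python, purely for illustration) that wraps a replacement string into a box and refuses it when it does not fit. It assumes a monospaced font such as Courier, whose glyphs are 0.6 em wide, so the width of a line is simply the character count times 0.6 times the font size. The function name `fit_text` and its parameters are hypothetical; a proportional font would need real font metrics instead.

```python
import textwrap

COURIER_EM_FRACTION = 0.6  # Courier glyphs are 600/1000 em wide

def fit_text(text, box_width, box_height, font_size, leading):
    """Wrap text for a box given in points; raise ValueError on overflow."""
    char_width = COURIER_EM_FRACTION * font_size
    # Small epsilon guards against floating point rounding at exact fits.
    chars_per_line = int(box_width / char_width + 1e-9)
    if chars_per_line < 1:
        raise ValueError("box is narrower than one character")
    lines = textwrap.wrap(text, width=chars_per_line)
    max_lines = int(box_height / leading + 1e-9)
    if len(lines) > max_lines:
        raise ValueError(
            f"text needs {len(lines)} lines, box holds only {max_lines}"
        )
    return lines

# 120 pt wide box at 10 pt Courier holds 20 characters per line.
for line in fit_text("The quick brown fox jumps over the lazy dog",
                     box_width=120, box_height=36, font_size=10, leading=12):
    print(line)
```

Each returned line would then be placed one `leading` below the previous one, starting at the top of the box. What to do on overflow (shrink the font, truncate, or reject the input) is a layout decision the template author has to make.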

see also: Parsing PDFs by text position?

Cheers Rolf
