|
|
| Don't ask to ask, just ask | |
| PerlMonks |
Read PDF files & do regex through Perl.by an_ordinary_man (Initiate) |
| on Feb 11, 2002 at 18:03 UTC ( [id://144729]=perlquestion: print w/replies, xml ) | Need Help?? |
This is an archived low-energy page for bots and other anonmyous visitors. Please sign up if you are a human and want to interact.an_ordinary_man has asked for the wisdom of the Perl Monks concerning the following question:
Hi All. I want to extract information given in annotations (comments) inside PDF files, using regex. When I open the PDF file in EditPlus it shows me a large file with about 10,000 lines containing the data I need & also lots of text that looks like junk, but when I read it through Perl it just reads about 29 lines & lots of the junk from PDF is missing. I tried the texttopdf modules availabe, but they do not give me the contents in the annotations. Please tell me how can I read the whole file line by line (if possible). Regards. An_Ordinary_Man
Back to
Seekers of Perl Wisdom
|
|
||||||||||||||||||||||||||||||||||