No such thing as a small change | |
PerlMonks |
Re^2: Example Of Using CAM::PDF Like HTML::TokeParserby Limbic~Region (Chancellor) |
on Oct 11, 2011 at 13:17 UTC ( [id://930807]=note: print w/replies, xml ) | Need Help?? |
Anonymous Monk,
If you are referring to the non-existant PDF parser that this thread is about, then no. The internal structure of a PDF wouldn't lend itself to XPath diving. If you are referring to the way I go about creating an parser using HTML::TokeParser then the answer is "it depends". Node traversal is usually the last tool in the box I reach for. I am not even opposed to using regular expressions (*gasp*) if each page is consistent enough. It all depends on how consistent one page is to the next. Cheers - L~R
In Section
Seekers of Perl Wisdom
|
|