I need to get several tables out of a large PDF document and do some processing on them. The processing is easy enough, but I'm thoroughly stumped on how to access the tables in the first place. Modules and command-line tools for creating PDFs abound. Getting the data back out, not so much.
I've checked CPAN and found the PDF and Text::PDF modules, but both are pretty sparse on documentation. I think one or both may be able to do it, but if the docs don't make clear whether they can do it, they're even less helpful in figuring out how.
Any suggestions on how I might be able to accomplish this?