in reply to Re^4: CAM::PDF did't extract all pdf's content
in thread CAM::PDF did't extract all pdf's content

Try xpdf: http://www.foolabs.com/xpdf/ (GPL) Converts pdf to i.e. text/html/xml and seems to handle font subsets well.
  • Comment on Re^5: CAM::PDF did't extract all pdf's content