http://www.perlmonks.org?node_id=871802


in reply to How to extract image captions from a PDF file using perl

Normally captions have their own font setting (a different typeface, size, or style than the body text), which should help to identify them, especially when they are located near an image.
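
A rough sketch of that heuristic, just to illustrate the idea. It assumes you have already extracted the text runs with their font, size and position, plus the image bounding boxes, with whatever tool you use. The data layout, the names `body_style` and `find_captions`, and the 30 point gap are all made up for the example, and it only looks for captions below an image.

```perl
use strict;
use warnings;

# Hypothetical input: text runs already extracted from one page, each with
# its font name, font size and position. Coordinates are assumed to follow
# the PDF convention: origin at bottom-left, y grows upward.
my @runs = (
    { text => 'Body text above the figure.', font => 'Times-Roman',  size => 10, x => 72, y => 700 },
    { text => 'Figure 1: Sample output.',    font => 'Times-Italic', size => 8,  x => 90, y => 385 },
    { text => 'Body text below the figure.', font => 'Times-Roman',  size => 10, x => 72, y => 350 },
);

# Hypothetical image bounding boxes on the same page.
my @images = ( { x0 => 80, y0 => 400, x1 => 300, y1 => 650 } );

# Assume the most common font/size pair on the page is the body text.
sub body_style {
    my @page_runs = @_;
    my %count;
    $count{"$_->{font}/$_->{size}"}++ for @page_runs;
    my ($top) = sort { $count{$b} <=> $count{$a} || $a cmp $b } keys %count;
    return $top;
}

# A run is a caption candidate if its style differs from the body style,
# it starts within the horizontal extent of an image, and it sits no more
# than $max_gap points below that image's bottom edge.
sub find_captions {
    my ($runs, $images, $max_gap) = @_;
    my $body = body_style(@$runs);
    my @captions;
    for my $run (@$runs) {
        next if "$run->{font}/$run->{size}" eq $body;
        for my $img (@$images) {
            my $gap = $img->{y0} - $run->{y};
            next if $gap < 0 || $gap > $max_gap;
            next if $run->{x} < $img->{x0} || $run->{x} > $img->{x1};
            push @captions, $run->{text};
            last;
        }
    }
    return @captions;
}

print "$_\n" for find_captions(\@runs, \@images, 30);
```

With real documents you would probably have to tune the gap, also check above the image, and join runs that belong to the same caption line.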

See "Parsing PDFs by text position?" and included links for a start. HTH!

Cheers Rolf