I want to write an application that crawls a given page and all of its child pages (just one level deep).
I have the following requirements:
- PDF, Word, and PPT parsing
- Authentication by cookies. Some pages require you to log in first, and the resulting cookie then authenticates you on every subsequent request.
Do you know of a CPAN module that provides this?
I saw on Google that parts of this question have been asked before, but I want to know whether there is a single module that combines all of these features.