Just another Perl shrine | |
PerlMonks |
comment on |
( [id://3333]=superdoc: print w/replies, xml ) | Need Help?? |
httrack does that by mining the javscript for links, gets the more common ones, but doesn't get them all, and some javascript will redirect you from your local copy back to the internet http://crawler.archive.org/ does that by inserting its own javascript which does url rewriting so the images show up (even the dynamic ones), but like httrack, actual links are rewritten ... Then there is Mozilla Archive Format (with Faithful Save), which does a much better version of save-as, its close to perfect :) Another common tactic is to print-to-pdf from a browser like firefox via automation In reply to Re^3: (almost) preserving a web page
by Anonymous Monk
|
|