Beefy Boxes and Bandwidth Generously Provided by pair Networks
We don't bite newbies here... much
 
PerlMonks  

Re: Simple link extraction tool

by davidrw (Prior)
on Jan 03, 2007 at 18:00 UTC ( [id://592803]=note: print w/replies, xml ) Need Help??


in reply to Simple link extraction tool

a lynx/perl solution:
lynx --dump http://www.example.com | perl -0777 -pe 's/.+^References[\ +r\n]+//sm' # or, to also strip the numbers: lynx --dump http://www.example.com | perl -0777 -pe 's/.+^References[\ +r\n]+//sm; s/^\s*\d+\. //mg'

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://592803]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others contemplating the Monastery: (5)
As of 2024-04-18 06:17 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found