Beefy Boxes and Bandwidth Generously Provided by pair Networks RobOMonk
Just another Perl shrine
 
PerlMonks  

Re: An internet garbage filter

by hossman (Prior)
on Oct 27, 2003 at 07:04 UTC ( #302345=note: print w/ replies, xml ) Need Help??


in reply to An internet garbage filter

Haven't tried running it, but i would suggest converting your banned_sites hash into an array of regexes ... that way you don't have the seperate lists of fixed hosts in the hash, and host regexes in the is_banned_site method.


Comment on Re: An internet garbage filter
Re: Re: An internet garbage filter
by pg (Canon) on Oct 27, 2003 at 07:21 UTC

    For anyone use this program, my suggestion is to have a big hash for banned sites, but only a much smaller array for banned sites expressed with regexp.

    For example, if there are four sites you want to ban:

    • a.foo.com
    • b.foo.com
    • c.foo.com
    • d.foo.com

    It is better to put them all in the hash for fixed site, instead of using regexp, unless that site has a rich variety of names. If it only has three or four different names, put them in hash.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://302345]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others making s'mores by the fire in the courtyard of the Monastery: (8)
As of 2014-04-23 23:30 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    April first is:







    Results (557 votes), past polls