PerlMonks  

Re: An internet garbage filter

by hossman (Prior)
on Oct 27, 2003 at 07:04 UTC (#302345)


in reply to An internet garbage filter

Haven't tried running it, but I would suggest converting your banned_sites hash into an array of regexes. That way you don't have separate lists: fixed hosts in the hash, and host regexes in the is_banned_site method.
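A minimal sketch of that idea, not the original program's code: the patterns below are made-up placeholders (example.com / example.net), and only the names banned_sites and is_banned_site come from the thread. Fixed hosts become fully anchored regexes, so everything lives in one list.

```perl
use strict;
use warnings;

# Hypothetical patterns for illustration; the original list is not shown in this thread.
my @banned_sites = (
    qr/^ads\.example\.com$/i,               # a single fixed host, written as an anchored regex
    qr/(?:^|\.)tracker\.example\.net$/i,    # a domain plus any of its subdomains
);

# Returns 1 if $host matches any banned pattern, 0 otherwise.
sub is_banned_site {
    my ($host) = @_;
    for my $re (@banned_sites) {
        return 1 if $host =~ $re;
    }
    return 0;
}
```

Each lookup walks the list until something matches, so the cost grows with the number of patterns.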


Re: Re: An internet garbage filter
by pg (Canon) on Oct 27, 2003 at 07:21 UTC

    For anyone using this program, my suggestion is to keep a big hash of banned sites, and only a much smaller array for banned sites expressed as regexps.

    For example, if there are four sites you want to ban:

    • a.foo.com
    • b.foo.com
    • c.foo.com
    • d.foo.com

    It is better to put them all in the hash of fixed sites than to use a regexp, unless that site has a rich variety of names. If it has only three or four different names, put them in the hash.
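Roughly what that layout could look like, as a sketch under assumptions rather than the original program: the four foo.com hosts are the ones listed above, while @banned_patterns and the example.net pattern are hypothetical names added for illustration. The exact-name hash is checked first, and the short regex list is scanned only when the hash misses.

```perl
use strict;
use warnings;

# Fixed host names go in a hash: one lookup, however many entries there are.
my %banned_sites = map { $_ => 1 } qw(a.foo.com b.foo.com c.foo.com d.foo.com);

# Hypothetical short list for sites with too many names to enumerate.
my @banned_patterns = (
    qr/(?:^|\.)ads\.example\.net$/,
);

# Returns 1 if $host is banned by exact name or by pattern, 0 otherwise.
sub is_banned_site {
    my ($host) = @_;
    $host = lc $host;                      # host names are case-insensitive
    return 1 if exists $banned_sites{$host};
    for my $re (@banned_patterns) {
        return 1 if $host =~ $re;
    }
    return 0;
}
```

With this split, adding another fixed host does not slow lookups down, whereas every extra regex adds one more match attempt for hosts that are not banned.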
