Beefy Boxes and Bandwidth Generously Provided by pair Networks
Pathologically Eclectic Rubbish Lister
 
PerlMonks  

Re^2: Does this exist yet?... (eq not diff)

by tye (Cardinal)
on May 22, 2003 at 01:07 UTC ( #259958=note: print w/ replies, xml ) Need Help??


in reply to Re: questions concerning using perl to monitor webpages
in thread questions concerning using perl to monitor webpages

Don't use Algorithm::Diff (nor stuff like Text::Diff that uses it) to simply compare for equality.

The person wanted to know which pages had changed, not which lines of each page were unchanged, deleted, added, or modified. Going to the effort to try to find the greatest common sequence of unchanged lines between old and new versions of each page could be a huge waste of resources when all you really want back is a simple Boolean value (per page).

                - tye


Comment on Re^2: Does this exist yet?... (eq not diff)
Re: Re^2: Does this exist yet?... (eq not diff)
by TVSET (Chaplain) on May 22, 2003 at 01:20 UTC

    I agree, that was a bit of ahead of time to rush with Text::Diff from my side, but the memories are still alive in my head when I had to do something very similar. Suprisingly, simple equality tests will not live for a long time untouched in this case. See L~R's comment about dates and other small dynamics on websites.

    But your point taken. :)

    Leonid Mamtchenkov aka TVSET

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://259958]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others chanting in the Monastery: (19)
As of 2014-09-23 15:14 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    How do you remember the number of days in each month?











    Results (224 votes), past polls