Beefy Boxes and Bandwidth Generously Provided by pair Networks
Your skill will accomplish
what the force of many cannot

Re^2: Does this exist yet?... (eq not diff)

by tye (Sage)
on May 22, 2003 at 01:07 UTC ( #259958=note: print w/ replies, xml ) Need Help??

in reply to Re: questions concerning using perl to monitor webpages
in thread questions concerning using perl to monitor webpages

Don't use Algorithm::Diff (nor stuff like Text::Diff that uses it) to simply compare for equality.

The person wanted to know which pages had changed, not which lines of each page were unchanged, deleted, added, or modified. Going to the effort to try to find the greatest common sequence of unchanged lines between old and new versions of each page could be a huge waste of resources when all you really want back is a simple Boolean value (per page).

                - tye

Comment on Re^2: Does this exist yet?... (eq not diff)
Replies are listed 'Best First'.
Re: Re^2: Does this exist yet?... (eq not diff)
by TVSET (Chaplain) on May 22, 2003 at 01:20 UTC

    I agree, that was a bit of ahead of time to rush with Text::Diff from my side, but the memories are still alive in my head when I had to do something very similar. Suprisingly, simple equality tests will not live for a long time untouched in this case. See L~R's comment about dates and other small dynamics on websites.

    But your point taken. :)

    Leonid Mamtchenkov aka TVSET

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://259958]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others cooling their heels in the Monastery: (3)
As of 2016-02-14 21:06 GMT
Find Nodes?
    Voting Booth?

    How many photographs, souvenirs, artworks, trophies or other decorative objects are displayed in your home?

    Results (471 votes), past polls