Beefy Boxes and Bandwidth Generously Provided by pair Networks
XP is just a number
 
PerlMonks  

Re^3: [OT] SpotSigs: Robust and Efficient Near Duplicate Detection in Large Web Collections

by pKai (Priest)
on Oct 09, 2008 at 11:12 UTC ( #716194=note: print w/ replies, xml ) Need Help??


in reply to Re^2: [OT] SpotSigs: Robust and Efficient Near Duplicate Detection in Large Web Collections
in thread [OT] SpotSigs: Robust and Efficient Near Duplicate Detection in Large Web Collections

Might as well be a firewall problem.

Just checked at $work and trying to access that Url http://dbpubs.stanford.edu/pub/...&name=2008-10.pdf, redirects me to

HTTP/1.x 302 Found Date: Thu, 09 Oct 2008 10:54:28 GMT Server: Apache/2.0.49 (Fedora) Location: http://DBPubs.stanford.edu:8090/pub/showDoc.Fulltext?lang=en +&doc=2008-10&format=pdf&compression=&name=2008-10.pdf Content-Length: 400 Connection: close Content-Type: text/html; charset=iso-8859-1

And that server (notice the port number in there) is reported to never answer…

I'll have to try that later @home…


Comment on Re^3: [OT] SpotSigs: Robust and Efficient Near Duplicate Detection in Large Web Collections
Select or Download Code

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://716194]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others surveying the Monastery: (9)
As of 2014-08-28 03:56 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The best computer themed movie is:











    Results (256 votes), past polls