Beefy Boxes and Bandwidth Generously Provided by pair Networks
Pathologically Eclectic Rubbish Lister
 
PerlMonks  

Stats for super search look buggy

by LanX (Saint)
on May 07, 2021 at 20:43 UTC ( [id://11132246]=monkdiscuss: print w/replies, xml ) Need Help??

Is it me or are the stats for super search confusing?

Please open ?node_id=3989;BIT=www.google.com and click Search

I see

2021-05-07 and 2018-06-14 (searched 89.07% of DB).

90%? seriously?

Cheers Rolf
(addicted to the Perl Programming Language :)
Wikisyntax for the Monastery

Replies are listed 'Best First'.
Re: Stats for super search look buggy
by pryrt (Abbot) on May 07, 2021 at 21:20 UTC

      Great. Now I feel even worse about that. :-(

        Well, the percentage calculator could be presumably changed to something like
        use constant jump => 9866219; my $delta = $last_node_in_search_range - $first_node_in_search_range; $delta -= jump if $delta > jump; my $percent = 100 * $delta / ($most_recent_node_id - jump);
Re: Stats for super search look buggy
by jdporter (Paladin) on May 20, 2021 at 18:58 UTC

    I have patched it. Please let me know whether things look better now. Thanks!

    I reckon we are the only monastery ever to have a dungeon staffed with 16,000 zombies.

      FYI: first search results page just showed me "Press <Next> to continue searching remaining 707.00% of DB." In case it's relevant, this was a search for all of qw{ dump opaque object } in any text. The next page showed something more reasonable (remaining 89.4%, or somesuch).

      Thanks, as promised I wanted to do it myself this weekend (Pentecost brings an extra holiday in DE :)

      > Please let me know whether things look better now

      They definitely look better now, but it's still weird

      Please try this search

      The first reply looks OK

      Found 50 nodes roughly between 2021-05-20 and 2018-06-02 (searched 4. +01% of DB). ... Press [NEXT >] to continue searching remaining 95.99% of DB.

      but after [NEXT] it doesn't add up anymore

      Found 50 nodes roughly between 2018-06-02 and 2015-02-16 (searched 7. +80% of DB). ... Press [NEXT >] to continue searching remaining 88.18% of DB.

      That's less than 100%

      If you want I can care about it this weekend.

      Cheers Rolf
      (addicted to the Perl Programming Language :)
      Wikisyntax for the Monastery

        Sure, if you'd like to look into it, that would be great. Thanks!

Re: Stats for super search look buggy
by LanX (Saint) on May 30, 2022 at 12:27 UTC
    Not sure if this is still the same bug, but I got this today around "May 30, 2022 at 13:56 CEST"

      
      Remainder of query was not run (used 10.01 seconds so far)
      
      
      Found 0 nodes roughly between 2022-05-30 and 0 (searched 567.28% of DB). 
      
      
      where any text contains all of "debugger", "ornament"
      
      Date:	Author/owner:	Title:	Node type:
      
      Press >NEXT< to continue searching remaining 304.68% of DB.
      Please be patient after submitting your search.
      
      Link to preload this search: ?node_id=3989;BIT=debugger%20ornament
      

      (emphasize added)

    Please note the 0 too, there seems to be a date problem in edge cases causing such high percentages ...

    EDIT

    Couldn't reproduce it now, probably dependent on other side effects.

    update
    Kind of a Heisenbug, if you run the exact same search, you'll end up in this new node now.

    You need an "impossible" query to force a "Found 0 nodes" response.

    update

    My best guess, this happens only if the 10sec timeout ends above the giant jump from 2019, i.e. the server must be busy and this problem will happen more and more often in the future.

    Cheers Rolf
    (addicted to the Perl Programming Language :)
    Wikisyntax for the Monastery

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: monkdiscuss [id://11132246]
Approved by Bod
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others rifling through the Monastery: (2)
As of 2024-04-26 01:33 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found