Beefy Boxes and Bandwidth Generously Provided by pair Networks
XP is just a number

Re^2: Data compression by 50% + : is it possible?

by LanX (Archbishop)
on May 14, 2019 at 17:37 UTC ( #1233778=note: print w/replies, xml ) Need Help??

in reply to Re: Data compression by 50% + : is it possible?
in thread Data compression by 50% + : is it possible?

Many people in this thread miss crucial information already given in the OP's text!

(apart from reading the explicit example code given)

> - order needs not to be preserved

> - occur only once in a given line.

> - They cannot be consecutive (meaning there is no sequence in a dataset).

I.e. tuples like (3,1, ...), (1,1,...) or (1,2, ...) are impossible. (see OPs if condition)

But the OP's format is obviously highly redundant, he's not only

  • allowing such tuples
  • but also unsorted input
  • and wasting a full byte to encode an 0..9 increment in 9 intervals

Alone the last point leaves sufficient room for compression far beyond near 50%.

Roboticus and I already elaborated this explicitly by demonstrating all possible independent tuples and pointing to their near optimal compression using Huffman coding.

Cheers Rolf
(addicted to the Perl Programming Language :)
Wikisyntax for the Monastery FootballPerl is like chess, only without the dice

  • Comment on Re^2: Data compression by 50% + : is it possible?

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1233778]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others rifling through the Monastery: (11)
As of 2019-06-18 12:54 GMT
Find Nodes?
    Voting Booth?
    Is there a future for codeless software?

    Results (81 votes). Check out past polls.

    • (Sep 10, 2018 at 22:53 UTC) Welcome new users!