Break when best possible filter result found #648

andrews05 · 2024-11-26T08:29:17Z

I had this sudden idea for a tiny optimisation: For heuristic filter strategies, skip checking remaining filters if we find we have the best possible result. We can do this for MinSum, Entropy, Bigrams, and BigEnt, but not Brute.
For most images this will have little-to-no impact, but for some such as the test file "palette_1_should_be_palette_1.png" (where most lines are all zeros) we can see a 20% performance improvement at o2 and 10% at o5.

src/png/mod.rs

AlexTMjugador

Thanks for the PR, it looks great now! Feel free to merge after rebasing and once CI passes 🤞

ace-dent · 2024-11-26T11:57:39Z

Sorry- if this seems obvious- but how is the best possible result determined?

AlexTMjugador · 2024-11-26T12:30:37Z

Sorry- if this seems obvious- but how is the best possible result determined?

I think it's determined from the Shannon's source coding theorem, assuming that the entropy is as ideal as it can get, which is the case for rows with a single constant value. See also this random Reddit topic with less mathy discussion about it.

Edit: to be more clear, not all heuristic filter selection policies implemented in Oxipng are based on entropy. In any case, the best result is chosen to be the optimal size value that can be computed according to the heuristic.

TPS · 2024-11-26T12:42:33Z

I love the concept (& sheer nerdliness) of this — thanks, @andrews05 & @AlexTMjugador! — but is there way to cheaply determine & disable automatically when this shouldn't be used? (E.g., perhaps when random/static or photographic images?) Or is the overhead truly so negligible that's not relevant?

andrews05 · 2024-11-26T19:20:03Z

For each delta filter, the heuristic strategies apply the filter to the current line and then run an algorithm to produce a single value. They then choose the filter based on which one produced the "best" value.

MinSum chooses the one with the smallest value and for this algorithm the smallest possible value for any line is 0.
Bigrams is similar but the smallest possible value is 1.
Entropy and BigEnt choose the largest value and the maximum possible value can be calculated from the length of the line.

To be clear, the best possible value is always deterministic and it's not possible for this change to have any impact on the output file size. It's purely a performance gain that will occur if all bytes in the line are zero. (In retrospect, I could have just checked for this up front and picked the None filter automatically. I might play around with that...)

#648 may have been a bit hasty - I realised afterward that there's a simpler way to achieve the same thing, and include the Brute filter as well. This reverts #648 and instead just picks None up front if the line is all zeros. This is guaranteed to be the chosen filter for MinSum, Entropy, Bigrams and BigEnt. It's almost certainly true for Brute as well but this is harder to prove. I've tested this across hundreds of images and found no change in output.

andrews05 force-pushed the best-result branch from 11885ac to e753810 Compare November 26, 2024 08:34

AlexTMjugador reviewed Nov 26, 2024

View reviewed changes

src/png/mod.rs Outdated Show resolved Hide resolved

AlexTMjugador approved these changes Nov 26, 2024

View reviewed changes

andrews05 added 2 commits November 27, 2024 00:46

Break when best possible filter result found

f6842f2

Use best_possible for all strategies

c80d273

andrews05 force-pushed the best-result branch from b805ad4 to c80d273 Compare November 26, 2024 11:46

andrews05 merged commit 1efacac into shssoichiro:master Nov 26, 2024
12 checks passed

andrews05 deleted the best-result branch November 26, 2024 11:52

andrews05 mentioned this pull request Nov 27, 2024

Assume None if the line is all zeros #650

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Break when best possible filter result found #648

Break when best possible filter result found #648

andrews05 commented Nov 26, 2024 •

edited

Loading

AlexTMjugador left a comment

ace-dent commented Nov 26, 2024

AlexTMjugador commented Nov 26, 2024 •

edited

Loading

TPS commented Nov 26, 2024 •

edited

Loading

andrews05 commented Nov 26, 2024

Break when best possible filter result found #648

Break when best possible filter result found #648

Conversation

andrews05 commented Nov 26, 2024 • edited Loading

AlexTMjugador left a comment

Choose a reason for hiding this comment

ace-dent commented Nov 26, 2024

AlexTMjugador commented Nov 26, 2024 • edited Loading

TPS commented Nov 26, 2024 • edited Loading

andrews05 commented Nov 26, 2024

andrews05 commented Nov 26, 2024 •

edited

Loading

AlexTMjugador commented Nov 26, 2024 •

edited

Loading

TPS commented Nov 26, 2024 •

edited

Loading