Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

proposal: sort: return equal values in non-deterministic order #13884

Closed
rsc opened this issue Jan 8, 2016 · 25 comments
Closed

proposal: sort: return equal values in non-deterministic order #13884

rsc opened this issue Jan 8, 2016 · 25 comments

Comments

@rsc
Copy link
Contributor

rsc commented Jan 8, 2016

Crazy idea, but what if sort.Sort randomly permuted its input a bit before starting?

Go 1.6 has a different sort.Sort than Go 1.5 and I've seen at least a dozen test failures at Google that were implicitly depending on the old algorithm. The usual scenario is that you sort a slice of structs by one field in the struct. If there are entries with that field equal but others unequal and you expect a specific order for the structs at the end, you are depending on sort.Sort's algorithm. A later sort.Sort might make a different choice and produce a different order. This makes programs not portable from one version of Go to another, much like map hash differences used to make programs not portable from one architecture to another. We solved maps by randomizing the iteration order a bit. In the map case it's not a full permutation, but just enough variation to make tests obviously flaky.

I wonder if we should do the same for sort.Sort. It would only take N swaps to shuffle things quite well, and we already use N*log(N) swaps, so N*(log(N)+1) is not likely to be noticed. That would also protect a bit against malicious inputs.

It would surprise people, especially people who think sort.Sort == sort.Stable. But the rationale is that it's better to surprise them the first time they run the code instead of however many Go releases later.

/cc @robpike, @griesemer, @ianlancetaylor, @aclements, @dr2chase

@rsc rsc added this to the Go1.7Early milestone Jan 8, 2016
@bradfitz
Copy link
Contributor

bradfitz commented Jan 8, 2016

Interesting. Perhaps we only do it for "small" inputs (most likely: tests), where it would especially unnoticed.

@rsc
Copy link
Contributor Author

rsc commented Jan 8, 2016 via email

@ALTree
Copy link
Member

ALTree commented Jan 8, 2016

There's one thing to consider.

The work the runtime does to randomize map iteration order is 100% justified because every time you iterate over a map you get a different order, so you are seeing the effects of the randomization 100% of the times.

That wouldn't be true for sort.Sort randomization. You write

It would only take N swaps to shuffle things quite well

but you don't really care about "shuffling the array well". What you really care about is the chance that N swaps change the order of the sorted array. Suppose you do the math and find out that that chance is really small, unless the array has a lot of repeated entries.

That would mean that you are always paying a price for a benefit that has zero chances to affect the user for arrays with no repeated entries, a very small chance to be seen if the array has a moderated number of repeated entries, and a decent chance to be a real benefit only for arrays with a decent number of repeated entries.

The bottom line is that one thing is to pay a (maybe) small price for something that has a visible effect 100% of the times, and another thing is to pay a (maybe) small price for something that has a (maybe) very small chance to actually do something useful. If the final result of the sorting is unchanged, the work spent shuffling is 100% wasted.

Just a random thought. I didn't do any math, nor run any benchmark.

@griesemer
Copy link
Contributor

I like it. If people want stable sort they should use stable sort.

@rsc
Copy link
Contributor Author

rsc commented Jan 8, 2016

@ALTree The math works out OK. If you have two equal elements and sort.Sort puts them in one order, then if you swap them sort.Sort will necessarily put them in the other order, since it can't tell them apart. For any pair of places they might be, as long as the chance of one being before the other is 50/50 it works out.

Another possibility is a final run over the sorted list looking for equal entries and swap them randomly. That's N comparisons instead of N swaps though, and swaps are usually cheaper.

The price here is being paid on behalf of future compatibility. It's always 100% wasted if you ignore that. It would only make sense if the price were low enough to make sense to pay all the time. (And the larger the data being sorted the lower the price.)

@ianlancetaylor
Copy link
Member

We could simply randomly flip the order of the insertion sort we use in the final step. If the comparison function is not fully specified, we should get different results. That would be essentially free.

@rsc
Copy link
Contributor Author

rsc commented Jan 8, 2016 via email

@robpike
Copy link
Contributor

robpike commented Jan 8, 2016

This gives me a bad taste. You're (mildly) punishing all users to make it easier for you (you in particular) to find bugs in others' code triggered by changes in the code base. I understand the argument for maps, but there the cost is minute and the frequency of dependence high. Here you need multiple sort operations even to create the possibility of the issue.

Much as I understand your frustration in dealing with the problem, it bothers me that that your solution is to take it out on everyone.

@ianlancetaylor
Copy link
Member

The insertion sort code is

func insertionSort(data Interface, a, b int) {
    for i := a + 1; i < b; i++ {
        for j := i; j > a && data.Less(j, j-1); j-- {
            data.Swap(j, j-1)
        }
    }
}

We randomly flip between that version and

func insertionSort(data Interface, a, b int) {
    for i := b - 2; i >= a; i-- {
        for j := i; j < b-1 && !data.Less(j, j+1); j++ {
            data.Swap(j, j+1)
        }
    }
}

I probably have an off-by-one error in there, but that's the basic idea.

@cespare
Copy link
Contributor

cespare commented Jan 8, 2016

I like this idea. WRT the performance hit: people who need really fast sorting aren't using sort.Sort anyway.

@robpike
Copy link
Contributor

robpike commented Jan 9, 2016

It's not the performance hit that bothers me, it's the message: We will deliberately surprise you with our algorithms because of something that doesn't really matter to you.

I'm not totally opposed, I just find it distasteful.

@davecheney
Copy link
Contributor

I'm with Rob. The understand the sentiment of this and the precedent of the
map change but on balance I would prefer not to see this change land.

To give some background, juju is close to 200k lines and many of our tests
were faulty and needed to be fixed as I'm trying to drag canonical off Go
1.2. This is a good thing.

However I've also spent an equal amount of time internally defending the
map change from people who persist that Go changed something by randomising
map ordering, when of course there was never a promise of ordering in the
first place. My coworkers are unable to see this as their own carelessness,
and blame Go for deliberately breaking their code to this day.

This sort change may help people project themselves from themselves, but
they aren't going to thank us, no matter how well intentioned our
motivations.

Thanks

Dave

On Sat, 9 Jan 2016, 15:35 Rob Pike notifications@github.com wrote:

It's not the performance hit that bothers me, it's the message: We will
deliberately surprise you with our algorithms because of something that
doesn't really matter to you.

I'm not totally opposed, I just find it distasteful.


Reply to this email directly or view it on GitHub
#13884 (comment).

@griesemer
Copy link
Contributor

@davecheney I understand the sentiment in this case. But a future perhaps more efficient implementation may lead to a similar unanticipated change of order, with the same effect on code that makes implicit order assumptions. It seems to me it would be better to eliminate those errors earlier than later (or possibly even multiple times).

It would be interesting to make the experiment. GO16SORTEXPERIMENT=1 could enable it. (disabled by default). That way teams willing to take the plunge can try it out and see if it exposes any real errors. It would be fine to remove again for 1.7 if there's no real benefit.

@minux
Copy link
Member

minux commented Jan 9, 2016 via email

@davecheney
Copy link
Contributor

Thanks for your comments Robert. My position is that it was not that this
sort change or the map change were the results of an accidental side effect
of an optimisation, but that it was a willful, however well intentioned,
change that my colleagues are unable to accept.They cannot get past the
fact that this was a deliberate chnage aimed at flushing out their code.

I honestly believe if a change to the sort algorithm caused the effect that
is being deliberately proposed today, people would be more accepting of
this change as a bug in their code than they were about the map change. I
don't agree with the argument that we'll probably break this invariant
(which wasn't an invariant to being with) later, so its best to do it in a
controlled manner. Developers have a controlled manner, they can stay on
the previous version of Go until they have fixed their buggy code.

Re making this optional, nobody used the vendor experiment til it was
enabled by default, and few people test the beta versions of go we push out
no matter how hard we pleed, if you're going to make this change, please
don't sit on the fence by making it opt in.

Thanks

Dave

On Sat, 9 Jan 2016, 17:58 Robert Griesemer notifications@github.com wrote:

@davecheney https://github.com/davecheney I understand the sentiment in
this case. But a future perhaps more efficient implementation may lead to a
similar unanticipated change of order, with the same effect on code that
makes implicit order assumptions. It seems to me it would be better to
eliminate those errors earlier than later (or possibly even multiple times).

It would be interesting to make the experiment. GO16SORTEXPERIMENT=1 could
enable it. (disabled by default). That way teams willing to take the plunge
can try it out and see if it exposes any real errors. It would be fine to
remove again for 1.7 if there's no real benefit.


Reply to this email directly or view it on GitHub
#13884 (comment).

@rsc
Copy link
Contributor Author

rsc commented Jan 9, 2016 via email

@davecheney
Copy link
Contributor

On Sun, Jan 10, 2016 at 5:55 AM, Russ Cox notifications@github.com wrote:

@davecheney, regardless of this issue, sort.Sort is changing in Go 1.6 (it
already has). The new code runs significantly (10-20%) faster than the old
code, but it also leaves ostensibly equal elements in a different order
than Go 1.5's sort.Sort did. We are not including the old sort algorithm as
a user-controlled option in Go 1.6. From what you wrote, it sounds like
your colleagues, if they are affected, will be surprised that sort.Sort has
fewer guarantees that they realized and upset that Go 1.6 has changed a
detail they didn't realize could change. If so, I am sorry.

Hi Russ,

This is fine, sort.Sort is still operating within its contract. I'm sure
people will be fine with this change.

But that situation is exactly the rationale for making sort.Sort just a
little unpredictable in its handling of equal elements. It keeps people
from getting too comfortable with the new sort.Sort implementation and
being surprised the next time we take a significant optimization
opportunity. It also keeps people from being surprised if there's ever a
competing Go implementation with a different algorithm.

I just cannot relate this paragraph to the previous one. In the previous
paragraph sort.Sort's performance has improved, but as a side effect it can
produce output that is not identical, but still compliant. This is fine.
Every release we tweak the scheduler and the result of programs changes if
people haven't been very careful to synchronise them; although arguably
they shouldn't have been relying on output from multiple goroutines in the
first place because it is not guaranteed.

But, what this proposal is suggesting to me is deliberately introducing
(compliant) instability into sort.Sort -- which is fine, as you and Robert
have pointed out, if you need stable sort, you should use sort.Stable -- as
"tough love" to remind people that they are not being studious enough in
reading the documentation.

Here's how the conversation usually goes on IRC, mailing list, internal
chat, etc

A: hey, sort.Sort broke in 1.6, it doesn't give me the same results!
B: sort.Sort got faster in 1.6, as a result the sort ordering is not the
same, i know this sounds weird that sorting could return things in a
different order, but that's how it is. Maybe you want to use sort.Stable
instead?
A: hmm, I never knew about sort.Stable, I guess I'd better go an read the
documentation again, thanks

But instead I'm worried about the conversation going like this

A: hey, sort.Sort broke in 1.6, it doesn't give me the same results!
B: yeah, so we found that lots of people where using sort.Sort where they
really should have been using sort.Stable. Because they were relying on an
implementation detail of how sort.Sort worked; it wasn't ever guarenteed to
give a stable sort, it just mostly did.
A: So Go changed sort.Sort in 1.6 ? How is this not a Go 1 contract
violation ? Was it a bug ?
B: Well, sort.Sort got optimised, it got another 10-20% faster, but then we
started to find code that was relying on the output of the old
implementation. So, to make sure people can find all these places in their
code sort.Sort now shuffles some elements before sorting to make the sort
deliberately unstable.
A: What ? sort does't sort, it shuffles ?
B: No, the results are always sorted, but this is the difference between a
stable sort and an unstable sort.
A: So why did Go 1.6 change sort.Sort to be unstable ?
B: It was always unstable, you were just lucky that it looked stable enough
for your tests.
A: And now sort.Sort is always unstable, so now I have to update all my
code and check all my tests?
B: Yes, sorry.

I'd much rather be having the first conversation, not the second.

Thanks for your time

Dave


Reply to this email directly or view it on GitHub
#13884 (comment).

@griesemer
Copy link
Contributor

@davecheney "A: And now sort.Sort is always unstable, so now I have to update all my
code and check all my tests?" I'm pretty sure engineers rather have correct code than accidentally correct code. But I do understand the nuisance of having to update tests at inopportune moments. Hence maybe an environment variable (to disable rather then enable shuffling as I first suggested) might be just the ticket to get over this hurdle.

@rsc rsc changed the title sort: randomize input before beginning? sort: return equal values in non-deterministic order? Jan 9, 2016
@rsc
Copy link
Contributor Author

rsc commented Jan 9, 2016 via email

@davecheney
Copy link
Contributor

Thanks for pointing out that this is for 1.7, I had lost that part of the
thread.

Thanks for listening to my side of the story.

Dave

On Sun, 10 Jan 2016, 08:07 Russ Cox notifications@github.com wrote:

Please note that this issue is not for Go 1.6. It is marked as a crazy idea
for Go 1.7, one that we could try early and see (or not).

Please also note that the suggestion is to flip a coin to decide the order
of equal values. Shuffling the input was an implementation detail, and Ian
came up with a simpler, cheaper implementation. But the implementation is
not important, just the effect. I've retitled the bug accordingly.

The only possible visible difference between sorting algorithms is how they
arrange equal values. There was discussion on the CL about whether to
accept the Go 1.6 change at all, exactly because it would break things. I
decided that we couldn't reasonably turn down a 10-20% improvement for fear
of breaking code depending on unspecified behavior. But as expected it does
break things.

Re "now I have to update all my code and check my tests", that's basically
true for Go 1.6 with just the algorithmic change. The idea in this issue -
flipping a coin for equal values, so that all future algorithmic changes
are invisible - is meant to make it possible to respond "and this is the
last time you'll need to do that when you update to a new Go version."

But again, it's a crazy idea, and it's not for Go 1.6.


Reply to this email directly or view it on GitHub
#13884 (comment).

@robpike
Copy link
Contributor

robpike commented Jan 10, 2016

How about a much weaker compromise: Make it stronger in the documentation that the sort is not stable and that it may change between versions and implementations.

@rsc
Copy link
Contributor Author

rsc commented Jan 10, 2016 via email

@bradfitz
Copy link
Contributor

(Aside: the scheduler randomness for -race has been invaluable this cycle. I've used it to fix a number of tricky bugs.)

@dr2chase
Copy link
Contributor

Would it make sense to have sort (and other potentially-future-different interfaces) behave more randomly under "go test", combined with flags to allow you to control where the wiggles are? That makes testing more aggressive and production more predictable (usually). So the randomness would be opt-out for testing, opt-in for production.

In the SSA code I get a lot of mileage from a hash-based match for where to (not)activate buggy code; I'm not sure how we would do that for APIs, but it would be nifty to be able to isolate the exact buggy call in the same way that I can (now automatically) isolate exactly which function is incorrectly compiled. Contra that, there's a bit of a learning curve to managing all the random-activation knobs.

@rsc
Copy link
Contributor Author

rsc commented Feb 26, 2016

I wish we'd done this when we introduced sort.Sort, but I think it's too late to do the experiment now. Proposal withdrawn/declined.

@rsc rsc changed the title sort: return equal values in non-deterministic order? proposal: sort: return equal values in non-deterministic order Feb 26, 2016
@rsc rsc modified the milestones: Proposal, Go1.7Early Feb 26, 2016
@rsc rsc closed this as completed Feb 26, 2016
@golang golang locked and limited conversation to collaborators Feb 28, 2017
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests