document random reproducibility policy #33350

stevengj · 2019-09-21T15:46:40Z

See e.g. discussion in #30494 and on discourse.

See also https://numpy.org/neps/nep-0019-rng-policy.html for a similar discussion by the NumPy developers.

tpapp · 2019-09-21T16:17:36Z

Thanks for writing this up.

Do you think we can/should make a commitment to reproducibility across OSs/architectures? The native Julia code should have this property (though we should be careful about endianness if the issue arises), but I don't know if the dSMFT code can guarantee this.

stevengj · 2019-09-21T17:49:40Z

I would hope that the MT code would guarantee portability, since it is supposed to be following a specific number-theoretic sequence...

stevengj · 2019-09-23T12:05:59Z

Should be good to merge? CI failure (cp: cannot stat 'dist-extras/7z.*': No such file or directory) is unrelated, obviously.

KristofferC · 2019-09-23T12:08:38Z

cc @staticfloat for the win failure

GregPlowman · 2019-09-24T03:05:26Z

I would hope that the MT code would guarantee portability, since it is supposed to be following a specific number-theoretic sequence...

Yes, but perhaps higher-level rand functions might be architecture-specific.
32-bit vs 64 bit
Alternative/future RNGs (other than MT)
Secure/crypto rands with hardware support

Would it be better to explicitly state policy w.r.t architecture?

#29240 (comment)
#29240 (comment)
#29240 (comment)

rfourquet · 2019-09-24T16:29:52Z

stdlib/Random/docs/src/index.md

+
+Software tests that rely on *specific* "random" data should also generally save the data or embed it into the test code.  On the other hand, tests that should pass for *most* random data (e.g. testing `A \ (A*x) ≈ x` for a random matrix `A = randn(n,n)`) can use an RNG with a fixed seed to ensure that simply running the test many times does not encounter a failure due to very improbable data (e.g. an extremely ill-conditioned matrix).
+
+The statistical *distribution* from which random samples are drawn *is* guaranteed to be the same across any minor Julia releases.


Would this rule out the possibility to change the distribution of rand(::Type{<:AbstractFloat}) from uniform in [0,1) into uniform in (0,1) which is currently discussed? or is this a "minor" enough change?

In measure theoretic terms, those are technically the same distribution 😬

document random reproducibility policy

bebbb73

stevengj added docs This change adds or pertains to documentation randomness Random number generation and the Random stdlib labels Sep 21, 2019

stevengj added 3 commits September 21, 2019 11:48

Update index.md

0ec10fc

Update index.md

4c822ff

Update index.md

4fe3306

StefanKarpinski approved these changes Sep 21, 2019

View reviewed changes

Update index.md

7dfd48a

KristofferC merged commit cc6ae96 into master Sep 23, 2019

delete-merged-branch bot deleted the randdoc branch September 23, 2019 12:08

stevengj mentioned this pull request Sep 24, 2019

implement "nearly division less" algorithm for rand(a:b) #29240

Merged

rfourquet reviewed Sep 24, 2019

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

document random reproducibility policy #33350

document random reproducibility policy #33350

stevengj commented Sep 21, 2019 •

edited

Loading

tpapp commented Sep 21, 2019

stevengj commented Sep 21, 2019

stevengj commented Sep 23, 2019

KristofferC commented Sep 23, 2019

GregPlowman commented Sep 24, 2019

rfourquet Sep 24, 2019

StefanKarpinski Sep 25, 2019


		Software tests that rely on specific "random" data should also generally save the data or embed it into the test code. On the other hand, tests that should pass for most random data (e.g. testing `A \ (A*x) ≈ x` for a random matrix `A = randn(n,n)`) can use an RNG with a fixed seed to ensure that simply running the test many times does not encounter a failure due to very improbable data (e.g. an extremely ill-conditioned matrix).

		The statistical distribution from which random samples are drawn is guaranteed to be the same across any minor Julia releases.

document random reproducibility policy #33350

document random reproducibility policy #33350

Conversation

stevengj commented Sep 21, 2019 • edited Loading

tpapp commented Sep 21, 2019

stevengj commented Sep 21, 2019

stevengj commented Sep 23, 2019

KristofferC commented Sep 23, 2019

GregPlowman commented Sep 24, 2019

rfourquet Sep 24, 2019

Choose a reason for hiding this comment

StefanKarpinski Sep 25, 2019

Choose a reason for hiding this comment

stevengj commented Sep 21, 2019 •

edited

Loading