This repository was archived by the owner on Nov 17, 2023. It is now read-only.

Gpu samplers #8179

Merged
piiswrong merged 2 commits into apache:master from asmushetzel:gpu_samplers
Oct 10, 2017

Conversation

@asmushetzel
Contributor

Provide a generic implementation for all random samplers that can run on cpu and gpu.

Remarks:

  • This will fail until PR 297 is merged into mshadow.

  • The design pattern adopted for the GPU is to generate one random state per sample to be drawn, so that each sample is drawn using its own randstate. According to Nvidia's documentation this pattern is acceptable. There is a theoretical chance that the random sequences generated by the individual randstates are somehow correlated, but Nvidia states that they have never observed such an effect; they merely note that they cannot prove it can never happen theoretically. So in practice, this should generate high-quality, uncorrelated samples.

  • The above design pattern is also acceptable w.r.t. the setup time of the randstates, as we consistently use sequence 0 of each randstate (so the setup does not have to skip through multiple sequences when initializing). Again, according to Nvidia.

  • I decided to use the same design pattern even for the uniform/normal distributions, as this keeps the implementation consistent across all samplers.

  • The design pattern allows easy implementation of all types of rejection sampling methods, so we should be able to add other distributions easily whenever we need them.

  • The above pattern automatically enables multi-threaded sampling on the CPU when OpenMP is enabled.

  • The sampling methods for the exponential/gamma/Poisson/negative binomial distributions are all standard and mostly the same as those used by the STL. The rejection method used for large lambdas in the Poisson distribution is slightly different but also theoretically sound (reference in the code).

  • There is a problem in the fp16 case: no samplers exist that work natively at this limited precision. I don't know why we recently added fp16 as a valid output type for the random sampling operators. We should never have done this; instead we should insist that an explicit cast operator be used, which could then also handle all overflow/underflow issues in a centralized way. As it stands, we have to convert fp32 samples to fp16, and since these are samples from a distribution, there is always a chance of overflow. Ugly. This was already the case in the prior implementation, so it is not newly introduced here, and IMO we should not solve it within the samplers but with a separate cast operator.

  • Adding additional outputs/functionality to the random samplers, such as re-parametrization, CDFs, etc., is not part of this PR. We should add that as a separate step.
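The per-sample randstate pattern described above can be mirrored in a short NumPy sketch (hypothetical, not the actual CUDA code: `sample_exponential`, the master-seed handling, and the inverse-CDF draw are all illustrative). One master seed is split into one child seed per sample, and each sample is drawn from its own independently seeded generator, just as each GPU thread would use its own curand state:

```python
import numpy as np

def sample_exponential(lam, n, master_seed=0):
    """Draw n Exp(lam) samples, using one independent generator per sample."""
    # One child seed per sample, analogous to one curand state per thread.
    children = np.random.SeedSequence(master_seed).spawn(n)
    out = np.empty(n)
    for i, child in enumerate(children):
        rng = np.random.Generator(np.random.PCG64(child))
        # Inverse-CDF sampling: X = -log(1 - U) / lam for U ~ Uniform(0, 1).
        u = rng.random()
        out[i] = -np.log1p(-u) / lam
    return out

samples = sample_exponential(lam=2.0, n=10000)
print(abs(samples.mean() - 0.5) < 0.05)  # mean of Exp(2) is 0.5
```

Because every sample owns its generator, each per-sample loop body is free to draw as many uniforms as it needs, which is exactly what makes rejection-sampling methods easy to slot into this pattern.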
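The fp16 overflow concern from the description can be demonstrated directly (a standalone NumPy sketch, not MXNet code): float32 samples that exceed the float16 range (max ~65504) turn into inf when cast down.

```python
import numpy as np

rng = np.random.default_rng(0)
# An exponential with a large scale routinely produces values above
# the float16 maximum of ~65504.
samples_f32 = rng.exponential(scale=1e5, size=1000).astype(np.float32)
samples_f16 = samples_f32.astype(np.float16)

overflowed = int(np.isinf(samples_f16).sum())
print(overflowed > 0)  # True: some finite fp32 samples became inf in fp16
```

A centralized cast operator, as suggested above, could clamp or saturate such values in one place instead of every sampler handling the edge case itself.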

@asmushetzel asmushetzel force-pushed the gpu_samplers branch 2 times, most recently from d8fd75e to d25943d on October 8, 2017, 17:52
assert np.abs(check_func(ret1, params)) < tol, \
    "ndarray test: %s check for `%s` did not pass" % (check_name, name)

# check multi-distribution sampling, only supports cpu for now
if device.device_type == 'cpu':
Contributor
This is for testing imperative.

@asmushetzel asmushetzel force-pushed the gpu_samplers branch 2 times, most recently from ebfd5f2 to eeaf4dd on October 9, 2017, 11:10
@asmushetzel
Contributor Author

Please do not merge yet; I am investigating one test failure related to test_loss.py (which involves randoms).

@asmushetzel
Contributor Author

Found the issue. Can you please integrate PR 300 into mshadow?

BTW: in test_loss.py there are a lot of calls to np.random.seed() where I think you want to call mx.random.seed() instead. I realized that you changed this recently, Eric, so you may still be working on it.

@piiswrong
Contributor

Could you put the imperative tests back? Then we can merge.

@asmushetzel
Contributor Author

Concerning the imperative tests: they are still in. I just enabled them for GPU as well, which only required changing the indentation of the whole code block. Can you take a second look, please?

Other than that, I'm done.

BTW: Mid-term, I would like to get rid of any dependency on mshadow::random and move the remaining stuff (having some generators centrally allocated) to src/operator/random. What is your take on that?

@piiswrong
Contributor

Sure, sounds good to me.

@piiswrong piiswrong merged commit d22373b into apache:master Oct 10, 2017
mbaijal pushed a commit to mbaijal/incubator-mxnet that referenced this pull request Oct 12, 2017
* Samplers for GPU

* adjustments for GPU samplers
crazy-cat pushed a commit to crazy-cat/incubator-mxnet that referenced this pull request Oct 26, 2017
* Samplers for GPU

* adjustments for GPU samplers
@asmushetzel asmushetzel deleted the gpu_samplers branch October 29, 2017 13:59
