Re: Demo for Parallel Core Collection API

Tristan Yan Thu, 19 Dec 2013 04:18:29 -0800

Hi Paul And Everyone
Sorry for getting back late.

I took Paul's suggestion and have written other two demos which presentsusage of parallel computation. One is using Monte-Carlo to calculatevalue of PI. Other is find a big prime by given length. Please review it.http://cr.openjdk.java.net/~tyan/sample/webrev.00/<http://cr.openjdk.java.net/%7Etyan/sample/webrev.00/>There is another demo which present mandelbrot set was designedAlexander Kouznetsov has been already in reviewing. It's not my codereview request.

Thank you very much
Tristan



On 10/15/2013 11:20 PM, Paul Sandoz wrote:

On Oct 15, 2013, at 4:35 PM, Tristan Yan <tristan....@oracle.com<mailto:tristan....@oracle.com>> wrote:
Hi Paul
you have comments "suggest that all streams are sequential. There isan inconsistency in the use and in some cases it is embedded in otherstream usages."
We do not really understand what exactly is meant, could youelaborate a little bit. Is it because we want to show ppl that weshould use stream more than parallelStream?
Going parallel is easy to do but not always the right thing to do.Going parallel almost always requires more work with the expectationthat work will complete sooner than the work required to get the sameresult sequentially. There are a number of factors that affect whetherparallel is faster than sequential. Two of those factors are N, thesize of the data, and Q the cost of processing an element in thepipeline. N * Q is a simple cost model, the large that product thebetter the chances of parallel speed up. N is easy to know, Q not soeasy but can often be intuitively guessed. (Note that there are otherfactors such as the properties of the stream source and operationsthat Brian and I talked about in our J1 presentation.)
Demo code that just makes everything (or most streams) parallel issending out the wrong message.
So i think the demo code should present two general things:

1) various stream functionality, as you have done;
2) parallel vs. sequential for various cases where it is known thatparallel is faster on a multi-core system.
For 2) i strongly recommend measuring using jmh [1]. The data sets youhave may or may not be amenable to parallel processing, it's worthinvestigating though.
I have ideas for other parallel demos. One is creating probably primes(now that SecureRandom is replaced with ThreadLocalRandom), creating aprobably prime that is a BigInteger is an relatively expensiveoperation so Q should be high. Another more advanced demo is aMonte-Carlo calculation of PI using SplittableRandom and a specialSpliterator, in this case N should be largish. But there are othersimpler demonstrations like sum of squares etc to get across that Nshould be large. Another demo could be calculation of a mandelbrotset, which is embarrassingly parallel over an area in the complex plane.
So while you should try and fit some parallel vs. sequential executioninto your existing demos i do think it worth having a separate set ofdemos that get across the the simple cost model of N * Q. So feel freeto use some of those ideas previously mentioned, i find those ideasfun so perhaps others will too :-)
Paul.

[1] http://openjdk.java.net/projects/code-tools/jmh/
On Oct 15, 2013, at 4:37 PM, Tristan Yan <tristan....@oracle.com<mailto:tristan....@oracle.com>> wrote:
Also there is one more question I missed
You suggested ""ParallelCore" is not a very descriptive name. Suggest"streams"."
1) yes we agree this demo is not for parallel computation per se
2) but we do not have a clear demo for parallel computation
3) if we are to rename this, we need to develop another one, do youhave a scenario for that?

Re: Demo for Parallel Core Collection API

Reply via email to