Raul Miller wrote:
What does "first" mean in the context of "parallel"? (Does it mean
anything different than "randomly chosen"?)
Skip says:
In the case of my "parallel" operation definition, "first" has a very
specific meaning. The parallel function I described is defined such that
it is impossible to start two or more parallel operations at the same
time. Of course, many parallel operations can be _executing_ at the same
time, but each of the operations must start in a sequential manner. By
definition, all parallel tasks started by the parallel function will
have a well-defined, well-known order of execution start. This
establishes a "priority" of parallel tasks, which helps to prevent the
types of conflicts that can arise when multiple tasks attempt to work on
the same data at the same time. So when I say "first", I mean the order
of execution start, which defines priority, nothing more. In your example:
'A B C'=: exp"0(4 4 4)
parallel each A;B;C
The order of starting of the functions A, B, and C must be defined. If
we say the order is start A, then start B, then start C, that defines
the start order. The actual delays between the consecutive starts could
be a few nanoseconds. Thus if A is started first, any reference by B or
C to variables accessed or modified in A would be blocked until A had
completed its operations on that variable. It would be the job of the
interpreter to enforce this rule. If there are common variables, the
B and C processes would have to wait for the A process to finish
operating on the shared variable before they could operate on it.
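The scheme described above can be modeled outside J. Here is a minimal Python sketch (Python threading stands in for the hypothetical J interpreter machinery; the variable names and the per-variable release signal are my own illustration, not part of any proposal). Task A is started first, so it has priority on the shared variable; task B must wait until A signals that it is finished with that variable. Note that A releases only the variable, not its whole execution, which matches the point below about not waiting for the higher-priority task to finish completely.

```python
import threading

# Illustrative sketch only (Python, not J): one release signal per
# shared variable. The higher-priority task (started first) signals
# when it is finished with the variable; the lower-priority task waits
# for that signal before touching it.

shared = {"x": 0}
x_released = threading.Event()  # set when task A is done with "x"
results = []

def task_a():                   # started first, so it has priority on "x"
    shared["x"] += 1            # A works on the shared variable
    results.append(("A", shared["x"]))
    x_released.set()            # A is finished with "x"; B may proceed
    # A may keep doing unrelated work here; it has released only "x".

def task_b():                   # started second, lower priority
    x_released.wait()           # blocked until A releases the variable
    shared["x"] += 10
    results.append(("B", shared["x"]))

# The starts are sequential -- A before B -- which defines the priority.
a = threading.Thread(target=task_a); a.start()
b = threading.Thread(target=task_b); b.start()
a.join(); b.join()
print(results)   # [('A', 1), ('B', 11)] -- B always sees A's update
```

Because B's access is gated on A's release of the variable, the result is deterministic even though both threads run concurrently.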
Raul Miller wrote:
For that matter, what does "lock" mean? Does that mean that any attempt
to read that variable must wait until the other expression is complete?
Skip says:
The lower-priority parallel process doesn't necessarily have to wait
until the higher-priority process has completely finished before
accessing a shared variable, but the lower-priority task must at least
wait until the higher-priority process has finished its work on the
shared variable before it can proceed.
Raul Miller wrote:
Or does it mean that writes to the variable are atomic?
Skip says:
No. Making writes atomic would be a much more complex way to prevent
conflicts.
Raul Miller wrote:
[Do you see how these can be different?] Or does it mean something else?
Skip says:
Yes, they are obviously different, and the second one is prohibited.
Raul Miller wrote:
What happens when thread A is waiting on thread B, but thread B is waiting on
thread A?
Skip says:
This will never happen with the parallel function, because the "first"
started parallel task, in this case A, will always preempt the other
started tasks. A will never wait on B, as A has priority on all variable
operations that may conflict between A & B (or C). B will always wait on
A if there are shared data operations because of this priority.
Similarly, B will never wait on C, as B has priority on all variable
operations that may conflict between B & C. C, being the low man on the
totem pole, will have to wait on anybody who is working on a variable
that they share in common.
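The claim above is an instance of the classic lock-ordering argument: if every task acquires the locks for shared variables in one global order, the circular wait required for deadlock cannot form. A minimal Python sketch (my illustration, not J; alphabetical lock order stands in for start-order priority):

```python
import threading

# Illustrative sketch only (Python, not J): both tasks acquire lock_x
# before lock_y -- never the reverse -- so neither can end up holding
# one lock while waiting on a holder of the other. With a consistent
# global order, "A waits on B while B waits on A" is impossible.

lock_x = threading.Lock()
lock_y = threading.Lock()
log = []

def worker(name):
    with lock_x:          # always first
        with lock_y:      # always second
            log.append(name)

t1 = threading.Thread(target=worker, args=("A",))
t2 = threading.Thread(target=worker, args=("B",))
t1.start(); t2.start()
t1.join(); t2.join()
print(sorted(log))   # ['A', 'B'] -- both tasks complete, no deadlock
```

If one worker instead took lock_y first, the two threads could each grab one lock and wait forever on the other; the fixed order rules that out, which is exactly what the start-order priority buys.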
This encourages the programmer to break up the data under work in such a
way as to prevent shared access to the same data, if the programmer wants
to optimize parallelism. Otherwise, the program will run sequentially,
with little or no gain in execution speed. I learned this priority trick
to prevent parallel conflicts years ago, working on real-time parallel
schemes for character recognition engines.
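The partitioning advice above can also be sketched concretely (Python again, as an illustration only; the slicing scheme is mine). When each task owns a disjoint index range, no variable is shared, so no task ever blocks on another and the work genuinely runs in parallel:

```python
import threading

# Illustrative sketch only (Python, not J): partition the data so the
# tasks never touch the same elements. No shared variables means no
# priority waits at all -- the ideal case for parallel speedup.

data = list(range(8))
out = [0] * 8

def square_slice(lo, hi):
    for i in range(lo, hi):   # each task owns a disjoint index range
        out[i] = data[i] ** 2

t1 = threading.Thread(target=square_slice, args=(0, 4))
t2 = threading.Thread(target=square_slice, args=(4, 8))
t1.start(); t2.start()
t1.join(); t2.join()
print(out)   # [0, 1, 4, 9, 16, 25, 36, 49]
```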
Raul Miller wrote:
Consider:
a=:a,b
What happens here in a parallel context? (Remember that J allocates
up to double the strictly necessary space for arrays to make
this expression fast in the typical case.)
Skip says:
Since this example does not use the parallel function, there will be no
parallel execution of the assignment.
Raul Miller wrote:
(Would it be acceptable to make J substantially slower in
contexts where it's currently fast so that it can run multiple
threads?)
Skip says:
No. The interpreter should be designed to attempt parallelization
only if it will significantly speed up the execution of a
specific primitive.
Raul Miller wrote:
I thought we were discussing your "parallel" operation, hinted at
above? Are you saying that certain primitives would throw an
error if they were used in one of these threads?
Skip says:
You are justified in being confused here; I was mixing discussions. My
response was about the parallelization of primitives. If primitives in J
were to support parallelism, they should resort to parallelism only when
it benefits performance, and should not hurt performance when parallelism
is not used. The parallelism should be invisible, and no error should be
thrown when the interpreter deems parallelism not efficient enough to
use. A slight degradation in overall J performance (< 0.1%) would be
acceptable to cover the overhead of making the parallel/no-parallel
decision.
Moving to the coarse-parallelism discussion: the parallel operation
should be a separate function in J, and should not impact the execution
of any code that the programmer has not requested to run in parallel
via the parallel function.
Skip Cave
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm