Re: [Jbeta] J904 multithreading

'Pascal Jasmin' via Beta Sun, 10 Apr 2022 06:52:34 -0700

Some of my naive observations,

I like the design of "if out of threads, run on main thread".  If you run out 
of execution units, then the likely higher performance is to run on main 
instead of queueing/switching between "too many threads".  This does not 
actually guarantee that setting T. to cores -1 produces predictable performance 
because computer can be busy including other J environments that also set T. to 
cores -1 under a presumption of being the only application running.

An implication of this model is that a thread is only running one pyx at a 
time, and deleting a pyx should include the safe termination of that thread.  A 
command to at least delete all idle threads should be simple enough (main goal 
is to avoid restarting J).

A different multiprocessing model is one permitted by zeromq.  A dbms system 
running on another process maintains its state and responds to 
insert/set/delete/queries in some usable/rational sequence, where state means 
that queries are already a function of time.

The t. model is completely unsuited to thread state.  It may be possible to do 
(". t. '') 'myglobal =: f worldstatedata', but you never should.  Instead 
something like myglobal =: > f t. '' worldstatedata  is still a 
"semi-functional" approach where worldstatedata is snapshotted at time of call, 
a functional result is returned, and then a "transactional/acid" update is 
made.  No thread complexity/management/races/surprises involved because the 
result of f worldsnapshot is functional (same input = same output).

Killing a pyx is important if only to stop infinite loops/non termination, but 
also in distributed search, when some other thread has found "the solution".  
Even if a user makes the discouraged practice of assigning a global inside a 
thread, perhaps just like jbreak.bat, killing a thread/process only responds at 
some memory coherently possible point.  Killing a pyx before finished, will 
either result in no change to the global if it were the last action/update in 
the non-terminating "command", or if it was a "state toggle" (perhaps thread 
locking/access management) repeatedly called by the function, then the thread 
was already unpredictable as to the state, and like objects/classes, code needs 
a "reset toggle states" function in case of errors that cause the normal 
"resets" not to execute.  Killing a thread is an "ordinary error condition" 
that state dependence must be prepared for.

An interesting way to have t. manage state like a zeromq "service" would be to 
use the '' n parameter for managing names/state in threads.  So, if  0 T. 
'myname1 myname2' is called, then that whenever those names are used as 
arguments to functions, the function would be applied in that thread, and that 
thread "owns the data", and access to "that data" would automatically even 
without t. call, run in that thread.

On Sunday, April 10, 2022, 07:50:57 a.m. EDT, Henry Rich <[email protected]> 
wrote: 

Killing a thread is going to be a problem. What do you do with the 
values the thread was using?  They will never be freed.  You can't just 
free all the memory the thread allocated, because some of it may be 
values now being used by other threads.

My thought was that you would have to interrupt the system and use the 
debugger to force an error in the thread.  That would be as close as 
possible to killing it.

Henry Rich

On 4/9/2022 6:52 PM, 'Pascal Jasmin' via Beta wrote:
> There should probably be a T. command to delete all threads (to reset total 
> allocations without restarting J), and to "kill a pyx and stop a thread 
> running that pyx"
>
>
>
>
>
>
> On Saturday, April 9, 2022, 06:22:03 p.m. EDT, Henry Rich 
> <[email protected]> wrote:
>
>
>
>
>
> Threads can be created with options: core placement etc.  The '' is for
> future expansion
>
> Likewise, threads can execute with options; for example, maybe the
> master thread should be reserved to distribute work to the others.
>
> How many cores you use is up to you.  (I couldn't find an OS-independent
> way to decide).  JE generally does few data-dependent branches, and I
> expect it would not be a good idea to use two hyperthreads in one core;
> but you'll have to make that decision.
>
> Henry Rich
>
> On 4/9/2022 5:38 PM, 'Pascal Jasmin' via Beta wrote:
>> If I have a ryzen processor with 6 cores, 12 (intel term hypervisor) 
>> execution units should I run 0 T. '' 10-11 times to "give JE" up to that 
>> many threads?  a way to shorthand that repetition:
>>
>> taskS =. 3 : '0 T. '''' '^:
>> 10 taskS ''
>>
>> assumptions:
>> 1 T. '' returns the last task id created.
>> 2 T. '' returns a list of all task ids
>> 3 T. '' returns 0
>> 4 T. '' returns empty/i.0
>>
>> is there a future expectation to use n in t. or T. for other than '' ?  What 
>> might that be?
>>
>> t =: t.'' NB. must have convenience, if '' is constant.
>>
>>
>>       a =. i.4 10000000
>>
>>      timex '+/&:> +/"1 a'
>>
>> 0.0208033
>>
>> timex ' +/t "1 a'  NB. should just be overhead of sending 40m integers 
>> accross 4 tasks
>>
>> 0.0768311
>>
>> timex '+/&:> +/t "1 a'  NB. includes waiting for results for final sum
>>
>> 0.0935162
>>
>> I think these are good results. For an unsuitable threaded operation.  It 
>> scales by datavolume.  5gb/sec.  The gathering is quick.
>>
>>
>>
>>
>> On Saturday, April 9, 2022, 03:04:28 p.m. EDT, Henry Rich 
>> <[email protected]> wrote:
>>
>>
>>
>>
>>
>> The big news in J904 is multithreading.  J verbs can now be run in
>> different cores, sharing the J global namespaces.
>>
>> [A version of J with limited support for multithreading was revealed
>> last year by John Ference of Monument AI.  John's work was an important
>> proof of concept, but the J904 version is an independent development
>> using pthreads instead of OpenMP.  John has agreed with Jsoftware to
>> give Jsoftware access to the code and testcases used in his work.  We
>> have not used the code.]
>>
>> Two primitives support multithreading:
>>
>> (0 T. '')   creates a thread on a new core.  A thread running a verb is
>> called a /task/. Even if you don't plan to create tasks, you should
>> create threads to allow JE to use them for its own purposes.
>>
>> ([x] u t. '' y) executes ([x] <@:u y) as a task to produce a boxed
>> result.  The thread that executed t. continues to run.  But the result
>> is a special kind of box called a /pyx/.
>>
>> ['pyx' is Greek for box, and has been brought into English on occasion
>> when a word has been needed for a special box.  The most recent such
>> borrowing until now is for the box used by the Royal Mint to hold coins
>> for assay.]
>>
>> A pyx looks on the outside like any other box.  You can pass it as an
>> argument.  But as soon as you look inside it, your thread is blocked
>> until the task producing the pyx's value has completed. Then you see the
>> value, and your thread continues.
>>
>> Tasks share the global namespaces and thus can freely read, write, stomp
>> on, and delete names visible to other tasks.
>>
>> We encourage you to experiment with threads and tasks.  We are sure that
>> a layer of synchronizing primitives - semaphores, locks, and mutexes -
>> will be needed, but we would like to get practical experience with tasks
>> before we implement them.  Suggestions welcome.
>>
>> Henry Rich
>>
>>
>>
>>
>>
>>
>

-- 
This email has been checked for viruses by AVG.
https://www.avg.com

----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm

Re: [Jbeta] J904 multithreading

Reply via email to