Thank you Hartmut, 

Yes I want to pass different arguments to different invocation. For example I 
can have different sets of channels that some localities will need to use to 
communicate.

I am able to use the parallel_executor as follow:

-----
hpx::parallel::execution::parallel_executor exec;

hpx::when_all(hpx::parallel::execution::bulk_async_execute(
      exec, &spmd_bulk, args)).get();

——

Now I am trying to understand where those tasks are launched.

When I run this using -hpx::threads 1 with 8 MPI ranks the hpx::find_here() is 
telling me that all my spmd_bulk are running on the same locality (even if I 
have 8 localities), as a consequence the execution is serialized. 

I thought that this executor was supposed to spawn the tasks over different 
localities, but probably I am missing something (or some parameters).

How can I spawn my tasks over multiple localities?

Should I use something like this instead? 
https://stellar-group.github.io/hpx/docs/html/hpx/manual/containers/partitioned_vector_views/spmd_block.html
 
<https://stellar-group.github.io/hpx/docs/html/hpx/manual/containers/partitioned_vector_views/spmd_block.html>
Thank you,
Steve




> On 13 Oct 2017, at 06:08, Hartmut Kaiser <[email protected]> wrote:
> 
> Steve,
> 
>> Yes, I got the things to work distributing the channels from the main
>> task, but I guess simply using hpx:async to spawn the tasks in an SPMD
>> fashion is not the best to scale, but it’s the only way I have now to pass
>> all these channels to each locality.
>> 
>> Using the other executors that you mentioned how can I pass different
>> arguments to different localities (e.g., the channels)?
> 
> Do you want to pass different arguments to different invocations and still 
> use tree-spawning?
> 
>> Is there any example of using tree-like spawning or bulk_async_execute
>> other then the tests?
> 
> For efficient _local_ tree-style spawning you can use the parallel_executor. 
> See here for an example: 
> https://github.com/STEllAR-GROUP/hpx/blob/master/tests/unit/parallel/executors/parallel_executor.cpp#L94-L113.
>  Please note that the 3rd parameter to bulk_async_execute could be any 
> (input) range, for instance a boost::irange(0, num) or similar, it doesn't 
> have to be a real container.
> 
> Remote tree-style spawning can be done using broadcast (or 
> broadcast_with_index), see here for an example: 
> https://github.com/STEllAR-GROUP/hpx/blob/master/tests/unit/lcos/broadcast.cpp
> 
> HTH
> Regards Hartmut
> ---------------
> http://boost-spirit.com
> http://stellar.cct.lsu.edu
> 
> 
>> 
>> Thanks,
>> Steve
>> 
>> 
>> 
>> 
>> 
>> 
>> On 12 Oct 2017, at 09:53, Hartmut Kaiser <[email protected]> wrote:
>> 
>> Steve,
>> 
>> 
>> I see your point, but the problem is that I cannot know when the receiver
>> is going to be executes and so when it will receive the data.
>> 
>> Is there a way to wait on this channel until some task connects to and
>> make a get? This way I can keep the channel (and the task) alive until the
>> data has been consumed. I think there should be some mechanism to say
>> “wait for N tasks to connect”.
>> 
>> That's part of the deeper problem I hinted at. Let me think about this
>> some more before I respond.
>> 
>> 
>> I am reporting my other question:
>> More in general, when you have many channels at scale, do you
>> think is better to use a register_as/connect_to mechanism or to pass alle
>> the necessary channels to each locality at the beginning when I do the
>> initial SPMD spawn?
>> 
>> We usually use the register_as/connect_to mechanism as it separates
>> concerns nicely. But YMMV, so please try it out for yourself what works
>> best in your case. Sending a channel over the wire might overcome the
>> lifetime issues we're discussing as this would keep the channel alive no
>> matter what.
>> 
>> Regards Hartmut
>> ---------------
>> http://boost-spirit.com
>> http://stellar.cct.lsu.edu
>> 
>> 
>> 
>> 
>> Thank you,
>> Steve
>> 
>> 
>> 
>> 
>> 
>> On 12 Oct 2017, at 09:28, Hartmut Kaiser <[email protected]> wrote:
>> 
>> Steve,
>> 
>> 
>> Going back to the channels discussion, how do you know if a channel has
>> been already registered or not?
>> 
>> Look at the following code below (running with 2 localities):
>> - they create or connect to the same channel name
>> - if I introduce a delay in the receiver locality I get the following
>> error:
>> 
>> Assertion failed: (buffer_map_.empty()), function ~receive_buffer,
>> file hpx_install/include/hpx/lcos/local/receive_buffer.hpp, line 101.
>> 
>> I think I understand what is going on. This assert says that you're trying
>> to destroy a channel with data sitting in the pipeline. This is caused by
>> your code:
>> 
>>  {
>>       hpx::lcos::channel<int> c(hpx::find_here());
>>       c.register_as(“name");
>>       c.set(3);
>>  }
>> 
>> Which makes the channel go out of scope before the other locality had a
>> chance to connect and to extract the value.
>> 
>> There is also a deeper problem lurking behind the scenes, but that's
>> something I need to think about more before doing anything about it.
>> 
>> Regards Hartmut
>> ---------------
>> http://boost-spirit.com
>> http://stellar.cct.lsu.edu
>> 
>> 
>> 
>> Here is the sample code:
>> —————————
>> 
>> 
>> void spmd_int(DataFlow::ShardId cid){
>> if(cid%2==0){
>>  hpx::lcos::channel<int> c(hpx::find_here());
>>  c.register_as(“name");
>>  c.set(3);
>> }
>> else{
>>  usleep(1000000); // delay
>>  hpx::lcos::channel<int> c;
>>  c.connect_to("name");
>>  hpx::future<int> p = c.get();
>>  p.get();
>> }
>> }
>> 
>> 
>> int hpx_main(boost::program_options::variables_map& vm){
>> std::vector<hpx::naming::id_type> localities =
>>          hpx::find_all_localities();
>> 
>> int loc_count = 0;
>> for(auto loc: localities){
>> 
>>  spmd_int_action act;
>>  hpx::async(act, loc, loc_count);
>> 
>>  loc_count++;
>> }
>> }
>> 
>> —————————
>> 
>> 
>> What is happening here?
>> If I add a
>> c.connect_to("name");
>> 
>> to the the same task that does the registration (after
>> c.register_as(“name");) it works, but I don’t like it (and in my actual
>> application I still get this error).
>> 
>> More in general, when you have many of this channels at scale, do you
>> think is better to use a register_as/connect_to mechanism or to pass alle
>> the necessary channels to each locality at the beginning when I do the
>> initial SPMD spawn?
>> 
>> Thanks,
>> Steve
>> 
>> 
>> On 11 Oct 2017, at 14:53, Steve Petruzza <[email protected]> wrote:
>> 
>> Thanks Hartmut, it all makes sense now.
>> 
>> 
>> 
>> On 11 Oct 2017, at 14:51, Hartmut Kaiser <[email protected]> wrote:
>> 
>> 
>> 
>> I think I’ve found a workaround.
>> 
>> If I use a typedef as following:
>> 
>> typedef std::vector<char> vec_char;
>> 
>> HPX_REGISTER_CHANNEL(vec_char);
>> 
>> It works, but if I try to use directly:
>> HPX_REGISTER_CHANNEL(std::vector<char>)
>> 
>> this gives me the error I reported before.
>> The issue might be in the expansion of the macro HPX_REGISTER_CHANNEL.
>> 
>> Yes, that confirms my suspicion. I will have a looks what's going on.
>> 
>> Doh! The problem is that the (literal) parameter you give to the macro has
>> to conform to the rules of a valid symbol name, i.e. no special characters
>> are allowed (no '<', '>', etc.). Sorry, this has to be documented properly
>> somewhere, and I forgot to mention it in the first place.
>> 
>> The 'workaround' you propose is the only way to circumvent problems. There
>> is nothing we can do about it.
>> 
>> Also, wrt your comment that everything is working if you use
>> hpx::lcos::local::channel instead - this is not surprising. The local
>> channel type is representing a channel which can be used inside a given
>> locality only (no remote operation, just inter-thread/inter-task
>> communication), hence its name. Those channels don't require the use of
>> the ugly macros, thus there is no problem.
>> 
>> HTH
>> Regards Hartmut
>> ---------------
>> http://boost-spirit.com
>> http://stellar.cct.lsu.edu
>> 
>> 
>> 
>> 
>> Thanks!
>> Regards Hartmut
>> ---------------
>> http://boost-spirit.com
>> http://stellar.cct.lsu.edu
>> 
>> 
>> 
>> 
>> Steve
>> 
>> 
>> 
>> 
>> On 10 Oct 2017, at 18:38, Steve Petruzza <[email protected]> wrote:
>> 
>> Sorry, regarding the version that I am using it is the commit of your
>> split_future for vector:
>> 
>> Adding split_future for std::vector
>> 
>> - this fixes #2940
>> 
>> commit 8ecf8197f9fc9d1cd45a7f9ee61a7be07ba26f46
>> 
>> Steve
>> 
>> 
>> 
>> On 10 Oct 2017, at 18:33, Steve Petruzza <[email protected]> wrote:
>> 
>> hpx::find_here()
>> 
>> 
> 
> 

_______________________________________________
hpx-users mailing list
[email protected]
https://mail.cct.lsu.edu/mailman/listinfo/hpx-users

Reply via email to