Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-25 Thread Dirk Eddelbuettel


On 25 August 2023 at 15:37, Uwe Ligges wrote:
| 
| 
| On 23.08.2023 16:00, Scott Ritchie wrote:
| > Hi Uwe,
| > 
| > I agree and have also been burnt myself by programs occupying the 
| > maximum number of cores available.
| > 
| > My understanding is that in the absence of explicit parallelisation, use 
| > of data.table in a package should not lead to this type of behaviour?
| 
| Yes, that would be my hope, too.

No everybody involved with data.table thinks using 50% is already a
compromise giving up performance, see eg Jan's comment from yesterday (and
everything leading up to it):

   https://github.com/Rdatatable/data.table/issues/5658#issuecomment-1691831704

*You* have a local constraint (that is perfectly reasonable) as *you* run
multiple package tests. So *you* should set a low value for OMP_THREAD_LIMIT.

Many users spend top dollars to have access to high-powered machines for
high-powered analyses. They do want all cores.

There simply cannot be one setting that addresses all situations. Please set
a low limit as your local deployment requires it.

Dirk

-- 
dirk.eddelbuettel.com | @eddelbuettel | e...@debian.org

__
R-package-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-package-devel


Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-25 Thread Uwe Ligges




On 23.08.2023 16:00, Scott Ritchie wrote:

Hi Uwe,

I agree and have also been burnt myself by programs occupying the 
maximum number of cores available.


My understanding is that in the absence of explicit parallelisation, use 
of data.table in a package should not lead to this type of behaviour?


Yes, that would be my hope, too.

Best,
Uwe Ligges




Best,

Scott

On Wed, 23 Aug 2023 at 14:30, Uwe Ligges 
<mailto:lig...@statistik.tu-dortmund.de>> wrote:


I (any many collegues here) have been caught several times by the
following example:

1. did something in parallel on a cluster, set up via
parallel::makeCluster().
2. e.g. allocated 20 cores and got them on one single machine
3. ran some code in parallel via parLapply()

Bang! 400 threads;
So I have started 20 parallel processes, each of which is using the
automatically set max. 20 threads as OMP_THREAD_LIMIT was also adjusted
by the cluster to 20 (rather than 1).

Hence, I really believe a default should always be small, not only in
examples and tests, but generally. And people who aim for more
should be
able to increase the defaults.

Do you believe a software that auto-occupies a 96 core machines with 96
threads by default is sensible?

Best,
Uwe Ligges






On 21.08.2023 21:59, Berry Boessenkool wrote:
 >
 > If you add that to each exported function, isn't that a lot of
code to read + maintain?
 > Also, it seems like unnecessary computational overhead.
 >  From a software design point of view, it might be nicer to set
that in the examples + tests.
 >
 > Regards,
 > Berry
 >
 > 
 > From: R-package-devel mailto:r-package-devel-boun...@r-project.org>> on behalf of Scott
Ritchie mailto:sritchi...@gmail.com>>
 > Sent: Monday, August 21, 2023 19:23
 > To: Dirk Eddelbuettel mailto:e...@debian.org>>
 > Cc: r-package-devel@r-project.org
<mailto:r-package-devel@r-project.org>
    mailto:r-package-devel@r-project.org>>
 > Subject: Re: [R-pkg-devel] Trouble with long-running tests on
CRAN debian server
 >
 > Thanks Dirk and Ivan,
 >
 > I took a slightly different work-around of forcing the number of
threads to
 > 1 when running functions of the test dataset in the package, by
adding the
 > following to each user facing function:
 >
 > ```
 >    # Check if running on package test_data, and if so, force
data.table to
 > be
 >    # single threaded so that we can avoid a NOTE on CRAN submission
 >    if (isTRUE(all.equal(x, ukbnmr::test_data))) {
 >      registered_threads <- getDTthreads()
 >      setDTthreads(1)
 >      on.exit({ setDTthreads(registered_threads) }) # re-register
so no
 > unintended side effects for users
 >    }
 > ```
 > (i.e. here x is the input argument to the function)
 >
 > It took some trial and error to get to pass the CRAN tests; the
number of
 > columns in the input data was also contributing to the problem.
 >
 > Best,
 >
 > Scott
 >
 >
 > On Mon, 21 Aug 2023 at 14:38, Dirk Eddelbuettel mailto:e...@debian.org>> wrote:
 >
 >>
 >> On 21 August 2023 at 16:05, Ivan Krylov wrote:
 >> | Dirk is probably right that it's a good idea to have
OMP_THREAD_LIMIT=2
 >> | set on the CRAN check machine. Either that, or place the
responsibility
 >> | on data.table for setting the right number of threads by
default. But
 >> | that's a policy question: should a CRAN package start no more
than two
 >> | threads/child processes even if it doesn't know it's running in an
 >> | environment where the CPU time / elapsed time limit is two?
 >>
 >> Methinks that given this language in the CRAN Repository Policy
 >>
 >>    If running a package uses multiple threads/cores it must
never use more
 >>    than two simultaneously: the check farm is a shared resource
and will
 >>    typically be running many checks simultaneously.
 >>
 >> it would indeed be nice if this variable, and/or equivalent
ones, were set.
 >>
 >> As I mentioned before, I had long added a similar throttle (not for
 >> data.table) in a package I look after (for work, even). So a similar
 >> throttler with optionality is below. I'll add this to my `dang`
package
 >> collecting various functions.
 >>
 >> A usage example follows. It does nothing by default, ensuring
'full power'
 >> but reflects the minimum of two 

Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-23 Thread Jeff Newmiller
I think one should be very cautious about overriding "standard" mechanisms for 
controlling software infrastructure like OpenMP.  You risk making the task of 
navigating the already-complex task of configuring the software environment 
even more complex by increasing the number of places you have to look in to 
find out why the mechanism documented by OpenMP is having no effect.

It may be that R Core agrees with you and creates an R-specific setting to 
control this... but IMO it should be accompanied by warning messages to help 
people figure out why their real work is underperforming if they link with 
compiled code that is supposed to make use of threads.

On August 23, 2023 7:24:46 AM PDT, Uwe Ligges  
wrote:
>
>
>On 23.08.2023 15:58, Jeff Newmiller wrote:
>> To whom are you addressing this question? The OpenMP developers who define 
>> the missing-OMP_THREAD_LIMIT behaviour and-or supply default config files? 
>> The CRAN server administrators who set the variable in their site-wide 
>> configuration intentionally or unintentionally? Or the package authors 
>> expected to kludge in settings to override those defaults for CRAN testing 
>> while not overriding them in normal use?
>
>Of course , the CRAN teams controls the env vars on the CRAN servers, but not 
>on a server a user might use. And a user is typically unaware that a package 
>uses multithreading.
>R users are typically not developers with a lot of insight in computer 
>science. Most R users I know would not even know how to set an env var.
>
>So why do you ecxpect your users to set an appropriate OMP_THREAD_LIMIT? 
>Particularly when they aim at parallelization, they have to set it to 1.
>I advocate not only to limit the number of cores for CRAN but also (and 
>inparticular)  the default! Something we cannot check easily.
>
>
>An alternative would be to teach R to set OMP_THREAD_LIMIT=1 locally by 
>default and a mechanism to change that for users.
>
>Best,
>Uwe Ligges
>
>
>> 
>> I would vote for explicitly addressing this (rhetorical?) question to the 
>> CRAN server administrators...
>> 
>> On August 23, 2023 6:31:01 AM PDT, Uwe Ligges 
>>  wrote:
>>> I (any many collegues here) have been caught several times by the following 
>>> example:
>>> 
>>> 1. did something in parallel on a cluster, set up via 
>>> parallel::makeCluster().
>>> 2. e.g. allocated 20 cores and got them on one single machine
>>> 3. ran some code in parallel via parLapply()
>>> 
>>> Bang! 400 threads;
>>> So I have started 20 parallel processes, each of which is using the 
>>> automatically set max. 20 threads as OMP_THREAD_LIMIT was also adjusted by 
>>> the cluster to 20 (rather than 1).
>>> 
>>> Hence, I really believe a default should always be small, not only in 
>>> examples and tests, but generally. And people who aim for more should be 
>>> able to increase the defaults.
>>> 
>>> Do you believe a software that auto-occupies a 96 core machines with 96 
>>> threads by default is sensible?
>>> 
>>> Best,
>>> Uwe Ligges
>>> 
>>> 
>>> 
>>> 
>>> 
>>> 
>>> On 21.08.2023 21:59, Berry Boessenkool wrote:
>>>> 
>>>> If you add that to each exported function, isn't that a lot of code to 
>>>> read + maintain?
>>>> Also, it seems like unnecessary computational overhead.
>>>>   From a software design point of view, it might be nicer to set that in 
>>>> the examples + tests.
>>>> 
>>>> Regards,
>>>> Berry
>>>> 
>>>> 
>>>> From: R-package-devel  on behalf of 
>>>> Scott Ritchie 
>>>> Sent: Monday, August 21, 2023 19:23
>>>> To: Dirk Eddelbuettel 
>>>> Cc: r-package-devel@r-project.org 
>>>> Subject: Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian 
>>>> server
>>>> 
>>>> Thanks Dirk and Ivan,
>>>> 
>>>> I took a slightly different work-around of forcing the number of threads to
>>>> 1 when running functions of the test dataset in the package, by adding the
>>>> following to each user facing function:
>>>> 
>>>> ```
>>>> # Check if running on package test_data, and if so, force data.table to
>>>> be
>>>> # single threaded so that we can avoid a NOTE on CRAN submission
>>>> if (isTRUE(all.equal(x, ukbnmr::test_data))) {
>>>>   registered

Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-23 Thread Uwe Ligges




On 23.08.2023 15:58, Jeff Newmiller wrote:

To whom are you addressing this question? The OpenMP developers who define the 
missing-OMP_THREAD_LIMIT behaviour and-or supply default config files? The CRAN 
server administrators who set the variable in their site-wide configuration 
intentionally or unintentionally? Or the package authors expected to kludge in 
settings to override those defaults for CRAN testing while not overriding them 
in normal use?


Of course , the CRAN teams controls the env vars on the CRAN servers, 
but not on a server a user might use. And a user is typically unaware 
that a package uses multithreading.
R users are typically not developers with a lot of insight in computer 
science. Most R users I know would not even know how to set an env var.


So why do you ecxpect your users to set an appropriate OMP_THREAD_LIMIT? 
Particularly when they aim at parallelization, they have to set it to 1.
I advocate not only to limit the number of cores for CRAN but also (and 
inparticular)  the default! Something we cannot check easily.



An alternative would be to teach R to set OMP_THREAD_LIMIT=1 locally by 
default and a mechanism to change that for users.


Best,
Uwe Ligges




I would vote for explicitly addressing this (rhetorical?) question to the CRAN 
server administrators...

On August 23, 2023 6:31:01 AM PDT, Uwe Ligges  
wrote:

I (any many collegues here) have been caught several times by the following 
example:

1. did something in parallel on a cluster, set up via parallel::makeCluster().
2. e.g. allocated 20 cores and got them on one single machine
3. ran some code in parallel via parLapply()

Bang! 400 threads;
So I have started 20 parallel processes, each of which is using the 
automatically set max. 20 threads as OMP_THREAD_LIMIT was also adjusted by the 
cluster to 20 (rather than 1).

Hence, I really believe a default should always be small, not only in examples 
and tests, but generally. And people who aim for more should be able to 
increase the defaults.

Do you believe a software that auto-occupies a 96 core machines with 96 threads 
by default is sensible?

Best,
Uwe Ligges






On 21.08.2023 21:59, Berry Boessenkool wrote:


If you add that to each exported function, isn't that a lot of code to read + 
maintain?
Also, it seems like unnecessary computational overhead.
  From a software design point of view, it might be nicer to set that in the 
examples + tests.

Regards,
Berry


From: R-package-devel  on behalf of Scott 
Ritchie 
Sent: Monday, August 21, 2023 19:23
To: Dirk Eddelbuettel 
Cc: r-package-devel@r-project.org 
Subject: Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

Thanks Dirk and Ivan,

I took a slightly different work-around of forcing the number of threads to
1 when running functions of the test dataset in the package, by adding the
following to each user facing function:

```
# Check if running on package test_data, and if so, force data.table to
be
# single threaded so that we can avoid a NOTE on CRAN submission
if (isTRUE(all.equal(x, ukbnmr::test_data))) {
  registered_threads <- getDTthreads()
  setDTthreads(1)
  on.exit({ setDTthreads(registered_threads) }) # re-register so no
unintended side effects for users
}
```
(i.e. here x is the input argument to the function)

It took some trial and error to get to pass the CRAN tests; the number of
columns in the input data was also contributing to the problem.

Best,

Scott


On Mon, 21 Aug 2023 at 14:38, Dirk Eddelbuettel  wrote:



On 21 August 2023 at 16:05, Ivan Krylov wrote:
| Dirk is probably right that it's a good idea to have OMP_THREAD_LIMIT=2
| set on the CRAN check machine. Either that, or place the responsibility
| on data.table for setting the right number of threads by default. But
| that's a policy question: should a CRAN package start no more than two
| threads/child processes even if it doesn't know it's running in an
| environment where the CPU time / elapsed time limit is two?

Methinks that given this language in the CRAN Repository Policy

If running a package uses multiple threads/cores it must never use more
than two simultaneously: the check farm is a shared resource and will
typically be running many checks simultaneously.

it would indeed be nice if this variable, and/or equivalent ones, were set.

As I mentioned before, I had long added a similar throttle (not for
data.table) in a package I look after (for work, even). So a similar
throttler with optionality is below. I'll add this to my `dang` package
collecting various functions.

A usage example follows. It does nothing by default, ensuring 'full power'
but reflects the minimum of two possible options, or an explicit count:

  > dang::limitDataTableCores(verbose=TRUE)
  Limiting data.table to '12'.
  > Sys.setenv("OMP_THREAD_LIMIT"=3);
dang::limitDataTableCores(verbose=TRUE)
  Limiti

Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-23 Thread Scott Ritchie
Hi Uwe,

I agree and have also been burnt myself by programs occupying the maximum
number of cores available.

My understanding is that in the absence of explicit parallelisation, use of
data.table in a package should not lead to this type of behaviour?

Best,

Scott

On Wed, 23 Aug 2023 at 14:30, Uwe Ligges 
wrote:

> I (any many collegues here) have been caught several times by the
> following example:
>
> 1. did something in parallel on a cluster, set up via
> parallel::makeCluster().
> 2. e.g. allocated 20 cores and got them on one single machine
> 3. ran some code in parallel via parLapply()
>
> Bang! 400 threads;
> So I have started 20 parallel processes, each of which is using the
> automatically set max. 20 threads as OMP_THREAD_LIMIT was also adjusted
> by the cluster to 20 (rather than 1).
>
> Hence, I really believe a default should always be small, not only in
> examples and tests, but generally. And people who aim for more should be
> able to increase the defaults.
>
> Do you believe a software that auto-occupies a 96 core machines with 96
> threads by default is sensible?
>
> Best,
> Uwe Ligges
>
>
>
>
>
>
> On 21.08.2023 21:59, Berry Boessenkool wrote:
> >
> > If you add that to each exported function, isn't that a lot of code to
> read + maintain?
> > Also, it seems like unnecessary computational overhead.
> >  From a software design point of view, it might be nicer to set that in
> the examples + tests.
> >
> > Regards,
> > Berry
> >
> > 
> > From: R-package-devel  on behalf
> of Scott Ritchie 
> > Sent: Monday, August 21, 2023 19:23
> > To: Dirk Eddelbuettel 
> > Cc: r-package-devel@r-project.org 
> > Subject: Re: [R-pkg-devel] Trouble with long-running tests on CRAN
> debian server
> >
> > Thanks Dirk and Ivan,
> >
> > I took a slightly different work-around of forcing the number of threads
> to
> > 1 when running functions of the test dataset in the package, by adding
> the
> > following to each user facing function:
> >
> > ```
> ># Check if running on package test_data, and if so, force data.table
> to
> > be
> ># single threaded so that we can avoid a NOTE on CRAN submission
> >if (isTRUE(all.equal(x, ukbnmr::test_data))) {
> >  registered_threads <- getDTthreads()
> >  setDTthreads(1)
> >  on.exit({ setDTthreads(registered_threads) }) # re-register so no
> > unintended side effects for users
> >}
> > ```
> > (i.e. here x is the input argument to the function)
> >
> > It took some trial and error to get to pass the CRAN tests; the number of
> > columns in the input data was also contributing to the problem.
> >
> > Best,
> >
> > Scott
> >
> >
> > On Mon, 21 Aug 2023 at 14:38, Dirk Eddelbuettel  wrote:
> >
> >>
> >> On 21 August 2023 at 16:05, Ivan Krylov wrote:
> >> | Dirk is probably right that it's a good idea to have
> OMP_THREAD_LIMIT=2
> >> | set on the CRAN check machine. Either that, or place the
> responsibility
> >> | on data.table for setting the right number of threads by default. But
> >> | that's a policy question: should a CRAN package start no more than two
> >> | threads/child processes even if it doesn't know it's running in an
> >> | environment where the CPU time / elapsed time limit is two?
> >>
> >> Methinks that given this language in the CRAN Repository Policy
> >>
> >>If running a package uses multiple threads/cores it must never use
> more
> >>than two simultaneously: the check farm is a shared resource and will
> >>typically be running many checks simultaneously.
> >>
> >> it would indeed be nice if this variable, and/or equivalent ones, were
> set.
> >>
> >> As I mentioned before, I had long added a similar throttle (not for
> >> data.table) in a package I look after (for work, even). So a similar
> >> throttler with optionality is below. I'll add this to my `dang` package
> >> collecting various functions.
> >>
> >> A usage example follows. It does nothing by default, ensuring 'full
> power'
> >> but reflects the minimum of two possible options, or an explicit count:
> >>
> >>  > dang::limitDataTableCores(verbose=TRUE)
> >>  Limiting data.table to '12'.
> >>  > Sys.setenv("OMP_THREAD_LIMIT"=3);
> >> dang::limitDataTableCores(verbose=TRUE)
> >>  Limiting data.table to '3'.
> >>  > o

Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-23 Thread Jeff Newmiller
To whom are you addressing this question? The OpenMP developers who define the 
missing-OMP_THREAD_LIMIT behaviour and-or supply default config files? The CRAN 
server administrators who set the variable in their site-wide configuration 
intentionally or unintentionally? Or the package authors expected to kludge in 
settings to override those defaults for CRAN testing while not overriding them 
in normal use?

I would vote for explicitly addressing this (rhetorical?) question to the CRAN 
server administrators...

On August 23, 2023 6:31:01 AM PDT, Uwe Ligges  
wrote:
>I (any many collegues here) have been caught several times by the following 
>example:
>
>1. did something in parallel on a cluster, set up via parallel::makeCluster().
>2. e.g. allocated 20 cores and got them on one single machine
>3. ran some code in parallel via parLapply()
>
>Bang! 400 threads;
>So I have started 20 parallel processes, each of which is using the 
>automatically set max. 20 threads as OMP_THREAD_LIMIT was also adjusted by the 
>cluster to 20 (rather than 1).
>
>Hence, I really believe a default should always be small, not only in examples 
>and tests, but generally. And people who aim for more should be able to 
>increase the defaults.
>
>Do you believe a software that auto-occupies a 96 core machines with 96 
>threads by default is sensible?
>
>Best,
>Uwe Ligges
>
>
>
>
>
>
>On 21.08.2023 21:59, Berry Boessenkool wrote:
>> 
>> If you add that to each exported function, isn't that a lot of code to read 
>> + maintain?
>> Also, it seems like unnecessary computational overhead.
>>  From a software design point of view, it might be nicer to set that in the 
>> examples + tests.
>> 
>> Regards,
>> Berry
>> 
>> 
>> From: R-package-devel  on behalf of 
>> Scott Ritchie 
>> Sent: Monday, August 21, 2023 19:23
>> To: Dirk Eddelbuettel 
>> Cc: r-package-devel@r-project.org 
>> Subject: Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian 
>> server
>> 
>> Thanks Dirk and Ivan,
>> 
>> I took a slightly different work-around of forcing the number of threads to
>> 1 when running functions of the test dataset in the package, by adding the
>> following to each user facing function:
>> 
>> ```
>># Check if running on package test_data, and if so, force data.table to
>> be
>># single threaded so that we can avoid a NOTE on CRAN submission
>>if (isTRUE(all.equal(x, ukbnmr::test_data))) {
>>  registered_threads <- getDTthreads()
>>  setDTthreads(1)
>>  on.exit({ setDTthreads(registered_threads) }) # re-register so no
>> unintended side effects for users
>>}
>> ```
>> (i.e. here x is the input argument to the function)
>> 
>> It took some trial and error to get to pass the CRAN tests; the number of
>> columns in the input data was also contributing to the problem.
>> 
>> Best,
>> 
>> Scott
>> 
>> 
>> On Mon, 21 Aug 2023 at 14:38, Dirk Eddelbuettel  wrote:
>> 
>>> 
>>> On 21 August 2023 at 16:05, Ivan Krylov wrote:
>>> | Dirk is probably right that it's a good idea to have OMP_THREAD_LIMIT=2
>>> | set on the CRAN check machine. Either that, or place the responsibility
>>> | on data.table for setting the right number of threads by default. But
>>> | that's a policy question: should a CRAN package start no more than two
>>> | threads/child processes even if it doesn't know it's running in an
>>> | environment where the CPU time / elapsed time limit is two?
>>> 
>>> Methinks that given this language in the CRAN Repository Policy
>>> 
>>>If running a package uses multiple threads/cores it must never use more
>>>than two simultaneously: the check farm is a shared resource and will
>>>typically be running many checks simultaneously.
>>> 
>>> it would indeed be nice if this variable, and/or equivalent ones, were set.
>>> 
>>> As I mentioned before, I had long added a similar throttle (not for
>>> data.table) in a package I look after (for work, even). So a similar
>>> throttler with optionality is below. I'll add this to my `dang` package
>>> collecting various functions.
>>> 
>>> A usage example follows. It does nothing by default, ensuring 'full power'
>>> but reflects the minimum of two possible options, or an explicit count:
>>> 
>>>  > dang::limitDataTableCores(verbose=TRUE)
>>>  Limiting data.table to '12'.
>>>  > Sys.set

Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-23 Thread Uwe Ligges
I (any many collegues here) have been caught several times by the 
following example:


1. did something in parallel on a cluster, set up via 
parallel::makeCluster().

2. e.g. allocated 20 cores and got them on one single machine
3. ran some code in parallel via parLapply()

Bang! 400 threads;
So I have started 20 parallel processes, each of which is using the 
automatically set max. 20 threads as OMP_THREAD_LIMIT was also adjusted 
by the cluster to 20 (rather than 1).


Hence, I really believe a default should always be small, not only in 
examples and tests, but generally. And people who aim for more should be 
able to increase the defaults.


Do you believe a software that auto-occupies a 96 core machines with 96 
threads by default is sensible?


Best,
Uwe Ligges






On 21.08.2023 21:59, Berry Boessenkool wrote:


If you add that to each exported function, isn't that a lot of code to read + 
maintain?
Also, it seems like unnecessary computational overhead.
 From a software design point of view, it might be nicer to set that in the 
examples + tests.

Regards,
Berry


From: R-package-devel  on behalf of Scott 
Ritchie 
Sent: Monday, August 21, 2023 19:23
To: Dirk Eddelbuettel 
Cc: r-package-devel@r-project.org 
Subject: Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

Thanks Dirk and Ivan,

I took a slightly different work-around of forcing the number of threads to
1 when running functions of the test dataset in the package, by adding the
following to each user facing function:

```
   # Check if running on package test_data, and if so, force data.table to
be
   # single threaded so that we can avoid a NOTE on CRAN submission
   if (isTRUE(all.equal(x, ukbnmr::test_data))) {
 registered_threads <- getDTthreads()
 setDTthreads(1)
 on.exit({ setDTthreads(registered_threads) }) # re-register so no
unintended side effects for users
   }
```
(i.e. here x is the input argument to the function)

It took some trial and error to get to pass the CRAN tests; the number of
columns in the input data was also contributing to the problem.

Best,

Scott


On Mon, 21 Aug 2023 at 14:38, Dirk Eddelbuettel  wrote:



On 21 August 2023 at 16:05, Ivan Krylov wrote:
| Dirk is probably right that it's a good idea to have OMP_THREAD_LIMIT=2
| set on the CRAN check machine. Either that, or place the responsibility
| on data.table for setting the right number of threads by default. But
| that's a policy question: should a CRAN package start no more than two
| threads/child processes even if it doesn't know it's running in an
| environment where the CPU time / elapsed time limit is two?

Methinks that given this language in the CRAN Repository Policy

   If running a package uses multiple threads/cores it must never use more
   than two simultaneously: the check farm is a shared resource and will
   typically be running many checks simultaneously.

it would indeed be nice if this variable, and/or equivalent ones, were set.

As I mentioned before, I had long added a similar throttle (not for
data.table) in a package I look after (for work, even). So a similar
throttler with optionality is below. I'll add this to my `dang` package
collecting various functions.

A usage example follows. It does nothing by default, ensuring 'full power'
but reflects the minimum of two possible options, or an explicit count:

 > dang::limitDataTableCores(verbose=TRUE)
 Limiting data.table to '12'.
 > Sys.setenv("OMP_THREAD_LIMIT"=3);
dang::limitDataTableCores(verbose=TRUE)
 Limiting data.table to '3'.
 > options(Ncpus=2); dang::limitDataTableCores(verbose=TRUE)
 Limiting data.table to '2'.
 > dang::limitDataTableCores(1, verbose=TRUE)
 Limiting data.table to '1'.
 >

That makes it, in my eyes, preferable to any unconditional 'always pick 1
thread'.

Dirk


##' Set threads for data.table respecting possible local settings
##'
##' This function set the number of threads \pkg{data.table} will use
##' while reflecting two possible machine-specific settings from the
##' environment variable \sQuote{OMP_THREAD_LIMIT} as well as the R
##' option \sQuote{Ncpus} (uses e.g. for parallel builds).
##' @title Set data.table threads respecting default settingss
##' @param ncores A numeric or character variable with the desired
##' count of threads to use
##' @param verbose A logical value with a default of \sQuote{FALSE} to
##' operate more verbosely
##' @return The return value of the \pkg{data.table} function
##' \code{setDTthreads} which is called as a side-effect.
##' @author Dirk Eddelbuettel
##' @export
limitDataTableCores <- function(ncores, verbose = FALSE) {
 if (missing(ncores)) {
 ## start with a simple fallback: 'Ncpus' (if set) or else 2
 ncores <- getOption("Ncpus", 2L)
 ## also consider OMP_THREAD_LIMIT (cf Writing R Extensions), gets
NA if envvar unset
 ompcore

Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-21 Thread Berry Boessenkool


If you add that to each exported function, isn't that a lot of code to read + 
maintain?
Also, it seems like unnecessary computational overhead.
>From a software design point of view, it might be nicer to set that in the 
>examples + tests.

Regards,
Berry


From: R-package-devel  on behalf of 
Scott Ritchie 
Sent: Monday, August 21, 2023 19:23
To: Dirk Eddelbuettel 
Cc: r-package-devel@r-project.org 
Subject: Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

Thanks Dirk and Ivan,

I took a slightly different work-around of forcing the number of threads to
1 when running functions of the test dataset in the package, by adding the
following to each user facing function:

```
  # Check if running on package test_data, and if so, force data.table to
be
  # single threaded so that we can avoid a NOTE on CRAN submission
  if (isTRUE(all.equal(x, ukbnmr::test_data))) {
registered_threads <- getDTthreads()
setDTthreads(1)
on.exit({ setDTthreads(registered_threads) }) # re-register so no
unintended side effects for users
  }
```
(i.e. here x is the input argument to the function)

It took some trial and error to get to pass the CRAN tests; the number of
columns in the input data was also contributing to the problem.

Best,

Scott


On Mon, 21 Aug 2023 at 14:38, Dirk Eddelbuettel  wrote:

>
> On 21 August 2023 at 16:05, Ivan Krylov wrote:
> | Dirk is probably right that it's a good idea to have OMP_THREAD_LIMIT=2
> | set on the CRAN check machine. Either that, or place the responsibility
> | on data.table for setting the right number of threads by default. But
> | that's a policy question: should a CRAN package start no more than two
> | threads/child processes even if it doesn't know it's running in an
> | environment where the CPU time / elapsed time limit is two?
>
> Methinks that given this language in the CRAN Repository Policy
>
>   If running a package uses multiple threads/cores it must never use more
>   than two simultaneously: the check farm is a shared resource and will
>   typically be running many checks simultaneously.
>
> it would indeed be nice if this variable, and/or equivalent ones, were set.
>
> As I mentioned before, I had long added a similar throttle (not for
> data.table) in a package I look after (for work, even). So a similar
> throttler with optionality is below. I'll add this to my `dang` package
> collecting various functions.
>
> A usage example follows. It does nothing by default, ensuring 'full power'
> but reflects the minimum of two possible options, or an explicit count:
>
> > dang::limitDataTableCores(verbose=TRUE)
> Limiting data.table to '12'.
> > Sys.setenv("OMP_THREAD_LIMIT"=3);
> dang::limitDataTableCores(verbose=TRUE)
> Limiting data.table to '3'.
> > options(Ncpus=2); dang::limitDataTableCores(verbose=TRUE)
> Limiting data.table to '2'.
> > dang::limitDataTableCores(1, verbose=TRUE)
> Limiting data.table to '1'.
> >
>
> That makes it, in my eyes, preferable to any unconditional 'always pick 1
> thread'.
>
> Dirk
>
>
> ##' Set threads for data.table respecting possible local settings
> ##'
> ##' This function set the number of threads \pkg{data.table} will use
> ##' while reflecting two possible machine-specific settings from the
> ##' environment variable \sQuote{OMP_THREAD_LIMIT} as well as the R
> ##' option \sQuote{Ncpus} (uses e.g. for parallel builds).
> ##' @title Set data.table threads respecting default settingss
> ##' @param ncores A numeric or character variable with the desired
> ##' count of threads to use
> ##' @param verbose A logical value with a default of \sQuote{FALSE} to
> ##' operate more verbosely
> ##' @return The return value of the \pkg{data.table} function
> ##' \code{setDTthreads} which is called as a side-effect.
> ##' @author Dirk Eddelbuettel
> ##' @export
> limitDataTableCores <- function(ncores, verbose = FALSE) {
> if (missing(ncores)) {
> ## start with a simple fallback: 'Ncpus' (if set) or else 2
> ncores <- getOption("Ncpus", 2L)
> ## also consider OMP_THREAD_LIMIT (cf Writing R Extensions), gets
> NA if envvar unset
> ompcores <- as.integer(Sys.getenv("OMP_THREAD_LIMIT"))
> ## and then keep the smaller
> ncores <- min(na.omit(c(ncores, ompcores)))
> }
> stopifnot("Package 'data.table' must be installed." =
> requireNamespace("data.table", quietly=TRUE))
> stopifnot("Argument 'ncores' must be numeric or character" =
> is.numeric(ncores) || is.character(ncores))
> if (verbose) message("Limiting data.table to '", ncores, "'.")
> data

Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-21 Thread Scott Ritchie
Thanks Dirk and Ivan,

I took a slightly different work-around of forcing the number of threads to
1 when running functions of the test dataset in the package, by adding the
following to each user facing function:

```
  # Check if running on package test_data, and if so, force data.table to
be
  # single threaded so that we can avoid a NOTE on CRAN submission
  if (isTRUE(all.equal(x, ukbnmr::test_data))) {
registered_threads <- getDTthreads()
setDTthreads(1)
on.exit({ setDTthreads(registered_threads) }) # re-register so no
unintended side effects for users
  }
```
(i.e. here x is the input argument to the function)

It took some trial and error to get to pass the CRAN tests; the number of
columns in the input data was also contributing to the problem.

Best,

Scott


On Mon, 21 Aug 2023 at 14:38, Dirk Eddelbuettel  wrote:

>
> On 21 August 2023 at 16:05, Ivan Krylov wrote:
> | Dirk is probably right that it's a good idea to have OMP_THREAD_LIMIT=2
> | set on the CRAN check machine. Either that, or place the responsibility
> | on data.table for setting the right number of threads by default. But
> | that's a policy question: should a CRAN package start no more than two
> | threads/child processes even if it doesn't know it's running in an
> | environment where the CPU time / elapsed time limit is two?
>
> Methinks that given this language in the CRAN Repository Policy
>
>   If running a package uses multiple threads/cores it must never use more
>   than two simultaneously: the check farm is a shared resource and will
>   typically be running many checks simultaneously.
>
> it would indeed be nice if this variable, and/or equivalent ones, were set.
>
> As I mentioned before, I had long added a similar throttle (not for
> data.table) in a package I look after (for work, even). So a similar
> throttler with optionality is below. I'll add this to my `dang` package
> collecting various functions.
>
> A usage example follows. It does nothing by default, ensuring 'full power'
> but reflects the minimum of two possible options, or an explicit count:
>
> > dang::limitDataTableCores(verbose=TRUE)
> Limiting data.table to '12'.
> > Sys.setenv("OMP_THREAD_LIMIT"=3);
> dang::limitDataTableCores(verbose=TRUE)
> Limiting data.table to '3'.
> > options(Ncpus=2); dang::limitDataTableCores(verbose=TRUE)
> Limiting data.table to '2'.
> > dang::limitDataTableCores(1, verbose=TRUE)
> Limiting data.table to '1'.
> >
>
> That makes it, in my eyes, preferable to any unconditional 'always pick 1
> thread'.
>
> Dirk
>
>
> ##' Set threads for data.table respecting possible local settings
> ##'
> ##' This function set the number of threads \pkg{data.table} will use
> ##' while reflecting two possible machine-specific settings from the
> ##' environment variable \sQuote{OMP_THREAD_LIMIT} as well as the R
> ##' option \sQuote{Ncpus} (uses e.g. for parallel builds).
> ##' @title Set data.table threads respecting default settingss
> ##' @param ncores A numeric or character variable with the desired
> ##' count of threads to use
> ##' @param verbose A logical value with a default of \sQuote{FALSE} to
> ##' operate more verbosely
> ##' @return The return value of the \pkg{data.table} function
> ##' \code{setDTthreads} which is called as a side-effect.
> ##' @author Dirk Eddelbuettel
> ##' @export
> limitDataTableCores <- function(ncores, verbose = FALSE) {
> if (missing(ncores)) {
> ## start with a simple fallback: 'Ncpus' (if set) or else 2
> ncores <- getOption("Ncpus", 2L)
> ## also consider OMP_THREAD_LIMIT (cf Writing R Extensions), gets
> NA if envvar unset
> ompcores <- as.integer(Sys.getenv("OMP_THREAD_LIMIT"))
> ## and then keep the smaller
> ncores <- min(na.omit(c(ncores, ompcores)))
> }
> stopifnot("Package 'data.table' must be installed." =
> requireNamespace("data.table", quietly=TRUE))
> stopifnot("Argument 'ncores' must be numeric or character" =
> is.numeric(ncores) || is.character(ncores))
> if (verbose) message("Limiting data.table to '", ncores, "'.")
> data.table::setDTthreads(ncores)
> }
>
> |
> | --
> | Best regards,
> | Ivan
> |
> | __
> | R-package-devel@r-project.org mailing list
> | https://stat.ethz.ch/mailman/listinfo/r-package-devel
>
> --
> dirk.eddelbuettel.com | @eddelbuettel | e...@debian.org
>

[[alternative HTML version deleted]]

__
R-package-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-package-devel


Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-21 Thread Dirk Eddelbuettel


On 21 August 2023 at 16:05, Ivan Krylov wrote:
| Dirk is probably right that it's a good idea to have OMP_THREAD_LIMIT=2
| set on the CRAN check machine. Either that, or place the responsibility
| on data.table for setting the right number of threads by default. But
| that's a policy question: should a CRAN package start no more than two
| threads/child processes even if it doesn't know it's running in an
| environment where the CPU time / elapsed time limit is two?

Methinks that given this language in the CRAN Repository Policy

  If running a package uses multiple threads/cores it must never use more
  than two simultaneously: the check farm is a shared resource and will
  typically be running many checks simultaneously.

it would indeed be nice if this variable, and/or equivalent ones, were set.

As I mentioned before, I had long added a similar throttle (not for
data.table) in a package I look after (for work, even). So a similar
throttler with optionality is below. I'll add this to my `dang` package
collecting various functions.

A usage example follows. It does nothing by default, ensuring 'full power'
but reflects the minimum of two possible options, or an explicit count:

> dang::limitDataTableCores(verbose=TRUE)
Limiting data.table to '12'.
> Sys.setenv("OMP_THREAD_LIMIT"=3); dang::limitDataTableCores(verbose=TRUE)
Limiting data.table to '3'.
> options(Ncpus=2); dang::limitDataTableCores(verbose=TRUE)
Limiting data.table to '2'.
> dang::limitDataTableCores(1, verbose=TRUE)
Limiting data.table to '1'.
>

That makes it, in my eyes, preferable to any unconditional 'always pick 1 
thread'.

Dirk


##' Set threads for data.table respecting possible local settings
##'
##' This function set the number of threads \pkg{data.table} will use
##' while reflecting two possible machine-specific settings from the
##' environment variable \sQuote{OMP_THREAD_LIMIT} as well as the R
##' option \sQuote{Ncpus} (uses e.g. for parallel builds).
##' @title Set data.table threads respecting default settingss
##' @param ncores A numeric or character variable with the desired
##' count of threads to use
##' @param verbose A logical value with a default of \sQuote{FALSE} to
##' operate more verbosely
##' @return The return value of the \pkg{data.table} function
##' \code{setDTthreads} which is called as a side-effect.
##' @author Dirk Eddelbuettel
##' @export
limitDataTableCores <- function(ncores, verbose = FALSE) {
if (missing(ncores)) {
## start with a simple fallback: 'Ncpus' (if set) or else 2
ncores <- getOption("Ncpus", 2L)
## also consider OMP_THREAD_LIMIT (cf Writing R Extensions), gets NA if 
envvar unset
ompcores <- as.integer(Sys.getenv("OMP_THREAD_LIMIT"))
## and then keep the smaller
ncores <- min(na.omit(c(ncores, ompcores)))
}
stopifnot("Package 'data.table' must be installed." = 
requireNamespace("data.table", quietly=TRUE))
stopifnot("Argument 'ncores' must be numeric or character" = 
is.numeric(ncores) || is.character(ncores))
if (verbose) message("Limiting data.table to '", ncores, "'.")
data.table::setDTthreads(ncores)
}

| 
| -- 
| Best regards,
| Ivan
| 
| __
| R-package-devel@r-project.org mailing list
| https://stat.ethz.ch/mailman/listinfo/r-package-devel

-- 
dirk.eddelbuettel.com | @eddelbuettel | e...@debian.org

__
R-package-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-package-devel


Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-21 Thread Ivan Krylov
On Mon, 21 Aug 2023 13:28:54 +0100
Scott Ritchie  wrote:

> Examples with CPU time > 2.5 times elapsed time
> user system elapsed ratio
> remove_technical_variation 2.603  0.0270.94 2.798

In this context, "user" means the time spent executing userspace code
(as opposed to work done on behalf of the process by the operating
system kernel, "system"), and "elapsed" is the real time. Some threads
or child processes are definitely at work here.

Dirk is probably right that it's a good idea to have OMP_THREAD_LIMIT=2
set on the CRAN check machine. Either that, or place the responsibility
on data.table for setting the right number of threads by default. But
that's a policy question: should a CRAN package start no more than two
threads/child processes even if it doesn't know it's running in an
environment where the CPU time / elapsed time limit is two?

-- 
Best regards,
Ivan

__
R-package-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-package-devel


Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-21 Thread Mark Padgham



On 21/08/2023 14:34, Dirk Eddelbuettel wrote:

On 21 August 2023 at 15:16, Ivan Krylov wrote:
| On Mon, 21 Aug 2023 12:02:55 +0100
| Scott Ritchie  wrote:
|
| > remotes::install_github("sritchie73/ukbnmr")
| > library(ukbnmr)
| > system.time({ remove_technical_variation(test_data) })
|
| data.tables, you say? Can you show us the NOTE message you're getting?
| It could be that your example takes too much CPU time (as opposed to
| "real", "wallclock" time) due to running too many threads started by
| data.table.

Yep, and that is a new test AFAIK.

| It's not obvious why data.table would start too many threads (it's
| supposed to honour the limits that CRAN expresses in environment
| variables), but at least it should be easy to check and discount.

It grabs all it can get which is what you want for performance (I am on a
six-core machine here):

   $ R -q
   > library(data.table)
   data.table 1.14.8 using 6 threads (see ?getDTthreads).  Latest news: 
r-datatable.com
   >

and it honors variables if set

   $ OMP_THREAD_LIMIT=2 R -q
   > library(data.table)
   data.table 1.14.8 using 2 threads (see ?getDTthreads).  Latest news: 
r-datatable.com
   >

so I presume that variable is NOT set by CRAN.  It might help if it were.

Dirk


I had to update a package recently to get around this by putting
explicit 'data.table::setDTthreads(1)' in all examples, tests, and
vignettes. The incoming checks now do these CPU/elapsed tests, so you
can test by submitting, and if you're still over the ratio it will
auto-reject and tell you there. That was the only way to get my
submission to pass incoming.

__
R-package-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-package-devel


Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-21 Thread Dirk Eddelbuettel


On 21 August 2023 at 15:16, Ivan Krylov wrote:
| On Mon, 21 Aug 2023 12:02:55 +0100
| Scott Ritchie  wrote:
| 
| > remotes::install_github("sritchie73/ukbnmr")
| > library(ukbnmr)
| > system.time({ remove_technical_variation(test_data) })
| 
| data.tables, you say? Can you show us the NOTE message you're getting?
| It could be that your example takes too much CPU time (as opposed to
| "real", "wallclock" time) due to running too many threads started by
| data.table.

Yep, and that is a new test AFAIK.
 
| It's not obvious why data.table would start too many threads (it's
| supposed to honour the limits that CRAN expresses in environment
| variables), but at least it should be easy to check and discount.

It grabs all it can get which is what you want for performance (I am on a
six-core machine here):

  $ R -q
  > library(data.table)
  data.table 1.14.8 using 6 threads (see ?getDTthreads).  Latest news: 
r-datatable.com
  > 

and it honors variables if set

  $ OMP_THREAD_LIMIT=2 R -q
  > library(data.table)
  data.table 1.14.8 using 2 threads (see ?getDTthreads).  Latest news: 
r-datatable.com
  > 

so I presume that variable is NOT set by CRAN.  It might help if it were.

Dirk

-- 
dirk.eddelbuettel.com | @eddelbuettel | e...@debian.org

__
R-package-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-package-devel


Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-21 Thread Scott Ritchie
Hi Ivan,

Here is the NOTE generated by CRAN:

* checking examples ... [5s/2s] NOTE
Examples with CPU time > 2.5 times elapsed time
user system elapsed ratio
remove_technical_variation 2.603  0.0270.94 2.798

This doesn't appear to be related to data.table threads, here is what
I see after explicitly setting setDTthreads(1)

On my own machine (OSX Monterey, arm64 M1 processor):

> system.time({ remove_technical_variation(test_data) })
   user  system elapsed
  0.460   0.004   0.466

And on my University's cluster (RHEL 7, intel xeon platinum 8276 CPU @ 2.2 GHz):

> system.time({ remove_technical_variation(test_data) })
   user  system elapsed
  1.108   0.020   1.130

Runtimes are similar on these two machines when using an older version
of ukbnmr that has a 5x- larger test dataset (50 rows instead of 10
rows).

Best,

Scott


On Mon, 21 Aug 2023 at 13:16, Ivan Krylov  wrote:

> On Mon, 21 Aug 2023 12:02:55 +0100
> Scott Ritchie  wrote:
>
> > remotes::install_github("sritchie73/ukbnmr")
> > library(ukbnmr)
> > system.time({ remove_technical_variation(test_data) })
>
> data.tables, you say? Can you show us the NOTE message you're getting?
> It could be that your example takes too much CPU time (as opposed to
> "real", "wallclock" time) due to running too many threads started by
> data.table.
>
> It's not obvious why data.table would start too many threads (it's
> supposed to honour the limits that CRAN expresses in environment
> variables), but at least it should be easy to check and discount.
>
> --
> Best regards,
> Ivan
>

[[alternative HTML version deleted]]

__
R-package-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-package-devel


Re: [R-pkg-devel] Trouble with long-running tests on CRAN debian server

2023-08-21 Thread Ivan Krylov
On Mon, 21 Aug 2023 12:02:55 +0100
Scott Ritchie  wrote:

> remotes::install_github("sritchie73/ukbnmr")
> library(ukbnmr)
> system.time({ remove_technical_variation(test_data) })

data.tables, you say? Can you show us the NOTE message you're getting?
It could be that your example takes too much CPU time (as opposed to
"real", "wallclock" time) due to running too many threads started by
data.table.

It's not obvious why data.table would start too many threads (it's
supposed to honour the limits that CRAN expresses in environment
variables), but at least it should be easy to check and discount.

-- 
Best regards,
Ivan

__
R-package-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-package-devel