Re: Rework inter-procedural analysis

2017-06-14 Thread Matthias Boehm
sure - I'll try to add some documentation of IPA, probably directly inlined into the code. Unfortunately, a too verbose dev documentation quickly gets outdated because nobody updates it - let's see if we find the sweet spot that works for the project. Regards, Matthias On Wed, Jun 14, 2017 at

Re: Rework inter-procedural analysis

2017-06-14 Thread Nakul Jindal
Thank you Matthias for agreeing to do this! "Having a very verbose doc quickly gets outdated" is a problem many projects deal with. We can have the community comment on PRs that changes those parts, if the documentation does not reflect the submitted change. As a starting point, since you are

Re: Parfor loop interdependencies

2017-06-14 Thread Matthias Boehm
Generally, the parfor dependency analysis applies a series of tests including traditional techniques from high-performance compilers combined with additional rules for common cases. This dependency analysis tries to proof that there are no loop-carried dependencies - so yes, false positives can

Re: Rework inter-procedural analysis

2017-06-14 Thread Nakul Jindal
Hi Matthias, If its not too much trouble, could you please create a design document for this change. This will help the rest of the contributors work on this component as well. Thanks, Nakul On Wed, Jun 14, 2017 at 12:00 AM, Matthias Boehm wrote: > just a quick heads

Jenkins build is back to normal : SystemML-DailyTest #1051

2017-06-14 Thread jenkins
See

Re: Rework inter-procedural analysis

2017-06-14 Thread dusenberrymw
Agreed. More documentation, especially within the optimizer portion of the engine, is quite useful. Given that a large number of our bugs and performance issues stem from this area, it would be good for it to be clean and well documented so that future bug searches/fixes can be completed in a

Parfor loop interdependencies

2017-06-14 Thread dusenberrymw
While working on depthwise convolution, I wanted to make use of a parfor loop to run multiple convolutions at once (there is a vectorized approach as well, but that is out of scope for this question), but I'm running into issues with false flags for loop interdependencies. Within the parfor