echuraev opened a new pull request, #15137:
URL: https://github.com/apache/tvm/pull/15137

   In PR #8313 a parameter `max_function_args` was introduced. It leads to 
limit number of function argument and in case when this value is exceeded then 
concatenation layer is split to a several concat operations.
   
   I faced a problem on Adreno GPU that for kernel with big number of arguments 
the enqueueNDRange was crashed without any errors. The problem appeared because 
of the huge number of arguments. But in this case not only concat layer was a 
root cause of the problem. Also after fusing several operations the final 
functions had a big number of arguments.
   
   As it was discussed in #8313, adding a limitation on the number of function 
arguments to the FuseOps pass might be a good improvement. In this PR I 
introduced such mechanism for limitation number of function arguments for 
FuseOps pass and add an arguments limit to OpenCL devices at 128 parameters.
   
   cc: @Hzfengsy, @masahi, @csullivan 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to