MasterJH5574 opened a new pull request, #19539:
URL: https://github.com/apache/tvm/pull/19539

   Six CUDA sources in src/runtime/contrib used LOG(FATAL) via transitive 
includes that #19483 trimmed; add the explicit <tvm/runtime/logging.h> include 
to thrust.cu, attention_kernels.cu, and the four cutlass kernel headers 
(fp16/fp8 sm90/sm100, gemm_runner, fp8_groupwise_scaled_gemm).
   
   cache_kernels.cu used the bare Array{...} alias that #19483 removed; switch 
to ffi::Array<Tensor>{...}.
   
   attention_kernels.cu registered FFI functions whose parameters were raw 
DLTensor*; the new reflection registry requires TypeSchema, so wrap both 
TVM_FFI_STATIC_INIT_BLOCK registrations to take Tensor and forward to the 
unchanged launchers via GetDLTensorPtr() (with const_cast for the output 
tensors, matching the mt_random_engine / cudnn pattern).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to