MasterJH5574 opened a new pull request, #19539:
URL: https://github.com/apache/tvm/pull/19539
Six CUDA sources in src/runtime/contrib used LOG(FATAL) via transitive
includes that #19483 trimmed; add the explicit <tvm/runtime/logging.h> include
to thrust.cu, attention_kernels.cu, and the four cutlass kernel headers
(fp16/fp8 sm90/sm100, gemm_runner, fp8_groupwise_scaled_gemm).
cache_kernels.cu used the bare Array{...} alias that #19483 removed; switch
to ffi::Array<Tensor>{...}.
attention_kernels.cu registered FFI functions whose parameters were raw
DLTensor*; the new reflection registry requires TypeSchema, so wrap both
TVM_FFI_STATIC_INIT_BLOCK registrations to take Tensor and forward to the
unchanged launchers via GetDLTensorPtr() (with const_cast for the output
tensors, matching the mt_random_engine / cudnn pattern).
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]