mei-ye opened a new pull request #10391:
URL: https://github.com/apache/tvm/pull/10391
@masahi
[vulkan] Add integer dot product (4xint8, 4xuint8) tensorization for the
Vulkan SPIR-V target. Currently only the AutoTVM path is supported.
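For readers unfamiliar with the operation, here is a minimal reference sketch of what a 4xint8 / 4xuint8 dot product computes: four 8-bit lanes packed into each 32-bit word are multiplied pairwise and summed into an accumulator. The helper names (`pack_i8`, `pack_u8`, `dot4_i8`, `dot4_u8`) are hypothetical and are not part of TVM or SPIR-V; little-endian lane order is an assumption, and 32-bit accumulator overflow is not modeled.

```python
import struct


def pack_i8(values):
    """Pack four signed 8-bit values into one 32-bit word (little-endian lanes)."""
    return struct.unpack("<I", struct.pack("<4b", *values))[0]


def pack_u8(values):
    """Pack four unsigned 8-bit values into one 32-bit word (little-endian lanes)."""
    return struct.unpack("<I", struct.pack("<4B", *values))[0]


def dot4_i8(a_packed, b_packed, acc=0):
    """Signed 4x8-bit dot product of two packed words, added to acc."""
    a = struct.unpack("<4b", struct.pack("<I", a_packed))
    b = struct.unpack("<4b", struct.pack("<I", b_packed))
    return acc + sum(x * y for x, y in zip(a, b))


def dot4_u8(a_packed, b_packed, acc=0):
    """Unsigned 4x8-bit dot product of two packed words, added to acc."""
    a = struct.unpack("<4B", struct.pack("<I", a_packed))
    b = struct.unpack("<4B", struct.pack("<I", b_packed))
    return acc + sum(x * y for x, y in zip(a, b))


# 1*5 + (-2)*6 + 3*(-7) + (-4)*8 = -60
signed_result = dot4_i8(pack_i8([1, -2, 3, -4]), pack_i8([5, 6, -7, 8]))

# 200*2 + 1*1 + 2*1 + 3*1 = 406
unsigned_result = dot4_u8(pack_u8([200, 1, 2, 3]), pack_u8([2, 1, 1, 1]))
```

Tensorization replaces an inner loop of four multiply-accumulates with one such instruction, which is why the input channels need to be grouped in fours.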
Prerequisites for compilation: (1) use the Vulkan SDK 1.2.198 release, which
includes SPIR-V integer dot product support; (2) set
set(USE_SPIRV_KHR_INTEGER_DOT_PRODUCT ON) in config.cmake and build; (3) use
a driver that supports the VK_KHR_shader_integer_dot_product extension.
The compiled binary can only be run on hardware that supports the relevant
ISA. This work was tested on the AMD RDNA2 family (e.g., Rembrandt and RX6800).
To compile on a device that supports this extension, use target: vulkan
-from_device=0
To compile on a device that supports int8 but does not support this
extension, add "-supports_integer_dot_product=1" or "-mattr=+dotprod" to the
target string.
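The two target-string cases above can be sketched as a small helper. `vulkan_target` is a hypothetical function written for illustration only, not a TVM API; the option names are taken verbatim from this description. Whether `-from_device=0` should be combined with the override on a given setup is an assumption left to the caller.

```python
def vulkan_target(from_device=None, force_dot_product=False):
    """Build a Vulkan target string (hypothetical helper, not part of TVM)."""
    parts = ["vulkan"]
    if from_device is not None:
        # Query the capabilities of the given device index.
        parts.append(f"-from_device={from_device}")
    if force_dot_product:
        # Override for a compile host that lacks the extension.
        parts.append("-supports_integer_dot_product=1")
    return " ".join(parts)


# Compile host supports the extension: let the device report it.
native = vulkan_target(from_device=0)

# Compile host supports int8 but not the extension: force the attribute.
forced = vulkan_target(force_dot_product=True)

print(native)
print(forced)
```

With TVM installed, the resulting string would be passed on as usual, e.g. `tvm.target.Target(native)`.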
To support pre-release Vulkan and SPIR-V extensions, the SPIR-V tools and
header files from the Khronos GitHub repositories are needed; use the
USE_KHRONOS_SPIRV option in config.cmake.
Examples of using this feature can be found in:
gallery/how_to/deploy_models/deploy_prequantized.py and
gallery/how_to/deploy_models/deploy_prequantized_tflite.py