zxybazh opened a new pull request, #12385:
URL: https://github.com/apache/tvm/pull/12385

   >The C5 and C5d 12xlarge, 24xlarge, and metal instances feature custom 2nd 
generation [Intel](https://aws.amazon.com/intel/) Xeon Scalable Processors 
(Cascade Lake) with a sustained all-core turbo frequency of 3.6GHz and maximum 
single core turbo frequency of 3.9GHz. The other C5 and C5d instance sizes will 
either launch on the 2nd generation Intel Xeon Scalable Processor or the 1st 
generation Intel Xeon Platinum 8000 series (Skylake-SP) processor with a 
sustained all core Turbo frequency of up to 3.4GHz, and single core turbo up to 
3.5 GHz using Intel Turbo Boost Technology.
   
   >The C5 and C5d 12xlarge, 24xlarge, and metal instance sizes enable Vector 
Neural Network Instructions (AVX-512 VNNI*) which will help speed up typical 
machine learning operations like convolution, and automatically improve 
inference performance over a wide range of deep learning workloads.
   
   
   According to the [AWS EC2 C5 instance 
page](https://aws.amazon.com/ec2/instance-types/c5/), the C5 12xlarge and 
24xlarge instances run on the `cascade lake` architecture rather than 
`skylake`, and they enable Vector Neural Network Instructions (AVX-512 VNNI), 
which can be useful for AutoTensorization.
   
   This PR fixes the target definitions of the two C5 instance sizes mentioned 
above so that they support VNNI.
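
   The mapping described above can be sketched in a few lines of Python. This 
is an illustration only, not the diff in this PR: the helper names 
`c5_mcpu` and `c5_target_string` are hypothetical, and the LLVM CPU names 
`cascadelake` and `skylake-avx512` are assumed to be the `-mcpu` values that 
correspond to the two architectures named in the AWS text.

```python
# Sizes that the quoted AWS text lists as always running on Cascade Lake.
CASCADE_LAKE_SIZES = {"12xlarge", "24xlarge", "metal"}


def c5_mcpu(instance_type: str) -> str:
    """Return an assumed LLVM -mcpu name for a C5/C5d instance type.

    Hypothetical helper: other sizes may launch on either generation,
    so the older Skylake-SP name is used as the conservative baseline.
    """
    family, _, size = instance_type.partition(".")
    if family not in ("c5", "c5d") or not size:
        raise ValueError(f"not a C5/C5d instance type: {instance_type!r}")
    return "cascadelake" if size in CASCADE_LAKE_SIZES else "skylake-avx512"


def c5_target_string(instance_type: str) -> str:
    """Build an LLVM target string for the given instance type."""
    return f"llvm -mcpu={c5_mcpu(instance_type)}"


if __name__ == "__main__":
    for name in ("c5.large", "c5.12xlarge", "c5.24xlarge"):
        print(name, "->", c5_target_string(name))
```

   If TVM is installed, a string built this way could presumably be passed to 
`tvm.target.Target(...)`; the sketch itself deliberately avoids importing TVM.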

