You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello! I am interested in TVM's work on cutlass. I am a bit confused about the results table. Does the inclusion of cuDNN and TensorRT in the table mean that TVM is running models using the kernels/sub-graphs provided by cuDNN and TensorRT? If that is the case, why did TVM add support for cutlass when TensorRT is faster?
The text was updated successfully, but these errors were encountered:
Does the inclusion of cuDNN and TensorRT in the table mean that TVM is running models using the kernels/sub-graphs provided by cuDNN and TensorRT
Yes for cuDNN but no for TRT. The TRT result is for reference only, the results were obtained by running TRTorch on resnet or running the BERT demo in the TensorRT repo. And CUTLASS is generally faster than cuDNN.
Hello! I am interested in TVM's work on cutlass. I am a bit confused about the results table. Does the inclusion of cuDNN and TensorRT in the table mean that TVM is running models using the kernels/sub-graphs provided by cuDNN and TensorRT? If that is the case, why did TVM add support for cutlass when TensorRT is faster?
The text was updated successfully, but these errors were encountered: