Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Tracking] PagedKVCache Quantization #2663 中的TVM版本 #2880

Closed
XJY990705 opened this issue Sep 4, 2024 · 1 comment
Closed

[Tracking] PagedKVCache Quantization #2663 中的TVM版本 #2880

XJY990705 opened this issue Sep 4, 2024 · 1 comment
Labels
status: tracking Tracking work in progress

Comments

@XJY990705
Copy link

Overview

我需要测试 #2663 中量化后的性能,从源码下载了https://github.com/davidpissarra/mlc-llm/tree/kv-cache-quantization 这个分支中制定的tvm版本
image
f5f048b版本,但是在编译mlc-llm时报错,应该是tvm的版本不匹配导致的。请问我应该怎样解决?

Action Items

  • [ ]

Links to Related Issues and PRs

https://github.com/davidpissarra/mlc-llm/tree/kv-cache-quantization

@XJY990705 XJY990705 added the status: tracking Tracking work in progress label Sep 4, 2024
@XJY990705
Copy link
Author

Already solved

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
status: tracking Tracking work in progress
Projects
Status: Done
Development

No branches or pull requests

2 participants