Skip to content

LMDeploy Release v0.6.2

Compare
Choose a tag to compare
@lvhan028 lvhan028 released this 29 Oct 06:42
· 143 commits to main since this release
522108c

Highlights

  • PyTorch engine supports graph mode on ascend platform, doubling the inference speed
  • Support llama3.2-vision models in PyTorch engine
  • Support Mixtral in TurboMind engine, achieving 20+ RPS using SharedGPT dataset with 2 A100-80G GPUs

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

📚 Documentations

🌐 Other

New Contributors

Full Changelog: v0.6.1...v0.6.2