LMDeploy Release V0.1.0a0
What's Changed
🚀 Features
- Add extra_requires to reduce dependencies by @RunningLeon in #580
- TurboMind 2 by @lzhangzz in #590
- Support loading hf model directly by @irexyc in #685
💥 Improvements
- Fix Tokenizer encode by @AllentDan in #645
- Optimize for throughput by @lzhangzz in #701
- Replace mmengine with mmengine-lite by @zhouzaida in #715
🐞 Bug fixes
- Fix init of batch state by @lzhangzz in #682
- fix turbomind stream canceling by @grimoire in #686
- [Fix] Fix load_checkpoint_in_model bug by @HIT-cwh in #690
- Fix wrong eos_id and bos_id obtained through grpc api by @lvhan028 in #644
- Fix cache/output length calculation by @lzhangzz in #738
- [Fix] Skip empty batch by @lzhangzz in #747
📚 Documentations
- [Docs] Update Supported Matrix by @pppppM in #679
- [Docs] Update KV8 Docs by @pppppM in #681
- [Doc] Update restful api doc by @AllentDan in #662
- Check-in user guide about turbomind config by @lvhan028 in #680
🌐 Other
New Contributors
- @zhouzaida made their first contribution in #715
Full Changelog: v0.0.14...v0.1.0a0