Skip to content

Actions: mlc-ai/mlc-llm

Build Docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
624 workflow runs
624 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Model] Use optimized group gemm for Mixtral (#1988)
Build Docs #24: Commit c74f176 pushed by tqchen
March 20, 2024 20:28 2m 6s main
March 20, 2024 20:28 2m 6s
[Fix] Fix serve model to adapt the latest Allocator signature (#1989)
Build Docs #23: Commit d4ec25e pushed by MasterJH5574
March 20, 2024 19:42 2m 3s main
March 20, 2024 19:42 2m 3s
[Serving][Grammar] Utility to convert json schema to EBNF grammar (#1…
Build Docs #22: Commit a0484bd pushed by tqchen
March 20, 2024 14:25 2m 52s main
March 20, 2024 14:25 2m 52s
[SpecDecode] Fix sampler selection. (#1971)
Build Docs #21: Commit 39d0865 pushed by MasterJH5574
March 20, 2024 02:36 2m 12s main
March 20, 2024 02:36 2m 12s
March 20, 2024 02:35 2m 35s
[Fix] Fix MLC_MULTI_ARCH with arch sm_90a (#1984)
Build Docs #19: Commit 5485782 pushed by vinx13
March 19, 2024 23:54 2m 11s main
March 19, 2024 23:54 2m 11s
[Serving][Grammar] Support specifying the main rule in grammar (#1982)
Build Docs #18: Commit bed4f53 pushed by tqchen
March 19, 2024 22:31 2m 7s main
March 19, 2024 22:31 2m 7s
[Fix] Fix handling of non-numerical cuda arch (#1976)
Build Docs #17: Commit 587e341 pushed by MasterJH5574
March 19, 2024 02:50 2m 7s main
March 19, 2024 02:50 2m 7s
[REST] REST API Deprecated (#1973)
Build Docs #16: Commit 3cbc169 pushed by MasterJH5574
March 19, 2024 01:32 2m 17s main
March 19, 2024 01:32 2m 17s
[Serve] Hot fix for the mixtral serving (#1975)
Build Docs #15: Commit 058c583 pushed by tqchen
March 19, 2024 00:06 2m 21s main
March 19, 2024 00:06 2m 21s
[Model][Serve] Add support for LLaVa model in serving engine (#1974)
Build Docs #14: Commit 949ff2d pushed by MasterJH5574
March 18, 2024 20:44 3m 2s main
March 18, 2024 20:44 3m 2s
March 18, 2024 17:18 2m 22s
[REST] Update Rest API docs for the latest serve flow (#1972)
Build Docs #12: Commit 386af8d pushed by Kartik14
March 18, 2024 14:24 2m 6s main
March 18, 2024 14:24 2m 6s
[Model] Migrate Mistral to use PagedKVCache (#1967)
Build Docs #11: Commit edffce4 pushed by tqchen
March 16, 2024 22:42 1m 59s main
March 16, 2024 22:42 1m 59s
[Serving][Fix] Fix JSON output check in test_server.py (#1966)
Build Docs #10: Commit d6b86d1 pushed by MasterJH5574
March 16, 2024 15:08 2m 12s main
March 16, 2024 15:08 2m 12s
[SLM] Small correction on Stablelm and Qwen2. (#1958)
Build Docs #9: Commit 73f2b27 pushed by tqchen
March 16, 2024 09:33 2m 9s main
March 16, 2024 09:33 2m 9s
Unify schema for conversation template and embed into mlc-chat-config…
Build Docs #8: Commit 994f928 pushed by rickzx
March 15, 2024 16:22 2m 5s main
March 15, 2024 16:22 2m 5s
[Serving][Grammar] Add grammar termination as a stop condition (#1964)
Build Docs #7: Commit c7d52c4 pushed by tqchen
March 15, 2024 13:20 2m 25s main
March 15, 2024 13:20 2m 25s
[CompilerFlag] Detect if FlashInfer is enabled from libinfo (#1941)
Build Docs #6: Commit 09fe1bc pushed by MasterJH5574
March 15, 2024 02:25 2m 2s main
March 15, 2024 02:25 2m 2s
[Model] Use static hidden size in mixtral scatter_output (#1959)
Build Docs #5: Commit d546134 pushed by MasterJH5574
March 14, 2024 20:28 2m 51s main
March 14, 2024 20:28 2m 51s
March 14, 2024 14:25 2m 17s
[Fix] Fetching the Git-LFS tokenizer files (#1954)
Build Docs #3: Commit c0b2ccd pushed by tqchen
March 14, 2024 02:15 2m 21s main
March 14, 2024 02:15 2m 21s
[Fix] Fix embedding shape check in ChatModule (#1953)
Build Docs #2: Commit 8d192ef pushed by MasterJH5574
March 13, 2024 17:51 2m 17s main
March 13, 2024 17:51 2m 17s
[CI] Add windows ci (#1942)
Build Docs #1: Commit 8a29ee1 pushed by tqchen
March 13, 2024 12:50 2m 45s main
March 13, 2024 12:50 2m 45s
ProTip! You can narrow down the results and go further in time using created:<2024-03-13 or the other filters available.