Skip to content

Actions: mlc-ai/mlc-llm

Build Docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
466 workflow runs
466 workflow runs

Filter by Event

Loading

Filter by Status

Loading

Filter by Branch

Loading

Filter by Actor

Loading
[SLM] Introduce microsoft/Phi-3 vision (#2658)
Build Docs #441: Commit cdbd3ed pushed by mengshyu
July 24, 2024 02:27 7m 10s main
July 24, 2024 02:27 7m 10s
[Model] Support Llama3.1 (#2682)
Build Docs #440: Commit ecae55c pushed by tqchen
July 23, 2024 21:13 6m 37s main
July 23, 2024 21:13 6m 37s
[Model] Fix annotation typos (#2672)
Build Docs #439: Commit a49abcc pushed by Hzfengsy
July 22, 2024 07:49 6m 57s main
July 22, 2024 07:49 6m 57s
support mistral-nemo (#2676)
Build Docs #438: Commit b1834f8 pushed by Hzfengsy
July 22, 2024 07:49 6m 25s main
July 22, 2024 07:49 6m 25s
[Engine] Defer the collection of decode inputs in prefill (#2668)
Build Docs #437: Commit 4c4f060 pushed by MasterJH5574
July 18, 2024 01:30 11m 33s main
July 18, 2024 01:30 11m 33s
[SLM] Starcoder2 Multi-GPU support (#2662)
Build Docs #436: Commit c06bb39 pushed by MasterJH5574
July 17, 2024 02:41 7m 9s main
July 17, 2024 02:41 7m 9s
[Model] Support SmolLM (#2667)
Build Docs #435: Commit 52c0638 pushed by MasterJH5574
July 17, 2024 02:40 6m 31s main
July 17, 2024 02:40 6m 31s
[Fix] Fix prefix cache reuse with eagle mode (#2664)
Build Docs #434: Commit 8290a97 pushed by MasterJH5574
July 16, 2024 16:52 6m 32s main
July 16, 2024 16:52 6m 32s
[Engine] Lazy recompute in GetRunningRequestStateEntries (#2655)
Build Docs #433: Commit baeb195 pushed by MasterJH5574
July 15, 2024 18:52 7m 45s main
July 15, 2024 18:52 7m 45s
[Model] Support Starcoder2 (#2657)
Build Docs #432: Commit 5bedaec pushed by MasterJH5574
July 15, 2024 13:38 6m 37s main
July 15, 2024 13:38 6m 37s
[PrefixCache] Defer sequence extension (#2654)
Build Docs #431: Commit 17ad72c pushed by MasterJH5574
July 14, 2024 14:30 6m 54s main
July 14, 2024 14:30 6m 54s
[Engine] Reduce action post-process overhead (#2653)
Build Docs #430: Commit 2345900 pushed by tqchen
July 13, 2024 18:33 6m 34s main
July 13, 2024 18:33 6m 34s
[Fix][Bitmask] Mask dummy padded tokens for grammar (#2651)
Build Docs #429: Commit cbf6ae0 pushed by tqchen
July 12, 2024 10:34 6m 44s main
July 12, 2024 10:34 6m 44s
[Fix][Tokenizer] Fix failure in decoding tokens for ByteLevel BPE (#2…
Build Docs #428: Commit 64d8dc6 pushed by tqchen
July 11, 2024 17:23 6m 25s main
July 11, 2024 17:23 6m 25s
[Fix] Fix KV cache single-page copy kernel (#2644)
Build Docs #427: Commit 16a79ab pushed by MasterJH5574
July 11, 2024 14:17 6m 46s main
July 11, 2024 14:17 6m 46s
Fix for RWKV new config and new format vocab (#2632)
Build Docs #426: Commit 7d73cfa pushed by tqchen
July 8, 2024 13:46 6m 51s main
July 8, 2024 13:46 6m 51s
[Model] Support Internlm2.5 (#2630)
Build Docs #425: Commit c7756f9 pushed by MasterJH5574
July 8, 2024 03:56 7m 36s main
July 8, 2024 03:56 7m 36s
[Serving] Merge multiple token embedding lookup into one (#2629)
Build Docs #424: Commit c6122d7 pushed by MasterJH5574
July 8, 2024 03:55 7m 7s main
July 8, 2024 03:55 7m 7s
[SLM] Internlm2 Multi-GPU support (#2626)
Build Docs #423: Commit 5165a58 pushed by MasterJH5574
July 8, 2024 03:55 7m 3s main
July 8, 2024 03:55 7m 3s
[Fix] Fix the chunked prefill condition (#2628)
Build Docs #422: Commit ebf5617 pushed by MasterJH5574
July 5, 2024 18:35 7m 16s main
July 5, 2024 18:35 7m 16s
[Fix] Mark the decode requests in hybrid prefill (#2621)
Build Docs #421: Commit 5b63980 pushed by MasterJH5574
July 4, 2024 04:21 6m 56s main
July 4, 2024 04:21 6m 56s
[Android] Update include path for tvm runtime src (#2616)
Build Docs #420: Commit adc6ee6 pushed by MasterJH5574
July 2, 2024 19:57 7m 25s main
July 2, 2024 19:57 7m 25s
July 2, 2024 15:41 7m 4s
[SLM] Add support for InternLM2 architecture (#2608)
Build Docs #418: Commit 2d32094 pushed by MasterJH5574
July 2, 2024 15:40 7m 11s main
July 2, 2024 15:40 7m 11s
Update debug_compare (#2612)
Build Docs #417: Commit c09b108 pushed by MasterJH5574
July 2, 2024 15:37 7m 15s main
July 2, 2024 15:37 7m 15s