Actions: InternLM/lmdeploy

publish-docker

834 workflow runs

Support qwen2-vl AWQ quantization (#2787)
publish-docker #830: Commit b4834ea pushed by lvhan028
November 25, 2024 10:21 30d 0h 0m 4s main
[Feature] support minicpm-v_2_6 for pytorch engine. (#2767)
publish-docker #829: Commit 324237b pushed by lvhan028
November 21, 2024 11:01 30d 0h 0m 4s main
feature: support qwen2.5 fuction_call (#2737)
publish-docker #828: Commit 96fa668 pushed by lvhan028
November 19, 2024 03:22 30d 0h 0m 3s main
bump version to v0.6.3 (#2754)
publish-docker #827: Commit 0c80baa pushed by lvhan028
November 16, 2024 04:31 1h 18m 29s v0.6.3
bump version to v0.6.3 (#2754)
publish-docker #826: Commit 0c80baa pushed by lvhan028
November 16, 2024 04:29 30d 0h 0m 4s main
set wrong head_dim for mistral-nemo (#2761)
publish-docker #825: Commit 9ecc44a pushed by lvhan028
November 15, 2024 11:19 30d 0h 0m 3s main
Remove use_fast=True when loading tokenizer for lite auto_awq (#2758)
publish-docker #824: Commit 21f2866 pushed by lvhan028
November 14, 2024 16:42 30d 0h 0m 2s main
feat: support multi cards in ascend graph mode (#2755)
publish-docker #823: Commit 8e0076a pushed by lvhan028
November 14, 2024 06:30 30d 0h 0m 3s main
Support molmo in turbomind (#2716)
publish-docker #822: Commit fd8906c pushed by lvhan028
November 14, 2024 05:07 30d 0h 0m 3s main
Support chemvlm (#2738)
publish-docker #821: Commit a21def9 pushed by lvhan028
November 14, 2024 03:34 30d 0h 0m 3s main
optimize dlinfer moe (#2741)
publish-docker #820: Commit 7250318 pushed by lvhan028
November 13, 2024 10:40 30d 0h 0m 3s main
fix issue that mono-internvl failed to fallback pytorch engine (#2744)
publish-docker #819: Commit 20544d3 pushed by lvhan028
November 13, 2024 09:38 30d 0h 0m 4s main
Check server input (#2719)
publish-docker #818: Commit 9f6ff9b pushed by lvhan028
November 13, 2024 07:37 30d 0h 0m 4s main
Support mixtral moe AWQ quantization. (#2725)
publish-docker #817: Commit adf7c36 pushed by lvhan028
November 13, 2024 04:21 30d 0h 0m 4s main
Support Qwen2-MoE models (#2723)
publish-docker #816: Commit d2d4209 pushed by lvhan028
November 13, 2024 03:27 30d 0h 0m 3s main
fix assert pad >= 0 failed when inter_size is not a multiple of group…
publish-docker #815: Commit e751708 pushed by lvhan028
November 12, 2024 13:15 30d 0h 0m 3s main
Remove one of the duplicate bos tokens (#2708)
publish-docker #814: Commit 67a8538 pushed by lvhan028
November 12, 2024 08:40 30d 0h 0m 3s main
Support ep, column major moe kernel. (#2690)
publish-docker #813: Commit 4a8d745 pushed by lvhan028
November 11, 2024 13:11 30d 0h 0m 4s main
Support Mono-InternVL with PyTorch backend (#2727)
publish-docker #812: Commit 06aea5d pushed by lvhan028
November 11, 2024 03:09 30d 0h 0m 2s main
[Feature]: support LlavaForConditionalGeneration with turbomind infer…
publish-docker #811: Commit 78ab485 pushed by lvhan028
November 8, 2024 11:31 30d 0h 0m 4s main
Flatten cache and add flashattention (#2676)
publish-docker #810: Commit 2bed018 pushed by lvhan028
November 8, 2024 03:51 30d 0h 0m 2s main
bump version to 0.6.2.post1 (#2717)
publish-docker #809: Commit 4fc9479 pushed by lvhan028
November 7, 2024 07:41 1h 10m 9s v0.6.2.post1
fix tp exit code for pytorch engine (#2718)
publish-docker #808: Commit a4012ef pushed by lvhan028
November 7, 2024 03:21 30d 0h 0m 3s main
support turbomind head_dim 64 (#2715)
publish-docker #807: Commit e7886b4 pushed by lvhan028
November 6, 2024 07:27 30d 0h 0m 2s main
fix decoding kernel for deepseekv2 (#2688)
publish-docker #806: Commit 354028b pushed by lvhan028
November 6, 2024 06:53 30d 0h 0m 2s main