merge cog-trt-llm into this repo #38
Conversation
yorickvP commented on May 14, 2024:
- subtree merge cog-trt-llm
- ignore cog-trt-llm subdir in runners
- use cog-trt-llm as root dir in builder
…ATH to fix libnvinfer.so.9 import
Update README.md
Co-authored-by: Nathan Raw <[email protected]>
Make instructions more noob friendly & fix papercuts
Read TRTLLM_DIR from env with fallback
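For readers unfamiliar with the first bullet: a "subtree merge" grafts another repository's history under a subdirectory of this one. This is a hedged sketch of the classic core-git recipe using two throwaway local repos; the repo names, file contents, and commit messages here are stand-ins, not the actual cog-trt-llm history:

```shell
set -e
tmp=$(mktemp -d)

# Stand-in for the cog-trt-llm repo (contents are illustrative only)
git init -q -b main "$tmp/cog-trt-llm"
echo 'print("predict")' > "$tmp/cog-trt-llm/predict.py"
git -C "$tmp/cog-trt-llm" add predict.py
git -C "$tmp/cog-trt-llm" -c user.email=ci@example.com -c user.name=ci \
  commit -qm "add predict.py"

# Stand-in for this repo
git init -q -b main "$tmp/repo"
cd "$tmp/repo"
echo "# repo" > README.md
git add README.md
git -c user.email=ci@example.com -c user.name=ci commit -qm "init"

# Classic subtree merge: graft cog-trt-llm's tree under cog-trt-llm/
git remote add cog "$tmp/cog-trt-llm"
git fetch -q cog
git -c user.email=ci@example.com -c user.name=ci \
  merge -s ours --no-commit --allow-unrelated-histories cog/main
git read-tree --prefix=cog-trt-llm/ -u cog/main
git -c user.email=ci@example.com -c user.name=ci \
  commit -qm "subtree merge cog-trt-llm"

test -f cog-trt-llm/predict.py && echo "merged"
```

After this, the other repo's files live under `cog-trt-llm/` and its full history is reachable through the merge commit, which is what makes the resulting tree the "weird merge" discussed below.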
# copy cog-trt-llm source into /src
cognix.postCopyCommands = ''
  cp ${config.deps.cog-trt-llm}/{*.py,cog-trt-llm-config.yaml} $out/src/
'';
cognix.rootPath = lib.mkForce "${./cog-trt-llm}";
doesn't this result in a pretty different image?
/src will be different, just containing the data from cog-trt-llm instead of a weird merge. But that should be it.
I feel like I'd prefer to keep the weird merge as it is, I think
@technillogue, could you tldr the costs/benefits as you see them? I don't think I understand your preference, but I'd like to :)
I was just thinking of keeping the output images the way things are now, but I don't actually care that much
This seems good to me. Presumably you've tested the builds?
[submodule "tensorrtllm_backend"]
	path = tensorrtllm_backend
	url = https://github.com/triton-inference-server/tensorrtllm_backend.git
we should probably drop this
Haven't dropped it because that would break the 'legacy' Dockerfile cog-trt-llm build.
*.bin
*.safetensors
*.cog
*.hypothesis
*.pytest_cache
__pycache__/
*.pyc
models/*
this doesn't do anything