Add example on LLM offline inference #31

JasonLo · 2024-12-20T21:55:31Z

This example may be valuable for:

Generating large volumes of synthetic data with open-source LLMs.
Performing large-scale, structured data extraction from text.
Embedding extensive text datasets.
Cost-effectively executing LLM-driven tasks at scale without relying on expensive commercial solutions.

For more details, please refer to vllm_batch_inference/README.md.

iross

Thanks for submitting this! Sorry it took so long to look at it 😂. Let me know what you think of the changes I mentioned. I wish the uv fix was less ugly.. Your suggestion of building it from source sounds painful, but at least then we wouldn't be stomping permissions all over /root/...

vllm_batch_inference/run.sh

vllm_batch_inference/Dockerfile

vllm_batch_inference/README.md

vllm_batch_inference/job.sub

vllm_batch_inference/run.sh

agitter · 2025-03-21T19:27:52Z

Thanks for the contribution @JasonLo!

Since v0.8, vLLM has updated its environment management to use `uv`, which can make running the container in user mode slightly more challenging.

JasonLo · 2025-03-21T21:36:03Z

Thank you, @iross and @agitter, for your input. I have tested the changes mentioned above, and they seem to be working well now. I tested a job with job_id = 1455169.

…ts in run.sh

JasonLo added 13 commits December 13, 2024 14:40

tmp hand off to remote

270a903

checkpoint... looking anther machine to test...

286c0e8

add minimal example

9385e99

remove non-essentials

815a3a7

simplify and clean

9c237fb

add shell dockerfile back

68c607b

single batch example complete

c024108

minor fix sampling params

9718a47

add advanced example

335a3ec

avoid multiple LLM instantiation, remove single batch example.

21e0706

update vllm to v0.7.3

a956c5d

update vllm version

823c78a

update vllm to v0.8.1

9626d24

iross self-assigned this Mar 20, 2025

iross self-requested a review March 21, 2025 14:49

iross reviewed Mar 21, 2025

View reviewed changes

vllm_batch_inference/run.sh Show resolved Hide resolved

vllm_batch_inference/Dockerfile Outdated Show resolved Hide resolved

vllm_batch_inference/README.md Show resolved Hide resolved

vllm_batch_inference/job.sub Show resolved Hide resolved

agitter reviewed Mar 21, 2025

View reviewed changes

vllm_batch_inference/run.sh Outdated Show resolved Hide resolved

JasonLo added 4 commits March 21, 2025 15:59

Fix Dockerfile UV-related issues.

495865d

Since v0.8, vLLM has updated its environment management to use `uv`, which can make running the container in user mode slightly more challenging.

Add TORCHINDUCTOR_CACHE_DIR environment variable to run.sh

01bb05b

update docker_image to a fixed version

e93219a

Set when_to_transfer_output to ON_EXIT_OR_EVICT in job.sub

bbd7356

JasonLo added 2 commits March 21, 2025 23:11

enhance automatic multi-GPU management.

b06b19e

Update README and code comments for vLLM v0.8.1; remove unused commen…

173be72

…ts in run.sh

iross approved these changes Mar 24, 2025

View reviewed changes

iross merged commit efd419a into CHTC:master Mar 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add example on LLM offline inference #31

Add example on LLM offline inference #31

JasonLo commented Dec 20, 2024

iross left a comment

agitter commented Mar 21, 2025

JasonLo commented Mar 21, 2025

Add example on LLM offline inference #31

Add example on LLM offline inference #31

Conversation

JasonLo commented Dec 20, 2024

iross left a comment

Choose a reason for hiding this comment

agitter commented Mar 21, 2025

JasonLo commented Mar 21, 2025