feat: add support for TPUs on llm-d reference architecture by syeda-anjum · Pull Request #420 · GoogleCloudPlatform/accelerated-platforms

syeda-anjum · 2026-05-01T17:42:33Z

Overview

This PR adds support for Gemma-4 on TPUs with llm-d, including Kustomize templates and a new README guide.

Key Changes

Added docs/platforms/gke/base/use-cases/inference-ref-arch/llmd/llmd-vllm-with-hf-model-tpu.md.
Added Kustomize manifests for v6e-gemma-4-26b-a4b,v6e-gemma-4-31b and 'v6e-qwen3-32b' in online-inference-tpu/llmd/vllm.

Impact of Change

Enables users to deploy Gemma-4 and Qwen-3 on TPUs using llm-d and provides documentation for it.

References

…-subscriber

…on llm-d documentation

syeda-anjum and others added 5 commits April 18, 2026 20:41

llm-d on TPUs, new branch

46383c3

fix: rename template file to match script expectation in async-pubsub…

a1cef20

…-subscriber

feat: add Gemma-4 templates and README for llm-d on TPUs

44e185e

removing v5e examples as the accelerator is not officially supported …

74a9442

…on llm-d documentation

Merge branch 'main' into sanjum-llmdontpus

d71ecf8

syeda-anjum changed the title ~~feat: add Gemma-4 templates and README for llm-d on TPUs~~ feat: add support for TPUs on llm-d reference architecture May 20, 2026

syeda-anjum requested a review from gushob21 May 20, 2026 19:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add support for TPUs on llm-d reference architecture#420

feat: add support for TPUs on llm-d reference architecture#420
syeda-anjum wants to merge 5 commits into
mainfrom
sanjum-llmdontpus

syeda-anjum commented May 1, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

syeda-anjum commented May 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Key Changes

Impact of Change

References

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

syeda-anjum commented May 1, 2026 •

edited

Loading