Skip to content

hon9kon9ize/bart-translation-zh-yue-onnx

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Bart model with onnxruntime deploying to Google Cloud Run

This repository contains the code for deploying a ONNX bart translation model to Google Cloud Run, It is based on the insights shared in the blog post 'My Journey to a serverless transformers pipeline on Google Cloud' from the Hugging Face website."

Prerequisites

Deploying to Google Cloud Run

gcloud builds submit --tag gcr.io/<project-id>/bart-translation-zh-yue-onnx
gcloud run deploy --image gcr.io/<project-id>/bart-translation-zh-yue-onnx --platform managed \
  --command "serve" \
  --cpu 4 \
  --memory 8Gi \

About

Bart model with onnxruntime deploying to Google Cloud Run

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published