Web AI allows you to run many models in the web browser via WebGPU allowing users to run LLMs like Gemma 3N multimodal and beyond in real-time on video feeds. I was wondering if you could also offer your demo in pure Web AI form to run the model client side either via Google's LiteRT.js or Microsoft ONNX Runtime Web?
Web AI allows you to run many models in the web browser via WebGPU allowing users to run LLMs like Gemma 3N multimodal and beyond in real-time on video feeds. I was wondering if you could also offer your demo in pure Web AI form to run the model client side either via Google's LiteRT.js or Microsoft ONNX Runtime Web?