Ability to run litertlm models like Gemma 4 E4B

Would it be possible to load litertlm models like Gemma 4, which would run on my GPU (Google pixel 6 pro/GrapheneOS) and run much faster than gguf models? 
I have used other apps like "https://github.com/jegly/Box" that can do it.