You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Would it be possible to load litertlm models like Gemma 4, which would run on my GPU (Google pixel 6 pro/GrapheneOS) and run much faster than gguf models?
I have used other apps like "https://github.com/jegly/Box" that can do it.
Would it be possible to load litertlm models like Gemma 4, which would run on my GPU (Google pixel 6 pro/GrapheneOS) and run much faster than gguf models?
I have used other apps like "https://github.com/jegly/Box" that can do it.