Skip to content

Ability to run litertlm models like Gemma 4 E4B #108

@Jakarrrg

Description

@Jakarrrg

Would it be possible to load litertlm models like Gemma 4, which would run on my GPU (Google pixel 6 pro/GrapheneOS) and run much faster than gguf models?
I have used other apps like "https://github.com/jegly/Box" that can do it.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions