[WIP] Reduce deps core #593

Blaizzy · 2025-11-19T22:16:14Z

Summary:
This PR removes the dependency on torch, torchvision, and transformers by porting the necessary processors directly into mlx-vlm. It also restructures pyproject.toml to support optional installations.

Changes:

Removed Dependencies: Core installation no longer requires Torch or Transformers.
New Extras: Added optional flags for [trainer], [server], and [audio].
Refactoring:
- Replaced mlx-audio with soundfile.
- Moved audio imports to be lazy-loaded within functions to avoid crashes for users without audio dependencies.
- Cleaned up redundant imports in utils.py.
Docs: Added installation instructions for optional dependencies to the README.

- Added new optional dependencies: `trainer` for dataset tooling and `server` for FastAPI support. - Updated `audio` dependency to include `soundfile`. - Enhanced README with a detailed table of optional dependencies and installation commands.

altaic · 2025-11-21T03:42:11Z

Sort of related, have you considered replacing py-opencv which pulls in a rather hefty set of deps (120+)? It looks like it's currently only used to load and resize the frames of videos.

Blaizzy added 2 commits November 18, 2025 20:46

revert requests

d3b6a08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[WIP] Reduce deps core #593

[WIP] Reduce deps core #593

Blaizzy commented Nov 19, 2025

Uh oh!

altaic commented Nov 21, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

[WIP] Reduce deps core #593

Are you sure you want to change the base?

[WIP] Reduce deps core #593

Conversation

Blaizzy commented Nov 19, 2025

Uh oh!

altaic commented Nov 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

altaic commented Nov 21, 2025 •

edited

Loading