Multi-node slurm training? #4448

Open
@JulioZhao97

Description

Hello, is there an example of multi-node training on Slurm? The multi-node training example you provide targets multiple standalone machines, which doesn't apply to Slurm, where communication between nodes is managed by the Slurm workload manager.
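
For reference, a minimal sketch of the glue such an example might need, assuming one task per GPU is launched with `srun` and that the training code follows the `RANK` / `WORLD_SIZE` / `MASTER_ADDR` environment convention used by `torch.distributed`'s `env://` initialization. The helper names `first_hostname` and `slurm_to_dist_env` and the default port 29500 are hypothetical, and nothing here is confirmed as this project's API.

```python
import os
import subprocess


def first_hostname(nodelist: str) -> str:
    """Expand a Slurm nodelist via `scontrol` and return the first host."""
    out = subprocess.run(
        ["scontrol", "show", "hostnames", nodelist],
        check=True,
        capture_output=True,
        text=True,
    ).stdout
    return out.split()[0]


def slurm_to_dist_env(env, master_addr=None, master_port=29500):
    """Map Slurm per-task variables to RANK/WORLD_SIZE-style variables.

    Assumption: the job was started with `srun`, so SLURM_PROCID,
    SLURM_NTASKS and SLURM_LOCALID are set for every task.
    """
    if master_addr is None:
        # Use the first node of the allocation as the rendezvous host.
        master_addr = first_hostname(env["SLURM_JOB_NODELIST"])
    return {
        "RANK": env["SLURM_PROCID"],          # global task index
        "WORLD_SIZE": env["SLURM_NTASKS"],    # total number of tasks
        "LOCAL_RANK": env["SLURM_LOCALID"],   # task index within its node
        "MASTER_ADDR": master_addr,
        "MASTER_PORT": str(master_port),      # hypothetical default port
    }


if __name__ == "__main__" and "SLURM_PROCID" in os.environ:
    # Export the mapped variables before the training code initializes.
    os.environ.update(slurm_to_dist_env(os.environ))
```

Running this at the top of the training entry point inside an `srun` step would let every task derive its own rank from Slurm rather than from a manually maintained host list.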

Metadata

    Labels

    enhancement: New feature or request
