Default number of jobs is determined at https://github.com/vchuravy/ParallelTestRunner.jl/blob/26ec17954570cc0269b6ade76d573e0200e76952/src/ParallelTestRunner.jl#L289-L294 and it's based on a combination of number of cores and available memory. However, GitHub-hosted Apple Silicon runners have only 3 cores and 7 GB of memory, resulting this heuristic to choose only one runner. We may want to tweak the heuristics, to allow at least 2 workers out-of-the-box on Apple Silicon (and hope nothing crashes badly).
Ref: EnzymeAD/Enzyme.jl#2662 (comment).