Contracts used in testing (e.g. Foundry tests, which are often named `Contract.t.sol`) are not filtered out correctly: https://github.com/muellerberndt/scabench/blob/b506cf0f36688b60222d3a72bb01544ced71d447/baseline-runner/baseline_runner.py#L258-L272
It seems that files are only recognized as tests if they have `test` in their names, so a file like `Contract.t.sol` is not skipped.
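A possible fix, sketched below, would be to also treat the Foundry `.t.sol` suffix and files under a `test`/`tests` directory as test code. This is only a sketch: `is_test_file` is a hypothetical helper name, not a function from `baseline_runner.py`, and the directory check is an assumption based on Foundry's default `test/` layout.

```python
from pathlib import PurePosixPath


def is_test_file(path: str) -> bool:
    """Hypothetical helper: guess whether a Solidity file is test code."""
    p = PurePosixPath(path)
    name = p.name.lower()

    # Foundry convention: test contracts are usually named Something.t.sol
    if name.endswith(".t.sol"):
        return True

    # The check the current filter appears to rely on: "test" in the file name
    if "test" in name:
        return True

    # Assumption: anything under a test/ or tests/ directory is test code
    return any(part.lower() in {"test", "tests"} for part in p.parts[:-1])
```

With this, `src/Contract.t.sol` and `test/Vault.sol` would be skipped, while `src/Vault.sol` would still be analyzed. Note that the plain substring check can also match non-test files such as `Attestation.sol`, so a stricter pattern may be preferable.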